[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN106528275A - Processing method of data tasks and task scheduler - Google Patents

Processing method of data tasks and task scheduler Download PDF

Info

Publication number
CN106528275A
CN106528275A CN201510573492.8A CN201510573492A CN106528275A CN 106528275 A CN106528275 A CN 106528275A CN 201510573492 A CN201510573492 A CN 201510573492A CN 106528275 A CN106528275 A CN 106528275A
Authority
CN
China
Prior art keywords
task
transferred
processor
submitted
storage area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510573492.8A
Other languages
Chinese (zh)
Inventor
张潮贵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Netease Hangzhou Network Co Ltd
Original Assignee
Netease Hangzhou Network Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netease Hangzhou Network Co Ltd filed Critical Netease Hangzhou Network Co Ltd
Priority to CN201510573492.8A priority Critical patent/CN106528275A/en
Publication of CN106528275A publication Critical patent/CN106528275A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a processing method of data tasks. The method comprises the steps that a task to be executed is acquired from a task table, wherein multiple tasks and a dependency item of each task are recorded in the task table, and the tasks recorded in the task table can be acquired by multiple task schedulers; according to the dependency item of the acquired task, dependency information of the task is obtained; according to the dependency information, it is determined that when the current state conforms to execution conditions, the acquired task is submitted to a task processor for data processing. Accordingly, the invention furthermore discloses a task scheduler. By the adoption of the embodiment, the data tasks can be processed concurrently, and work efficiency can be improved.

Description

The processing method of data task and task dispatcher
Technical field
The present invention relates to field of computer technology, more particularly to a kind of processing method of data task and task tune Degree device.
Background technology
Currently, Internet enterprises increasingly pay attention to the collection of big data, such as network selling data, Yong Huhang For data etc..In the face of increasing data volume, most of enterprise selects Hadoop+Hive more ripe at present As distributed data processing framework.
By Hadoop+Hive distributed data processing frameworks, enterprise can easily process the number of more than one hundred million ranks According to.Generally data processing work flow process includes some steps, and each step has the indefinite data processing of quantity Task is constituted, and there may be dependence, it is also possible to independent between these data processing tasks.For There is the task of dependence, need to strictly observe sequencing, can otherwise cause data result mistake.Cause How this, manage daily data processing work flow process, data processing task is carried out efficiently, in an orderly manner, is A big difficult point and emphasis of the big data with.
In the face of problem above, current solution has two kinds:Automated serial perform, by Oozie.
In the technical scheme that automated serial is performed, the newly-built script file of data engineering Shi Shouxian, and In script file in turn, according to the data of each step of daily data processing work flow process under set sequential write Process task, system perform the task of data processing successively according to the script file of automated serial, complete daily Flow chart of data processing.However, as data processing task is probably directly independent, such as index A and finger The calculating of mark B has no association, and corresponding task can be executed in parallel, and in the task of data processing In the case of more than very, automated serial performs task, can substantially reduce operation operating efficiency.On the other hand, If one of data processing task error, the characteristics of operation due to script, whole handling process will skip mistake Miss and continue executing with, and be probably the presence of dependence between data processing task, it is thus possible to cause number Make a mistake according to the result for processing.
The document of a workflow defining in the technical scheme of Oozie, is introduced, is defined in this document The dependence of task, trend of task etc..Oozie can judge which currently has appoint according to this document Perform while business can be safe, which improves reliability compared to the technical scheme of automated serial. But, due to Oozie documents it is excessively complicated, in actual production environment, thousands of up to a hundred data processing tasks Workflow be it is extremely common, if now use Oozie, high document maintenance cost, work will be brought Make inefficiency.
The content of the invention
The embodiment of the present invention proposes a kind of processing method of data task and task dispatcher, can parallel processing number According to task, and improve operating efficiency.
The embodiment of the present invention provides a kind of processing method of data task, including:
A pending task is transferred from task list;Wherein, the task list record multiple tasks and The dependence item of each task, and the task described in the task list can be transferred by multiple tasks scheduler;
According to the dependence item of being transferred for task, the Dependency Specification of the task is obtained;
When judging that current state meets execution condition according to the Dependency Specification, being transferred for task is submitted to Task processor performs data processing.
Further, it is described to transfer a pending task from task list, specifically include:
A task is selected by task queue order from the task list;
According to the mark of the task, the gathering information of the task is inquired about from data storage area and letter is performed Breath;Wherein, the data storage area record has the gathering information of each task in the task list and performs letter Breath, the gathering information are used for recording whether the task is transferred by task dispatcher, the execution information It is carrying out and whether runs succeeded for records the task and whether being submitted to task processor;
When selected task is not transferred, or selected task is transferred but is not submitted to task Computing device, or selected task be submitted to task processor perform but perform failure when, adjust The task is taken, and the task is recorded in the data storage area transferred;
Transferred and be submitted to task processor when selected task and be carrying out, or it is selected Task when being transferred and being run succeeded, continue to select next by task queue order from the task list Individual task.
Yet further, Dependency Specification is included in the dependence item;It is then described according to being transferred for task Dependence item, obtain the Dependency Specification of the task, specially:
From the dependence item of being transferred for task, the Dependency Specification of the task is directly obtained;
Or, mark is judged comprising dependence automatically in the dependence item;It is then described according to appointing for being transferred The dependence item of business, obtains the Dependency Specification of the task, specially:
Mark is judged according to the automatic dependence in the dependence item of being transferred for task, is started to rely on automatically and is sentenced It is disconnected;
The concrete execution content of the task is scanned, the Dependency Specification of the task is therefrom obtained.
In one embodiment, the Startup time that the Dependency Specification is relied on by the task;
When then judging that current state meets execution condition according to the Dependency Specification, being transferred for task is submitted to Data processing is performed to task processor, is specifically included:
Judge Startup time that the task relied on whether earlier than current time;
If so, then judge that current state meets execution condition, and being transferred for task is submitted at the task Reason device performs data processing, and has been submitted at task in the transferred task of data storage area record Reason device is carrying out;
If it is not, then judging that current state does not meet execution condition, in the data storage area, record is transferred Task is not submitted to task processor execution.
In another embodiment, the mark of the task that the Dependency Specification is relied on by the task;
When then judging that current state meets execution condition according to the Dependency Specification, being transferred for task is submitted to Data processing is performed to task processor, is specifically included:
The mark of the task relied on by the task according to being transferred, inquires about from the data storage area and is relied on Task whether run succeeded by the task processor;
If so, then judge that current state meets execution condition, and being transferred for task is submitted at the task Reason device performs data processing, and has been submitted at task in the transferred task of data storage area record Reason device is carrying out;
If it is not, then judging that current state does not meet execution condition, in the data storage area, record is transferred Task is not submitted to task processor execution.
In further embodiment, Startup time that the Dependency Specification is relied on by transferring for task and The mark of being relied on for task;
When then judging that current state meets execution condition according to the Dependency Specification, being transferred for task is submitted to Data processing is performed to task processor, is specifically included:
Judge Startup time that the task relied on whether earlier than current time;
When the Startup time is earlier than current time, the mark of being relied on according to the task for task, from Inquire about whether relied on task is run succeeded by the task processor in the data storage area;
When the tasks carrying success for being relied on, judge that current state meets execution condition, by appointing for being transferred Business is submitted to the task processor and performs data processing, and in appointing that data storage area record is transferred Business has been submitted to task processor and has been carrying out;
When the Startup time is later than current time, or the Startup time is earlier than current time but is relied on Task judge that current state does not meet execution condition when not being performed successfully, then, in the data storage area The transferred task of record is not submitted to task processor execution.
Further, after submit to, also wrap Include:
Receive the task action result that the task processor is returned;
The task action result is stored in into data storage area;
Next pending task is transferred in continuation from the task list, until described in the task list Till task is performed success entirely.
Further, after the task action result that the task processor is returned is received, also include:
Judge whether the task runs succeeded according to the task action result;
If so, the tasks carrying success is then recorded in the data storage area;
If it is not, then record the tasks carrying in the data storage area failing.
Further, after recording the tasks carrying failure in the data storage area, also include:
Alarm mail is generated according to the task action result, and the alarm mail is sent to default is connect Receiving end mouth;
Or, adding one to the number of times of the tasks carrying failure, the number of times after Jia is more than alarm door limit value When, alarm mail is generated according to the task action result, and the alarm mail is sent to default is connect Receiving end mouth.
Correspondingly, the embodiment of the present invention also provides a kind of task dispatcher, including:
Task transfers module, for transferring a pending task from task list;Wherein, the task Token is loaded with the dependence item of multiple tasks and each task, and the task described in the task list can quilt Multiple tasks scheduler is transferred;
Rely on acquisition module, for the dependence item according to being transferred for task, obtain the task according to Bad information;With,
Task processing module, during for judging that current state meets execution condition according to the Dependency Specification, will Being transferred for task is submitted to task processor and performs data processing.
Further, the task is transferred module and is specifically included:
Task choosing unit, for selecting a task by task queue order from the task list;
First query unit, according to the mark of the task, inquires about the tune of the task from data storage area Win the confidence breath and execution information;Wherein, the data storage area record has the tune of each task in the task list Win the confidence breath and execution information, the gathering information is used for recording whether the task is transferred by task dispatcher, Whether the execution information is used for recording the task and whether has been submitted to task processor and be carrying out and Run succeeded;
Task transfers unit, and for not transferred when selected task, or selected task is adjusted Task processor execution is taken but be not submitted to, or selected task has been submitted to task processor and has held When row but execution failure, the task is transferred, and the task is recorded in the data storage area and transferred;
The task choosing unit, is additionally operable to when selected task is transferred and has been submitted at task Reason device is carrying out, or selected task is when being transferred and being run succeeded, and continues from the task list In by task queue order select next task.
Further, Dependency Specification is included in the dependence item, then the dependence acquisition module is specifically used In from the dependence item of being transferred for task, the Dependency Specification of the task is directly obtained;
Or, mark is judged comprising dependence automatically in the dependence item, then the dependence acquisition module, Specifically include:
Start from judging unit, for the dependence item according to being transferred for task in automatic dependence judge Mark, starts;With,
Scanning acquiring unit, for scanning the concrete execution content of the task, therefrom obtains the task Dependency Specification.
In one embodiment, the Startup time that the Dependency Specification is relied on by the task;It is then described Task processing module is specifically included:
Whether the first judging unit, for judging Startup time that the task relied on earlier than current time;
First task submits unit to, for when the Startup time relied on by the task is earlier than current time, Judge that current state meets execution condition, being transferred for task is submitted to into the task processor and performs data Process, and task processor has been submitted in the transferred task of data storage area record be carrying out; With,
First information recording unit, for when the Startup time relied on by the task is later than current time, Judge that current state does not meet execution condition, transferred task is recorded in the data storage area not submitted Perform to task processor.
In another embodiment, the mark of the task that the Dependency Specification is relied on by the task, then The task processing module is specifically included:
Second query unit, the mark of the task for being relied on according to being transferred for task, from the data Inquire about whether relied on task is run succeeded by the task processor in memory block;
Second task submits unit to, for when being relied on for task is run succeeded by the task processor, Judge that current state meets execution condition, being transferred for task is submitted to into the task processor and performs data Process, and task processor has been submitted in the transferred task of data storage area record be carrying out; With,
Second information recording unit, for when being relied on for task is not run succeeded by the task processor, Judge that current state does not meet execution condition, transferred task is recorded in the data storage area not submitted Perform to task processor.
In further embodiment, Startup time that the Dependency Specification is relied on by transferring for task and The mark of being relied on for task, then the task processing module specifically include:
Whether the second judging unit, for judging Startup time that the task relied on earlier than current time;
3rd query unit, for when the Startup time is earlier than current time, according to task institute according to Whether the mark of bad task, inquire about relied on task from the data storage area by the task Reason device runs succeeded;
3rd task submits unit to, for when the tasks carrying for being relied on is successful, judging that current state meets Being transferred for task is submitted to the task processor and performs data processing by execution condition, and in the number Task processor has been submitted to according to the transferred task of memory block record to be carrying out;With,
3rd information recording unit, for being later than current time when the Startup time, or during the startup When carving earlier than current time but being relied on for task is not performed successfully, judge that current state does not meet execution bar Part, records transferred task in the data storage area and is not submitted to task processor execution.
Further, the task dispatcher also includes:
Information receiving module, for receiving the task action result that the task processor is returned;With,
Information unloading module, for being stored in data storage area by the task action result;
The task transfers module, is additionally operable to after the task action result is stored in data storage area, Next pending task is transferred in continuation from the task list, until the task described in the task list Till being performed success entirely.
Further, the task dispatcher also includes:
Second judge module, for after the task action result that the task processor is returned is received, root Judge whether the task runs succeeded according to the task action result;
4th information logging modle, for when the tasks carrying is successful, then in the data storage area Record the tasks carrying success;With,
5th information logging modle, for when the tasks carrying fails, then in the data storage area Record the tasks carrying failure.
Further, the task dispatcher also includes:
Alarm module, after recording the tasks carrying failure in the data storage area, according to institute State task action result and generate alarm mail, and the alarm mail is sent to default receiving port;Or Person, after recording the tasks carrying failure in the data storage area, loses to the tasks carrying The number of times for losing adds one, when the number of times after Jia is more than alarm door limit value, is given birth to according to the task action result Into alarm mail, and the alarm mail is sent to default receiving port.
Implement the embodiment of the present invention, have the advantages that:
The processing method of data task provided in an embodiment of the present invention and task dispatcher, adjust can multiple tasks Degree device concurrently transfers pending task respectively from task list.Wherein, the task list records multiple The dependence item of task and each task, the addition of the dependence item are simple, can improve work Efficiency;And task dispatcher can parse transferred task from the dependence item of being transferred for task Dependence, when judging that current state meets execution condition according to the Dependency Specification, then will be transferred Task submit to task processor perform data processing so that task is performed in an orderly manner, it is ensured that data Process does not malfunction, and improves the operating efficiency of multiple tasks scheduler parallel processing task.
Description of the drawings
Fig. 1 is the schematic flow sheet of one embodiment of the processing method of the data task that the present invention is provided;
The flow process of one embodiment of the step of Fig. 2 is the processing method of the data task provided by Fig. 1 S1 is shown It is intended to;
Fig. 3 is the structural representation of the one embodiment for the task dispatcher that the present invention is provided;
Fig. 4 is the structural representation of one embodiment that the task of the task dispatcher that the present invention is provided transfers module Figure;
Fig. 5 is the structural representation of one embodiment of the dependence acquisition module of the task dispatcher that the present invention is provided Figure;
Fig. 6 is the structural representation of the embodiment one of the task processing module of the task dispatcher that the present invention is provided Figure;
Fig. 7 is the structural representation of the embodiment two of the task processing module of the task dispatcher that the present invention is provided Figure;
Fig. 8 is the structural representation of the embodiment three of the task processing module of the task dispatcher that the present invention is provided Figure;
Fig. 9 is the structure chart provided by the present invention for realizing one embodiment of the system of data task process;
Figure 10 is the schematic flow sheet of another embodiment of the processing method of the data task that the present invention is provided.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clearly Chu, it is fully described by, it is clear that described embodiment is only a part of embodiment of the invention, rather than Whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art are not making creation Property work under the premise of the every other embodiment that obtained, belong to the scope of protection of the invention.
The embodiment of the present invention provides a kind of processing method of data task, can make multiple tasks scheduler concurrently Transfer pending task respectively from task list, and can from task list each task dependence item The dependence that parsing is gone out on missions, performs task in an orderly manner according to the Dependency Specification, it is ensured that data processing is not Error, and improve the operating efficiency of multiple tasks scheduler parallel processing task.
For convenience of describing, below only by taking one of task dispatcher as an example, how task dispatcher is carried out The method of task scheduling is described in detail, and the method that other task dispatchers carry out task scheduling is also identical, The present invention is not repeated one by one.
Referring to Fig. 1, it is that the flow process of one embodiment of the processing method of the data task that the present invention is provided is illustrated Figure.The method can be performed by task dispatcher, including step S1 to step S3, specific as follows:
S1, transfers a pending task from task list;Wherein, the task list records multiple Business and the dependence item of each task, and the task described in the task list can be by multiple tasks scheduler Transfer.
In the present embodiment, engineer can flexibly arrange task dispatcher quantity according to the task amount of task list, So that multiple tasks scheduler can concurrently in process task table task.Wherein, multiple tasks scheduling Device can be with regular working, and upon actuation, being transferred in task list automatically for task is processed.Engineer can be with The quantity of task dispatcher and the time of regular working is configured as the case may be.
The task list is generally an executable automated serial script file, records multiple tasks order line And the dependence item of each task order row, engineer only need to directly add behind each task order row Plus the dependence item of the task order row, Dependency Specification of the dependence item comprising the task, you can The configuration of the dependence of each task of the task list is realized, it is simple, improve operating efficiency.
S2, according to the dependence item of being transferred for task, obtains the Dependency Specification of the task.
In the present embodiment, task dispatcher utilizes the matching process of " regular expression ", matching task Dependence item, therefrom obtains the Dependency Specification of task, and for example, regular expression is " s*##@{ } ", tool Body is implemented:Transferred task is read in, when a dependence mark " s*##@" is read, Then behind dependence mark " { } " bracket information for task Dependency Specification, task dispatcher is The Dependency Specification of the task can be obtained from " { } ".
S3, when judging that current state meets execution condition according to the Dependency Specification, being transferred for task is carried Give task processor and perform data processing.
In the present embodiment, the Dependency Specification is used for recording the dependence of task, and the dependence can be with For depending on the Startup time of time, i.e. appointed task, when the Startup time of task is earlier than current time, Task can just submit to task processor execution;The dependence can also be the task that depends on, i.e., this The execution of business need to rely on other tasks, and when being relied on for task is performed successfully, this task is Task processor can be submitted to perform.
Further, it is as shown in Fig. 2 in above-mentioned steps S1, described to transfer one from task list and wait to hold Capable task, specifically includes step S101 to S104, as follows:
S101, selects a task by task queue order from the task list.
Specifically, in one embodiment, the task queue order is:One task then another Order of the task order into team;I.e. task dispatcher can be seriatim selected in order from the beginning of task queue Task.In another embodiment, each task of the task list also includes priority, then described Task queue order be:The priority of each task order from high to low, the task of same levels press one Task then order of another task order into team;That is, task dispatcher can according in task list each The priority orders of task seriatim select task.In the middle of being embodied as, engineer can be according to concrete feelings Condition arranges the order of each pending task in task list.
S102, according to the mark of the task, inquire about from data storage area the task gathering information and Execution information;Wherein, data storage area record have each task in the task list gathering information and Execution information, the gathering information are used for recording whether the task is transferred by task dispatcher, described to hold Whether row information is used for recording the task and whether being submitted to task processor being carrying out and performing into Work(.
In the present embodiment, engineer configures a mark for each task, and the mark can be the task Specific instructions row, or the task simple name.And the tune of each task that data storage area is recorded Win the confidence breath and execution information, the mark associated record with corresponding task.Task dispatcher only need to be deposited in data Storage area searches the mark of the task, you can inquires about gathering information and the execution information of the task, knows the task State.Wherein, each task there may be following five kinds of states:
First, the task is not transferred by task dispatcher, task processor execution is not submitted to yet;
Second, the task is transferred by task dispatcher, but due to not meeting the reasons such as execution condition, and not Submit to task processor execution;
Third, the task is transferred by task dispatcher, and task processor it has been submitted to and has been carrying out;
Fourth, the task is transferred by task dispatcher, and has been submitted to task processor execution, and hold Go successfully;
Fifth, the task is transferred by task dispatcher, and has been submitted to task processor execution, but hold Row failure.
Correspondingly, in the gathering information and execution information of the task recorded by data storage area, can adopt Identified with specifically transferring mark and performing to distinguish five kinds of states of the above.
In the present embodiment, the gathering information of each task for being recorded due to data storage area and execution information, Mark associated record with corresponding task, therefore, after the task of task list is interrupted process, or task When scheduler is again started up, according to the mark of each task in the task list and the state for whether running succeeded Information, skips the successful task of executed, only need to from the task list be not performed successful task from the beginning of, A task is selected to continue with by task queue order, without the 1st from second task list Item task starts to select a task to be processed.
Further, since in the middle of being embodied as, task list regularly can be adjusted, may produce much not Same task list, and the task list transferred by task dispatcher is it is also possible to be interrupted process.In order that task Scheduler after breakpoint is restarted can be transferred the task list being interrupted exactly, and improve treatment effeciency, The embodiment of the present invention can also be further introduced into " time marking ", by time marking come logger task scheduler The task list transferred.
Preferably, the present embodiment can using task dispatcher start date as time marking, or according to After specific algorithm was calculated to the date, time marking is obtained;Or engineer is input into a date, And the date is calculated according to another specific algorithm, obtain time marking.For example, at first day, Multiple tasks scheduler starts, and prepares to process the task in first task table, is then generated according to the date on the same day The very first time identifies, and the very first time mark is associated with the mark of all tasks in first task table, and Record is in data storage area.At second day, multiple tasks scheduler started, and prepared to process the second task list In task, then the second time marking is generated according to the date on the same day, and by second time marking and second The mark association of all tasks in task list, and record in data storage area.Assume at second day, when the After front 100 tasks in two task lists are performed successfully, occur to interrupt.When task dispatcher is again started up When, the second time marking is generated according to the date that the generation of engineer's input is interrupted, and from data storage area The execution state of all tasks that inquiry is associated with the second time marking, knows front 100 in the second task list Item task is performed successfully, then the task dispatcher can skip front 100 tasks, only need to be from described second The 101st task in business table starts, and selects a task to continue with by task queue order, without To start to select from the 1st in second task list task, therefore the embodiment of the present invention not only can be Breakpoint transfers the task list being interrupted after restarting exactly, and can improve treatment effeciency.
S103, when selected task is not transferred, or selected task is transferred but not submitted Perform to task processor, or selected task has been submitted to task processor and has performed but perform failure When, the task is transferred, and the task is recorded in the data storage area transferred.
S104, is transferred and has been submitted to task processor when selected task and be carrying out, or When selected task is transferred and run succeeded, continue to select by task queue order from the task list Select next task.
In the present invention, the acquisition methods of Dependency Specification have two kinds, as follows:
In one embodiment, Dependency Specification is included in the dependence item;It is then described according to being transferred Task dependence item, obtain the Dependency Specification of the task, specially:From being transferred for task In dependence item, the Dependency Specification of the task is directly obtained.
Specifically, the dependence item of the task is " s*##@{ } ", for relying on letter inside " { } " bracket Breath, for example, Dependency Specification is the Startup time { ' time ' relied on by task:${dt}2015-8-16 08:00:00 }, The mark of the Startup time that time is relied on by task, $ { dt } time 2015-8-16 08 below:00:00 is to appoint The concrete Startup time relied on by business;Dependency Specification is the mark { ' rely ' of the task relied on by task:' taskA ' }, For indicating that the task is to rely on running succeeded for task taskA, rely is indicated for the task and depends on The mark for running succeeded of task taskA, the task that taskA is relied on by task are specifically identified;Rely on letter Breath is the simple name { ' name ' of task:' taskB ' }, name is the mark for the simple name of task configuration, and taskB is For the simple name of the task configuration, the simple name of the task and the mark associated record data storage area of the task.With Upper described mark is can be, but not limited to as time, rely, name.
In another embodiment, mark is judged comprising dependence automatically in the dependence item;It is then described According to the dependence item of being transferred for task, the Dependency Specification of the task is obtained, specially:According to institute Automatic dependence in the dependence item of transferring for task judges mark, starts dependence automatically and judges;Scanning institute The concrete execution content of task is stated, the Dependency Specification of the task is therefrom obtained.
Specifically, the dependence item of the task is " s*##@{ } ", inside " { } " bracket be automatically according to Rely and judge mark, the automatic dependence judges that mark is can be, but not limited to as ' autorely ':' 1 ', work as task scheduling When the automatic dependence judgement mark of task is arrived in device scanning, start dependence judgement automatically, according to the mark of the task, Jump in the file of concrete execution content of the task, scanning is concrete to perform content, is used for referring to when scanning to When showing the keyword of Dependency Specification of task, you can obtain Dependency Specification, the keyword can be according to concrete Situation is arranged by engineer, need to only be sought unity of standard.In the present embodiment, due to the concrete execution of task Content is built based on sql like language, thus this it is concrete perform content in building process comprising the task according to Bad information, and after the Dependency Specification of the task follows FROM or JOIN closely, so as to work as task dispatcher When scanning the keyword FROM or JOIN of concrete execution content, task dispatcher is obtained immediately following FROM Or information after JOIN, you can obtain the Dependency Specification of the task.In addition, in the task phase of the task list In the case of many, it is only necessary to directly replicate behind each task of task list and paste comprising dependence automatically The mark of judgement dependence item " s*##@{ ' autorely ':' 1 ' } ", you can realize matching somebody with somebody for dependence item Put, configure simple, and improve operating efficiency, reduce the cost of time and manpower.
In the present invention, Dependency Specification can be to depend on Startup time, or for depending on other tasks, Or both combination, illustrate separately below, to above-mentioned steps S3 in, how to judge to work as according to Dependency Specification Whether front state meets execution condition, performs at data so as to being transferred for task is submitted to task processor The method of reason is described in detail.
In one embodiment, the Startup time that the Dependency Specification is relied on by the task;It is then above-mentioned In step S3, when judging that current state meets execution condition according to the Dependency Specification, by being transferred for task Submit to task processor and perform data processing, specifically include:
Judge Startup time that the task relied on whether earlier than current time;
If so, then judge that current state meets execution condition, and being transferred for task is submitted at the task Reason device performs data processing, and has been submitted at task in the transferred task of data storage area record Reason device is carrying out;
If it is not, then judging that current state does not meet execution condition, in the data storage area, record is transferred Task is not submitted to task processor execution.For example, the task for being recorded in the data storage area In execution information, a specific character is write, or makes certain position in the execution information be sky, remembered with this Record the task and be not submitted to task processor execution.
It should be noted that when judging that current state does not meet execution condition, task dispatcher can also be skipped The task, and continue to select next task by task queue order from the task list, that is, return step Rapid S1 continues to transfer next pending task from task list, and all tasks in task list are all held Till row success.
In the present embodiment, the Startup time relied on by the task, for example, { ' time ':${dt}2015-8-16 08:00:00 }, wherein, the mark of ' time ' for Startup time, specific Startup time is exactly 2015-8-16 08:00:00, as specific Startup time 2015-8-16 08:00:00 earlier than current time, or Startup time 2015-8-16 08:00:00 it is identical with current time when, judge current state meet execution condition, will be transferred Task submit to the task processor and perform data processing.
In another embodiment, the mark of the task that the Dependency Specification is relied on by the task;Then In above-mentioned steps S3, when judging that current state meets execution condition according to the Dependency Specification, by what is transferred Task is submitted to task processor and performs data processing, specifically includes:
The mark of the task relied on by the task according to being transferred, inquires about from the data storage area and is relied on Task whether run succeeded by the task processor;
If so, then judge that current state meets execution condition, and being transferred for task is submitted at the task Reason device performs data processing, and has been submitted at task in the transferred task of data storage area record Reason device is carrying out;
If it is not, then judging that current state does not meet execution condition, in the data storage area, record is transferred Task is not submitted to task processor execution.
It should be noted that when judging that current state does not meet execution condition, task dispatcher can also be skipped The task, and continue to select next task by task queue order from the task list, that is, return step Rapid S1 continues to transfer next pending task from task list, and all tasks in task list are all held Till row success.
In the present embodiment, the mark of the task relied on by task taskC, can be { ' rely ':' taskA ' }, Wherein rely is used for indicating that task taskC depends on running succeeded for task taskA that taskA to be task taskC The task of being relied on is specifically identified;Task that task dispatcher is relied on according to task taskC is specifically identified TaskA, inquires about relied on task taskA from the data storage area whether by the task processor Run succeeded, when taskA is performed successfully, task taskC is submitted to into task processor and performs data Process.The task relied on by the task can be multiple, when the equal executed success of multiple dependence task, institute State task and can just submit to task processor execution data processing.
In further embodiment, Startup time that the Dependency Specification is relied on by transferring for task and The mark of being relied on for task;Then in above-mentioned steps S3, judge that current state meets according to the Dependency Specification During execution condition, being transferred for task is submitted to into task processor and performs data processing, specifically included:
Judge Startup time that the task relied on whether earlier than current time;
When the Startup time is earlier than current time, the mark of being relied on according to the task for task, from Inquire about whether relied on task is run succeeded by the task processor in the data storage area;
When the tasks carrying success for being relied on, judge that current state meets execution condition, by appointing for being transferred Business is submitted to the task processor and performs data processing, and in appointing that data storage area record is transferred Business has been submitted to task processor and has been carrying out;
When the Startup time is later than current time, or the Startup time is earlier than current time but is relied on Task judge that current state does not meet execution condition when not being performed successfully, then, in the data storage area The transferred task of record is not submitted to task processor execution.
It should be noted that when the Startup time is later than current time, or the Startup time is earlier than working as Front moment but when being relied on for task is not performed successfully, judge that current state does not meet execution condition, task Scheduler can also skip the task, and continue to select next by task queue order from the task list Task, i.e. return to step S1 continue to transfer next pending task from task list, in task list All tasks be carried out success till.
In the present embodiment, task taskC is relied on Startup time and the mark of being relied on for task, Can be { ' time ':${dt}2015-8-16 08:00:00}、{‘rely’:‘taskA’};Or be {‘time’:${dt}2015-8-16 08:00:00,‘rely’:' taskA ' }, then when specific Startup time 08:00:00 is early In or be equal to current time, and the task taskA executed for being relied on is when completing, just can be by task taskC Submit to task processor and perform data processing.
Further, after submit to, also wrap Include:
Receive the task action result that the task processor is returned;
The task action result is stored in into data storage area;
Next pending task is transferred in continuation from the task list, until described in the task list Till task is performed success entirely.
In the present embodiment, task dispatcher can be transferred next after task action result is received, Individual pending task, to avoid the task dispatcher from producing conflict in process task.In addition, described Business implementing result can be, but not limited to include that the beginning of tasks carrying is taken, appointed with end time, tasks carrying Business run succeeded or failure information and the reason for perform failure, it is of the invention that task action result is stored in Data storage area, can facilitate the problem in later stage to investigate and optimize, and for example, engineer is from data storage area It was found that the tasks carrying of certain task is time-consuming very high always, it may be considered that further optimize this task.
Further, after the task action result that the task processor is returned is received, also include:
Judge whether the task runs succeeded according to the task action result;
If so, the tasks carrying success is then recorded in the data storage area;
If it is not, then record the tasks carrying in the data storage area failing.
In the present embodiment, the letter of all about task of the task processor recorded in data storage area Breath, all the mark associated storage with the task, in order to task processor according to the mark of the task can be Can the quick search task information.
Further, after recording the tasks carrying failure in the data storage area, also include:
Alarm mail is generated according to the task action result, and the alarm mail is sent to default is connect Receiving end mouth;
Or, adding one to the number of times of the tasks carrying failure, the number of times after Jia is more than alarm door limit value When, alarm mail is generated according to the task action result, and the alarm mail is sent to default is connect Receiving end mouth.
In the present embodiment, it may be preferable that add one using the number of times to the tasks carrying failure, after Jia one Number of times when being more than alarm door limit value, alarm mail is generated according to the task action result, and by the report Alert mail is sent to this alarm method of default receiving port, can further be improved operating efficiency and be appointed The reliability that business is performed.
The processing method of data task provided in an embodiment of the present invention, can be such that multiple tasks scheduler concurrently divides Pending task is not transferred from task list.Wherein, the task list records multiple tasks and each is appointed The dependence item of business, the addition of the dependence item are simple, can improve operating efficiency;And task Scheduler can parse the dependence of transferred task from the dependence item of being transferred for task, When judging that current state meets execution condition according to the Dependency Specification, then being transferred for task is submitted to Task processor performs data processing, so that task is performed in an orderly manner, it is ensured that data processing does not malfunction, And improve the operating efficiency of multiple tasks scheduler parallel processing task.
The system that the embodiment of the present invention also provides a kind of task dispatcher and realizes data task process, Neng Goushi All flow processs of the processing method of above-mentioned data task are applied, is described in detail with reference to Fig. 3 to Figure 10.
Referring to Fig. 3, it is the structural representation of task dispatcher one embodiment that the present invention is provided.
As shown in figure 3, the embodiment of the present invention also provides a kind of task dispatcher, including task transfer module 10, Acquisition module 20 and task processing module 30 is relied on, wherein:
Task transfers module 10, for transferring a pending task from task list;Wherein, described Business token is loaded with the dependence item of multiple tasks and each task, and the task described in the task list can Transferred by multiple tasks scheduler;
Acquisition module 20 is relied on, for the dependence item according to being transferred for task, the task is obtained Dependency Specification;
Task processing module 30, during for judging that current state meets execution condition according to the Dependency Specification, Being transferred for task is submitted to into task processor and performs data processing.
Further, as shown in figure 4, which is the task of the task dispatcher that the present invention is provided transfers module The structural representation of one embodiment, the task transfer module 10, specifically include task choosing unit 11, First query unit 12 and task transfer unit 13, wherein,
Task choosing unit 11, for selecting a task by task queue order from the task list;
First query unit 12, according to the mark of the task, inquires about the task from data storage area Gathering information and execution information;Wherein, data storage area record has each task in the task list Gathering information and execution information, the gathering information are used for recording whether the task is adjusted by task dispatcher Take, the execution information be used for record the task whether be submitted to task processor be carrying out and Whether run succeeded;
Task transfers unit 13, for not transferred when selected task, or selected task by Task processor execution is transferred but be not submitted to, or selected task has been submitted to task processor When performing but performing failure, the task is transferred, and the task is recorded in the data storage area and adjusted Take;
The task choosing unit 11, is additionally operable to when selected task is transferred and has been submitted to task Processor is carrying out, or selected task is when being transferred and being run succeeded, and continues from the task Next task is selected by task queue order in table.
Further, Dependency Specification is included in the dependence item, then the dependence acquisition module 20 is concrete, For, from the dependence item of being transferred for task, directly obtaining the Dependency Specification of the task;
Or, comprising judgement mark is relied on automatically in the dependence item, then as shown in figure 5, which is this The structural representation of one embodiment of the dependence acquisition module of the task dispatcher that invention is provided, the dependence Acquisition module 20, specifically includes startup from judging unit 22 and scans acquiring unit 23, wherein,
Start from judging unit 22, for the dependence item according to being transferred for task in automatic dependence sentence Disconnected mark, starts dependence automatically and judges;
Scanning acquiring unit 23, for scanning the concrete execution content of the task, therefrom obtains the task Dependency Specification.
In one embodiment, the Startup time that the Dependency Specification is relied on by the task, then, such as Shown in Fig. 6, which is the structure of the embodiment one of the task processing module of the task dispatcher that the present invention is provided Schematic diagram, the task processing module 30 specifically include the first judging unit 311, first task and submit single to Unit 312 and first information recording unit 313, wherein,
First judging unit 311, for judge Startup time that the task relied on whether earlier than it is current when Carve;
First task submits unit 312 to, for the Startup time that relied on when the task earlier than current time When, judge that current state meets execution condition, being transferred for task is submitted to into the task processor and is performed Data processing, and task processor is being submitted to in the transferred task of data storage area record Perform;With;
First information recording unit 313, the Startup time for being relied on when the task are later than current time When, judge that current state does not meet execution condition, the data storage area record transferred task not by Submit to task processor execution.
In another embodiment, the mark of the task that the Dependency Specification is relied on by the task, then, As shown in fig. 7, which is the knot of the embodiment two of the task processing module of the task dispatcher that the present invention is provided Structure schematic diagram, the task processing module 30 specifically include the second query unit 321, the second task and submit to Unit 322 and the second information recording unit 323, wherein,
Second query unit 321, the mark of the task for being relied on according to being transferred for task, from the number Whether run succeeded by the task processor according to the relied on task of inquiry in memory block;
Second task submits unit 322 to, is run succeeded by the task processor for working as relied on task When, judge that current state meets execution condition, being transferred for task is submitted to into the task processor and is performed Data processing, and task processor is being submitted to in the transferred task of data storage area record Perform;
Second information recording unit 323, is not run succeeded by the task processor for working as relied on task When, judge that current state does not meet execution condition, the data storage area record transferred task not by Submit to task processor execution.
In further embodiment, Startup time that the Dependency Specification is relied on by transferring for task and The mark of being relied on for task, then, as shown in figure 8, which is the task of the task dispatcher that the present invention is provided The structural representation of the embodiment three of processing module, the task processing module 30 specifically include second and sentence Disconnected unit 331, the 3rd query unit 332, the 3rd task submit unit 333 and the 3rd information recording unit 334 to, Wherein,
Second judging unit 331, for judge Startup time that the task relied on whether earlier than it is current when Carve;
3rd query unit 332, for when the Startup time is earlier than current time, then according to the task The mark of being relied on for task, inquires about relied on task from the data storage area whether by described Business computing device success;
3rd task submits unit 333 to, for when the tasks carrying for being relied on is successful, judging current state symbol Execution condition is closed, being transferred for task is submitted to into the task processor and is performed data processing, and described Record transferred task in data storage area has been submitted to task processor and has been carrying out;
3rd information recording unit 334, for being later than current time, or the startup when the Startup time Moment, earlier than current time but when the task of being relied on is not performed successfully, judges that current state does not meet execution Condition, records transferred task in the data storage area and is not submitted to task processor execution.
Further, the task dispatcher, also including information receiving module 40 and information unloading module 50, Wherein,
Information receiving module 40, for receiving the task action result that the task processor is returned;
Information unloading module 50, for being stored in data storage area by the task action result;
The task transfers module 10, be additionally operable to by the task action result be stored in data storage area it Afterwards, continue to transfer next pending task from the task list, until described in the task list Till task is performed success entirely.
Further, the task dispatcher also includes the second judge module 60, the 4th information logging modle 70 With the 5th information logging modle 80, wherein,
Second judge module 60, for after the task action result that the task processor is returned is received, Judge whether the task runs succeeded according to the task action result;
4th information logging modle 70, for when the tasks carrying is successful, then in the data storage area In record the tasks carrying success;
5th information logging modle 80, for when the tasks carrying fails, then in the data storage area In record tasks carrying failure.
Further, the task dispatcher also includes:
Alarm module 90, after recording the tasks carrying failure in the data storage area, according to The task action result generates alarm mail, and the alarm mail is sent to default receiving port; Or, after recording the tasks carrying failure in the data storage area, to the tasks carrying The number of times of failure adds one, when the number of times after Jia is more than alarm door limit value, according to the task action result Alarm mail is generated, and the alarm mail is sent to default receiving port.
Below in conjunction with Fig. 9 and Figure 10, to realizing that the system and method that data task is processed is illustrated.
Referring to Fig. 9, it is the knot provided by the present invention for realizing one embodiment of the system of data task process Composition.1 random start of multiple tasks scheduler, reads in same task list, and wherein, the task list is recorded There is the dependence item of multiple tasks and each task, multiple tasks scheduler 1 is parallel to the task list Task is processed successively in an orderly manner, the dependence respectively according to transferred task of multiple tasks scheduler 1 Item obtains the Dependency Specification of transferred task, and judges to be transferred according to the Dependency Specification of transferred task respectively The current state of task, wherein, when the mark of the task relied on comprising task by the Dependency Specification of transferred task During knowledge, then the mark of task relied on according to task is needed, relied on task is inquired about to data storage area 2 Execution state, when the current state of transferred task meets execution condition, respectively by being transferred for task Submit to task processor 3 and perform data processing, otherwise, skip this task, continue to transfer next task. In addition, the implementing result that returns task processor 3 respectively of multiple tasks scheduler 1 and transferred task Mark association is stored in data storage area 2, and whether judges transferred task according to the implementing result for returning respectively Run succeeded, if performing failure, generate alarm mail, and alarm mail is sent to into default receiving terminal Mouth 4.
Referring to Figure 10, it is that the flow process of another embodiment of the processing method of the data task that the present invention is provided is shown It is intended to.The embodiment of the present invention provides a kind of processing method of more specifically data task, including step S202 It is to S214, specific as follows:
Step S202:Task T is selected from task list;
Step S203:Judge whether task T can be transferred;I.e. in data storage area, whether query task T Transferred by task dispatcher, if be submitted to task processor and be carrying out and whether run succeeded;
If so, i.e. when task T is not transferred by task dispatcher, or task T is transferred by task dispatcher But it is not submitted to task processor execution, or task T is when being transferred but is performed unsuccessfully by task dispatcher, Execution step S204;
If it is not, i.e. when task T is transferred and run succeeded by task dispatcher, or task T is adjusted by task When degree device is transferred but has been submitted to task processor and is carrying out, return to step S202 continues from described Next task T+1 is selected by task queue order in business table;
Step S204:Judgement task relies on item and whether there is automatic decision mark;
If so, execution step S205;If it is not, direct execution step S206;
Step S205:Scanning is concrete to perform content, obtains Dependency Specification;
Step S206:Judge whether Startup time;
If so, execution step S207;If it is not, execution step S208;
Step S207:Judge Startup time whether earlier than current time;
If so, execution step S208;If it is not, return to step S202, continues to press task from the task list Queue sequence selects next task T+1;
Step S208:Judge whether the mark of relied on task;
If so, execution step S209;If it is not, execution step S210;
Step S209:Whether the relied on task of inquiry runs succeeded;
If so, execution step S210;If it is not, return to step S202, continues to press task from the task list Queue sequence selects next task T+1;
Step S210:Task T is submitted to into task processor and performs data processing
Step S211:Receive the implementing result for returning;
Step S212:Preserve the implementing result for returning
Step S213:Judge whether task runs succeeded;
If so, return to step S202, continues to select next appointing by task queue order from the task list Business T+1;If it is not, execution step S214.
Step S214:Alert process;After alert process, return to step S202 continues from the task list Next task is selected by task queue order.
Implement the embodiment of the present invention, have the advantages that:
Task dispatcher provided in an embodiment of the present invention, can be with other task dispatchers concurrently from task list In transfer pending task, wherein, the task list records the dependence of multiple tasks and each task and closes Set entry, task dispatcher can parse what transferred task was relied on from the dependence item of transferred task According to the Dependency Specification, relation, when judging that current state meets execution condition, then by being transferred for task Submit to task processor and perform data processing, so that task is performed in an orderly manner, it is ensured that data processing is not Error, and improve the operating efficiency of multiple tasks scheduler parallel processing task.
One of ordinary skill in the art will appreciate that all or part of flow process in realizing above-described embodiment method, Computer program be can be by instruct the hardware of correlation to complete, described program can be stored in a calculating In machine read/write memory medium, the program is upon execution, it may include such as the flow process of the embodiment of above-mentioned each method. Wherein, described storage medium can for magnetic disc, CD, read-only memory (Read-Only Memory, ) or random access memory (Random Access Memory, RAM) etc. ROM.
The above is the preferred embodiment of the present invention, it is noted that for the common skill of the art For art personnel, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, this A little improvements and modifications are also considered as protection scope of the present invention.

Claims (18)

1. a kind of processing method of data task, it is characterised in that include:
A pending task is transferred from task list;Wherein, the task list record multiple tasks and The dependence item of each task, and the task described in the task list can be transferred by multiple tasks scheduler;
According to the dependence item of being transferred for task, the Dependency Specification of the task is obtained;
When judging that current state meets execution condition according to the Dependency Specification, being transferred for task is submitted to Task processor performs data processing.
2. the processing method of data task as claimed in claim 1, it is characterised in that described from task list In transfer a pending task, specifically include:
A task is selected by task queue order from the task list;
According to the mark of the task, the gathering information of the task is inquired about from data storage area and letter is performed Breath;Wherein, the data storage area record has the gathering information of each task in the task list and performs letter Breath, the gathering information are used for recording whether the task is transferred by task dispatcher, the execution information It is carrying out and whether runs succeeded for records the task and whether being submitted to task processor;
When selected task is not transferred, or selected task is transferred but is not submitted to task Computing device, or selected task be submitted to task processor perform but perform failure when, adjust The task is taken, and the task is recorded in the data storage area transferred;
Transferred and be submitted to task processor when selected task and be carrying out, or it is selected Task when being transferred and being run succeeded, continue to select next by task queue order from the task list Individual task.
3. the processing method of data task as claimed in claim 1, it is characterised in that the dependence Dependency Specification is included in;Then the dependence item according to being transferred for task, obtains the task Dependency Specification, specially:
From the dependence item of being transferred for task, the Dependency Specification of the task is directly obtained;
Or, mark is judged comprising dependence automatically in the dependence item;It is then described according to appointing for being transferred The dependence item of business, obtains the Dependency Specification of the task, specially:
Mark is judged according to the automatic dependence in the dependence item of being transferred for task, is started to rely on automatically and is sentenced It is disconnected;
The concrete execution content of the task is scanned, the Dependency Specification of the task is therefrom obtained.
4. the processing method of data task as claimed in claim 1, it is characterised in that the Dependency Specification The Startup time relied on by the task;
Then it is described according to the Dependency Specification judge current state meet execution condition when, by being transferred for task Submit to task processor and perform data processing, specifically include:
Judge Startup time that the task relied on whether earlier than current time;
If so, then judge that current state meets execution condition, and being transferred for task is submitted at the task Reason device performs data processing, and has been submitted at task in the transferred task of data storage area record Reason device is carrying out;
If it is not, then judging that current state does not meet execution condition, in the data storage area, record is transferred Task is not submitted to task processor execution.
5. the processing method of data task as claimed in claim 1, it is characterised in that the Dependency Specification The mark of being relied on by the task for task;
Then it is described according to the Dependency Specification judge current state meet execution condition when, by being transferred for task Submit to task processor and perform data processing, specifically include:
The mark of the task relied on by the task according to being transferred, inquires about from the data storage area and is relied on Task whether run succeeded by the task processor;
If so, then judge that current state meets execution condition, and being transferred for task is submitted at the task Reason device performs data processing, and has been submitted at task in the transferred task of data storage area record Reason device is carrying out;
If it is not, then judging that current state does not meet execution condition, in the data storage area, record is transferred Task is not submitted to task processor execution.
6. the processing method of data task as claimed in claim 1, it is characterised in that the Dependency Specification The mark of the Startup time relied on by transferring for task and being relied on for task;
Then it is described according to the Dependency Specification judge current state meet execution condition when, by being transferred for task Submit to task processor and perform data processing, specifically include:
Judge Startup time that the task relied on whether earlier than current time;
When the Startup time is earlier than current time, the mark of being relied on according to the task for task, from Inquire about whether relied on task is run succeeded by the task processor in the data storage area;
When the tasks carrying success for being relied on, judge that current state meets execution condition, by appointing for being transferred Business is submitted to the task processor and performs data processing, and in appointing that data storage area record is transferred Business has been submitted to task processor and has been carrying out;
When the Startup time is later than current time, or the Startup time is earlier than current time but is relied on Task judge that current state does not meet execution condition when not being performed successfully, then, in the data storage area The transferred task of record is not submitted to task processor execution.
7. the processing method of the data task as described in any one of claim 1 to 6, it is characterised in that After being transferred for task is submitted to task processor execution data processing, also include:
Receive the task action result that the task processor is returned;
The task action result is stored in into data storage area;
Next pending task is transferred in continuation from the task list, until described in the task list Till task is performed success entirely.
8. the processing method of data task as claimed in claim 7, it is characterised in that receiving described appointing After the task action result that business processor is returned, also include:
Judge whether the task runs succeeded according to the task action result;
If so, the tasks carrying success is then recorded in the data storage area;
If it is not, then record the tasks carrying in the data storage area failing.
9. the processing method of data task as claimed in claim 8, it is characterised in that deposit in the data After recording the tasks carrying failure in storage area, also include:
Alarm mail is generated according to the task action result, and the alarm mail is sent to default is connect Receiving end mouth;
Or, adding one to the number of times of the tasks carrying failure, the number of times after Jia is more than alarm door limit value When, alarm mail is generated according to the task action result, and the alarm mail is sent to default is connect Receiving end mouth.
10. a kind of task dispatcher, it is characterised in that include:
Task transfers module, for transferring a pending task from task list;Wherein, the task Token is loaded with the dependence item of multiple tasks and each task, and the task described in the task list can quilt Multiple tasks scheduler is transferred;
Rely on acquisition module, for the dependence item according to being transferred for task, obtain the task according to Bad information;With,
Task processing module, during for judging that current state meets execution condition according to the Dependency Specification, will Being transferred for task is submitted to task processor and performs data processing.
11. task dispatchers as claimed in claim 10, it is characterised in that the task transfers module tool Body includes:
Task choosing unit, for selecting a task by task queue order from the task list;
First query unit, according to the mark of the task, inquires about the tune of the task from data storage area Win the confidence breath and execution information;Wherein, the data storage area record has the tune of each task in the task list Win the confidence breath and execution information, the gathering information is used for recording whether the task is transferred by task dispatcher, Whether the execution information is used for recording the task and whether has been submitted to task processor and be carrying out and Run succeeded;
Task transfers unit, and for not transferred when selected task, or selected task is adjusted Task processor execution is taken but be not submitted to, or selected task has been submitted to task processor and has held When row but execution failure, the task is transferred, and the task is recorded in the data storage area and transferred;
The task choosing unit, is additionally operable to when selected task is transferred and has been submitted at task Reason device is carrying out, or selected task is when being transferred and being run succeeded, and continues from the task list In by task queue order select next task.
12. task dispatchers as claimed in claim 10, it is characterised in that wrap in the dependence item Containing Dependency Specification, then the dependence acquisition module is specifically for from the dependence item of being transferred for task, Directly obtain the Dependency Specification of the task;
Or, mark is judged comprising dependence automatically in the dependence item, then the dependence acquisition module tool Body includes:
Start from judging unit, for the dependence item according to being transferred for task in automatic dependence judge Mark, starts;With,
Scanning acquiring unit, for scanning the concrete execution content of the task, therefrom obtains the task Dependency Specification.
13. task dispatchers as claimed in claim 10, it is characterised in that the Dependency Specification is described The Startup time relied on by task;
Then the task processing module is specifically included:
Whether the first judging unit, for judging Startup time that the task relied on earlier than current time;
First task submits unit to, for when the Startup time relied on by the task is earlier than current time, Judge that current state meets execution condition, being transferred for task is submitted to into the task processor and performs data Process, and task processor has been submitted in the transferred task of data storage area record be carrying out; With,
First information recording unit, for when the Startup time relied on by the task is later than current time, Judge that current state does not meet execution condition, transferred task is recorded in the data storage area not submitted Perform to task processor.
14. task dispatchers as claimed in claim 10, it is characterised in that the Dependency Specification is described The mark of the task relied on by task;
Then the task processing module is specifically included:
Second query unit, the mark of the task for being relied on according to being transferred for task, from the data Inquire about whether relied on task is run succeeded by the task processor in memory block;
Second task submits unit to, for when being relied on for task is run succeeded by the task processor, Judge that current state meets execution condition, being transferred for task is submitted to into the task processor and performs data Process, and task processor has been submitted in the transferred task of data storage area record be carrying out; With,
Second information recording unit, for when being relied on for task is not run succeeded by the task processor, Judge that current state does not meet execution condition, transferred task is recorded in the data storage area not submitted Perform to task processor.
15. task dispatchers as claimed in claim 10, it is characterised in that the Dependency Specification is by adjusting Startup time and the mark of being relied on for task that taking for task is relied on;
Then the task processing module is specifically included:
Whether the second judging unit, for judging Startup time that the task relied on earlier than current time;
3rd query unit, for when the Startup time is earlier than current time, according to task institute according to Whether the mark of bad task, inquire about relied on task from the data storage area by the task Reason device runs succeeded;
3rd task submits unit to, for when the tasks carrying for being relied on is successful, judging that current state meets Being transferred for task is submitted to the task processor and performs data processing by execution condition, and in the number Task processor has been submitted to according to the transferred task of memory block record to be carrying out;With,
3rd information recording unit, for being later than current time when the Startup time, or during the startup When carving earlier than current time but being relied on for task is not performed successfully, judge that current state does not meet execution bar Part, records transferred task in the data storage area and is not submitted to task processor execution.
16. task dispatchers as described in any one of claim 10 to 15, it is characterised in that described Business scheduler also includes:
Information receiving module, for receiving the task action result that the task processor is returned;
Information unloading module, for being stored in data storage area by the task action result;
The task transfers module, is additionally operable to after the task action result is stored in data storage area, Next pending task is transferred in continuation from the task list, until the task described in the task list Till being performed success entirely.
17. task dispatchers as claimed in claim 16, it is characterised in that the task dispatcher is also wrapped Include:
Second judge module, for after the task action result that the task processor is returned is received, root Judge whether the task runs succeeded according to the task action result;
4th information logging modle, for when the tasks carrying is successful, then in the data storage area Record the tasks carrying success;
5th information logging modle, for when the tasks carrying fails, then in the data storage area Record the tasks carrying failure.
18. task dispatchers as claimed in claim 17, it is characterised in that the task dispatcher is also wrapped Include:
Alarm module, after recording the tasks carrying failure in the data storage area, according to institute State task action result and generate alarm mail, and the alarm mail is sent to default receiving port;
Or, after recording the tasks carrying failure in the data storage area, to the task The number of times for performing failure adds one, when the number of times after Jia is more than alarm door limit value, according to the tasks carrying As a result alarm mail is generated, and the alarm mail is sent to default receiving port.
CN201510573492.8A 2015-09-10 2015-09-10 Processing method of data tasks and task scheduler Pending CN106528275A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510573492.8A CN106528275A (en) 2015-09-10 2015-09-10 Processing method of data tasks and task scheduler

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510573492.8A CN106528275A (en) 2015-09-10 2015-09-10 Processing method of data tasks and task scheduler

Publications (1)

Publication Number Publication Date
CN106528275A true CN106528275A (en) 2017-03-22

Family

ID=58346069

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510573492.8A Pending CN106528275A (en) 2015-09-10 2015-09-10 Processing method of data tasks and task scheduler

Country Status (1)

Country Link
CN (1) CN106528275A (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107341061A (en) * 2017-07-20 2017-11-10 郑州云海信息技术有限公司 A kind of data dispatch processing method and processing device
CN107402956A (en) * 2017-06-07 2017-11-28 网易(杭州)网络有限公司 Data processing method, equipment and the computer-readable recording medium of big task
CN107423131A (en) * 2017-03-29 2017-12-01 努比亚技术有限公司 A kind of sharing method and server
CN108108235A (en) * 2017-12-07 2018-06-01 北京荣之联科技股份有限公司 Task processing method and device
CN108182111A (en) * 2018-01-23 2018-06-19 百度在线网络技术(北京)有限公司 Task scheduling system, method and apparatus
CN108319499A (en) * 2018-02-07 2018-07-24 麒麟合盛网络技术股份有限公司 Method for scheduling task and device
CN108694564A (en) * 2018-06-07 2018-10-23 阿里巴巴集团控股有限公司 A kind of task status control method and device
CN108805723A (en) * 2018-05-24 2018-11-13 中国平安人寿保险股份有限公司 Data creation method, equipment, storage medium based on loss ratio system and device
CN109271238A (en) * 2017-07-12 2019-01-25 北京京东尚科信息技术有限公司 Support the task scheduling apparatus and method of a variety of programming languages
CN109375993A (en) * 2018-08-21 2019-02-22 中国平安人寿保险股份有限公司 Detection method, device, computer equipment and the storage medium of dispatch list
CN109471705A (en) * 2017-09-08 2019-03-15 杭州海康威视数字技术股份有限公司 Method, equipment and system, the computer equipment of task schedule
CN109933414A (en) * 2017-12-18 2019-06-25 飞狐信息技术(天津)有限公司 A kind of method for scheduling task, apparatus and system
CN110119306A (en) * 2019-05-22 2019-08-13 无锡华云数据技术服务有限公司 Operation Automatic dispatching equalization methods, device, equipment and storage medium
CN110673938A (en) * 2019-09-23 2020-01-10 腾讯科技(深圳)有限公司 Task processing method, system, server and storage medium
CN110764883A (en) * 2018-07-26 2020-02-07 浙江宇视科技有限公司 Start-stop control method and device for service group
CN110766341A (en) * 2019-10-31 2020-02-07 北京东软望海科技有限公司 Control method and device for optimizing job scheduling, computer equipment and storage medium
CN110807048A (en) * 2019-10-30 2020-02-18 泰康保险集团股份有限公司 Automatic task processing method and device, computer storage medium and electronic equipment
CN111190725A (en) * 2019-08-09 2020-05-22 腾讯科技(深圳)有限公司 Task processing method and device, storage medium and server
WO2020186809A1 (en) * 2019-03-19 2020-09-24 深圳壹账通智能科技有限公司 Big data platform-based hive task scheduling method and apparatus, device, and storage medium
CN112052575A (en) * 2020-08-27 2020-12-08 浙江工业大学 Automatic working principle screening method based on function execution sequence
CN112069367A (en) * 2020-09-07 2020-12-11 上海金仕达软件科技有限公司 Task scheduling method and system
CN112905321A (en) * 2021-02-07 2021-06-04 北京红山信息科技研究院有限公司 Event response type task triggering method and device, electronic equipment and storage medium
CN113760471A (en) * 2020-06-01 2021-12-07 阿里巴巴集团控股有限公司 Task processing system and method, continuous integration system, execution machine and storage device
CN114650287A (en) * 2022-04-06 2022-06-21 中国工商银行股份有限公司 Breakpoint continuous making method, device and system of data file

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101425024A (en) * 2008-10-24 2009-05-06 中国移动通信集团山东有限公司 Multitasking method and device
CN101567013A (en) * 2009-06-02 2009-10-28 阿里巴巴集团控股有限公司 Method and apparatus for implementing ETL scheduling
US20110131448A1 (en) * 2009-11-30 2011-06-02 Iron Mountain, Incorporated Performing a workflow having a set of dependancy-related predefined activities on a plurality of task servers
US20110321051A1 (en) * 2010-06-25 2011-12-29 Ebay Inc. Task scheduling based on dependencies and resources
CN103064745A (en) * 2013-01-09 2013-04-24 苏州亿倍信息技术有限公司 Method and system for distributing and processing tasks
CN103294533A (en) * 2012-10-30 2013-09-11 北京安天电子设备有限公司 Task flow control method and task flow control system
CN104536811A (en) * 2014-12-26 2015-04-22 广州华多网络科技有限公司 HIVE task based task scheduling method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101425024A (en) * 2008-10-24 2009-05-06 中国移动通信集团山东有限公司 Multitasking method and device
CN101567013A (en) * 2009-06-02 2009-10-28 阿里巴巴集团控股有限公司 Method and apparatus for implementing ETL scheduling
US20110131448A1 (en) * 2009-11-30 2011-06-02 Iron Mountain, Incorporated Performing a workflow having a set of dependancy-related predefined activities on a plurality of task servers
US20110321051A1 (en) * 2010-06-25 2011-12-29 Ebay Inc. Task scheduling based on dependencies and resources
CN103294533A (en) * 2012-10-30 2013-09-11 北京安天电子设备有限公司 Task flow control method and task flow control system
CN103064745A (en) * 2013-01-09 2013-04-24 苏州亿倍信息技术有限公司 Method and system for distributing and processing tasks
CN104536811A (en) * 2014-12-26 2015-04-22 广州华多网络科技有限公司 HIVE task based task scheduling method and device

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107423131A (en) * 2017-03-29 2017-12-01 努比亚技术有限公司 A kind of sharing method and server
CN107423131B (en) * 2017-03-29 2021-01-15 努比亚技术有限公司 Sharing method and server
CN107402956A (en) * 2017-06-07 2017-11-28 网易(杭州)网络有限公司 Data processing method, equipment and the computer-readable recording medium of big task
CN107402956B (en) * 2017-06-07 2020-02-21 网易有道信息技术(杭州)有限公司 Data processing method and device for large task and computer readable storage medium
CN109271238A (en) * 2017-07-12 2019-01-25 北京京东尚科信息技术有限公司 Support the task scheduling apparatus and method of a variety of programming languages
CN107341061A (en) * 2017-07-20 2017-11-10 郑州云海信息技术有限公司 A kind of data dispatch processing method and processing device
CN109471705A (en) * 2017-09-08 2019-03-15 杭州海康威视数字技术股份有限公司 Method, equipment and system, the computer equipment of task schedule
CN108108235B (en) * 2017-12-07 2021-10-29 哈勃智远(北京)科技有限公司 Task processing method and device
CN108108235A (en) * 2017-12-07 2018-06-01 北京荣之联科技股份有限公司 Task processing method and device
CN109933414A (en) * 2017-12-18 2019-06-25 飞狐信息技术(天津)有限公司 A kind of method for scheduling task, apparatus and system
CN108182111A (en) * 2018-01-23 2018-06-19 百度在线网络技术(北京)有限公司 Task scheduling system, method and apparatus
CN108182111B (en) * 2018-01-23 2022-07-15 百度在线网络技术(北京)有限公司 Task scheduling system, method and device
CN108319499A (en) * 2018-02-07 2018-07-24 麒麟合盛网络技术股份有限公司 Method for scheduling task and device
CN108319499B (en) * 2018-02-07 2021-06-04 麒麟合盛网络技术股份有限公司 Task scheduling method and device
CN108805723A (en) * 2018-05-24 2018-11-13 中国平安人寿保险股份有限公司 Data creation method, equipment, storage medium based on loss ratio system and device
CN108694564A (en) * 2018-06-07 2018-10-23 阿里巴巴集团控股有限公司 A kind of task status control method and device
CN110764883A (en) * 2018-07-26 2020-02-07 浙江宇视科技有限公司 Start-stop control method and device for service group
CN110764883B (en) * 2018-07-26 2022-08-23 浙江宇视科技有限公司 Start-stop control method and device for service group
CN109375993B (en) * 2018-08-21 2024-04-05 中国平安人寿保险股份有限公司 Method, device, computer equipment and storage medium for detecting schedule
CN109375993A (en) * 2018-08-21 2019-02-22 中国平安人寿保险股份有限公司 Detection method, device, computer equipment and the storage medium of dispatch list
WO2020186809A1 (en) * 2019-03-19 2020-09-24 深圳壹账通智能科技有限公司 Big data platform-based hive task scheduling method and apparatus, device, and storage medium
CN110119306A (en) * 2019-05-22 2019-08-13 无锡华云数据技术服务有限公司 Operation Automatic dispatching equalization methods, device, equipment and storage medium
CN111190725A (en) * 2019-08-09 2020-05-22 腾讯科技(深圳)有限公司 Task processing method and device, storage medium and server
CN110673938B (en) * 2019-09-23 2021-05-28 腾讯科技(深圳)有限公司 Task processing method, system, server and storage medium
CN110673938A (en) * 2019-09-23 2020-01-10 腾讯科技(深圳)有限公司 Task processing method, system, server and storage medium
CN110807048A (en) * 2019-10-30 2020-02-18 泰康保险集团股份有限公司 Automatic task processing method and device, computer storage medium and electronic equipment
CN110766341A (en) * 2019-10-31 2020-02-07 北京东软望海科技有限公司 Control method and device for optimizing job scheduling, computer equipment and storage medium
CN113760471A (en) * 2020-06-01 2021-12-07 阿里巴巴集团控股有限公司 Task processing system and method, continuous integration system, execution machine and storage device
CN112052575A (en) * 2020-08-27 2020-12-08 浙江工业大学 Automatic working principle screening method based on function execution sequence
CN112069367A (en) * 2020-09-07 2020-12-11 上海金仕达软件科技有限公司 Task scheduling method and system
CN112905321A (en) * 2021-02-07 2021-06-04 北京红山信息科技研究院有限公司 Event response type task triggering method and device, electronic equipment and storage medium
CN114650287A (en) * 2022-04-06 2022-06-21 中国工商银行股份有限公司 Breakpoint continuous making method, device and system of data file
CN114650287B (en) * 2022-04-06 2024-04-30 中国工商银行股份有限公司 Breakpoint continuous method, device and system for data file

Similar Documents

Publication Publication Date Title
CN106528275A (en) Processing method of data tasks and task scheduler
US8255399B2 (en) Data classifier
US20170046450A1 (en) Enhanced search suggestion for personal information services
CN107958028A (en) Method, apparatus, storage medium and the terminal of data acquisition
US7257637B2 (en) Method for transforming data between business protocols
WO2016060547A1 (en) Emulating manual system of filing using electronic document and electronic file
US8707259B2 (en) Framework for open dynamic reflective applications
CN111400011B (en) Real-time task scheduling method, system, equipment and readable storage medium
US11567735B1 (en) Systems and methods for integration of multiple programming languages within a pipelined search query
CN104423982B (en) The processing method and processing equipment of request
CN109635024A (en) A kind of data migration method and system
CN112559525B (en) Data checking system, method, device and server
US20140108367A1 (en) Client apparatus and database server for resumable transaction and method thereof
CN105718307B (en) Process management method and management of process device
CN108345658A (en) Algorithm calculates decomposing process, server and the storage medium of track
CN110838338B (en) Biological analysis item establishment system, biological analysis item establishment method, storage medium, and electronic device
US20090187913A1 (en) Ordering multiple resources
CN112817799B (en) Method and device for accessing multiple data sources based on Spring framework
CN111290855B (en) GPU card management method, system and storage medium for multiple GPU servers in distributed environment
CN105095416B (en) A kind of method and apparatus realizing content in the search and promoting
CN116263717A (en) Order service processing method and device based on event
JP5622049B2 (en) Batch processing system and batch processing method
US20070214127A1 (en) Scalable data extraction from data stores
US8166018B2 (en) Browsing a list of data items
US7530013B2 (en) Optimizer of interdependent information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170322

RJ01 Rejection of invention patent application after publication