CN110110863A - A kind of distributed machines study tune ginseng system based on celery - Google Patents
A kind of distributed machines study tune ginseng system based on celery Download PDFInfo
- Publication number
- CN110110863A CN110110863A CN201910400730.3A CN201910400730A CN110110863A CN 110110863 A CN110110863 A CN 110110863A CN 201910400730 A CN201910400730 A CN 201910400730A CN 110110863 A CN110110863 A CN 110110863A
- Authority
- CN
- China
- Prior art keywords
- task
- node
- parameter
- celery
- message
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/54—Interprogram communication
- G06F9/547—Remote procedure calls [RPC]; Web services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/54—Indexing scheme relating to G06F9/54
- G06F2209/547—Messaging middleware
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Automatic seeking is learnt as the distributed machines of core using Celery task queue frame join system the invention discloses a kind of, comprising the following steps: whole system includes a host node and several from node.Host node generates initial parameter task according to the hyper parameter of configuration and is put into message-oriented middleware, message-oriented middleware is monitored from node, when listening to task, task is obtained from node and is run, after end of run, host node is sent to from node by operation result, host node is generated parameter task using parameter generation algorithm according to the result run and places into message-oriented middleware, and so on, until reaching the number of iterations of setting, system stops, and eventually finds proper hyper parameter.
Description
Technical field
The invention belongs to machine learning fields, are related to automaton study aspect, design a kind of machine based on celery
Study is distributed to adjust ginseng system.
Background technique
With the continuous development of IT application to our society, all trades and professions can all generate a large amount of basic data.Excavating number
When according to value, machine learning modeling technique can be used.During modeling, in addition to preparing data, what is most taken time and effort is exactly
Various super ginseng combinations are attempted, the process of model optimum efficiency is found.Even with experienced algorithm engineering teacher and data section
Scholar is sometimes also difficult to hold rule therein, can only repeatedly attempt.Since data volume is big, one time cut-and-try process will consume
Take for a long time, along with hyper parameter number of combinations is big, the number of trial is also very more, when can expend very long for single machine trial
Between.The time is modeled to shorten, it would be desirable to be employed computer cluster, be joined using multiple host come distributed execution automatic seeking
Journey.
Summary of the invention
Ginseng, which is sought, the technical problem to be solved by the present invention is to conventional machines study uses grid search and random mostly
Search, parameter search require a great deal of time without directiveness to find suitable hyper parameter, this system uses distributed
Mode search for hyper parameter, and the generation of hyper parameter is guidance with previous training result, greatly shorten parameter optimization
Time.
In order to realize the target of foregoing invention, the present invention provides following technical schemes:
Machine learning distribution tune ginseng system based on celery, comprising the following steps:
S1, code and data is submitted to specified directory and to be distributed to distance host by web interface;
S2, page configuration hyper parameter type and range;
S3, system generate initial parameter task and are put into message-oriented middleware;
S4, task is obtained from node to message-oriented middleware and is executed;
S5, after having executed from node, result is returned into host node;
S6, the host node trained result before generate hyper parameter task using parameter generation algorithm and store to message
Middleware supplies to consume from node.
S7, when frequency of training reaches specified value, whole system just stops.
By above step, suitable hyper parameter is finally found with the shorter time.
Main advantages of the present invention are using realizing distributed operation based on celery task queue frame, among message
Part uses redis, and system coupling is good, and scalability is strong.After the completion of its subsystem ought be trained once when running, it can will transport
Row result feeds back to host and the parameter of next round is instructed to generate, and generates parameter directional, shortens the parameter optimization time.In addition
This system is operated using web page, and code configuration parameter need to be only submitted on the page, so that it may task is run, it is simple and convenient.
Detailed description of the invention
Fig. 1 system general frame.
Fig. 2 distributed structure/architecture figure.
Fig. 3 parameter task generation module.
Specific embodiment
Specific implementation design scheme of the invention is described in further detail below with reference to Fig. 1 Fig. 2 Fig. 3.
As shown in figure one, figure two, this system realizes distributed system using celery task queue frame, among message
That part relies on is redis.Whole system includes a host node master and several from node work, and host node mainly manages
Entire cluster and parameter task generate, and the parameter task of generation is put into message-oriented middleware, from node monitoring information middleware,
When there is task, acquisition task is simultaneously executed, and thereby realizes distribution.
As shown in Figure 3, when having executed a task from node, implementing result can be sent to host node, host point root
According to task of having completed as a result, using parameter optimization algorithm, the optimization algorithms such as Bayes, TPE can be used and generate hyper parameter
Task realizes the directional guidance of generation of hyper parameter.
Claims (4)
1. a kind of machine learning distribution tune ginseng system based on celery, which comprises the following steps:
S1, code and data is submitted to specified directory and to be distributed to distance host by web interface;
S2, page configuration hyper parameter type and range;
S3, system generate initial parameter task according to configuration and are put into message-oriented middleware;
S4, task is obtained from node to message-oriented middleware and is executed;
S5, after having executed task from node, result is returned into host node;
S6, the host node trained result before continue generation hyper parameter task using parameter generation algorithm and store to message
Middleware supplies to consume from node;
S7, when frequency of training reaches specified value, whole system just stops.
2. the machine learning distribution tune ginseng system according to claim 1 based on celery, which is characterized in that step S1
The management of middle cluster uses socket communication protocol to complete the mutual access between host, and efficiency of transmission is high, stability
It is good.
3. the machine learning distribution tune ginseng system according to claim 1 based on celery, which is characterized in that step S3
, in S4, S5, generation and the consumption of task use celery to dispatch, and using redis as message-oriented middleware, facilitate decoupling
It closes.
4. the machine learning distribution tune ginseng system according to claim 1 based on celery, which is characterized in that hyper parameter
Generating with housebroken result is foundation guide parameters generation, while the part is integrated with many kinds of parameters optimizing algorithm, including
Bayes, TPE, Anneal etc., can also artificial spreading parameter optimizing algorithm, shorten the parameter optimization time.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910400730.3A CN110110863A (en) | 2019-05-15 | 2019-05-15 | A kind of distributed machines study tune ginseng system based on celery |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910400730.3A CN110110863A (en) | 2019-05-15 | 2019-05-15 | A kind of distributed machines study tune ginseng system based on celery |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110110863A true CN110110863A (en) | 2019-08-09 |
Family
ID=67490080
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910400730.3A Pending CN110110863A (en) | 2019-05-15 | 2019-05-15 | A kind of distributed machines study tune ginseng system based on celery |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110110863A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112202858A (en) * | 2020-09-22 | 2021-01-08 | 苏宁消费金融有限公司 | Jenkins cluster management method and system based on cell distributed scheduling framework |
CN113568757A (en) * | 2021-09-22 | 2021-10-29 | 中建电子商务有限责任公司 | Large-scale distributed inference engine and system based on deep learning |
CN113608722A (en) * | 2021-07-31 | 2021-11-05 | 云南电网有限责任公司信息中心 | Algorithm packaging method based on distributed technology |
-
2019
- 2019-05-15 CN CN201910400730.3A patent/CN110110863A/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112202858A (en) * | 2020-09-22 | 2021-01-08 | 苏宁消费金融有限公司 | Jenkins cluster management method and system based on cell distributed scheduling framework |
CN112202858B (en) * | 2020-09-22 | 2022-06-17 | 苏宁消费金融有限公司 | Jenkins cluster management method and system based on cell distributed scheduling framework |
CN113608722A (en) * | 2021-07-31 | 2021-11-05 | 云南电网有限责任公司信息中心 | Algorithm packaging method based on distributed technology |
CN113568757A (en) * | 2021-09-22 | 2021-10-29 | 中建电子商务有限责任公司 | Large-scale distributed inference engine and system based on deep learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108932588B (en) | Hydropower station group optimal scheduling system with separated front end and rear end and method | |
CN110110863A (en) | A kind of distributed machines study tune ginseng system based on celery | |
CN104885078A (en) | Method for two-stage query optimization in massively parallel processing database clusters | |
CN103488537A (en) | Method and device for executing data ETL (Extraction, Transformation and Loading) | |
Xue et al. | Optimizing ontology alignment through memetic algorithm based on partial reference alignment | |
Gagné et al. | Distributed beagle: An environment for parallel and distributed evolutionary computations | |
CN103793467A (en) | Method for optimizing real-time query on big data on basis of hyper-graphs and dynamic programming | |
CN104133891A (en) | Method for storing massive structural data based on relational database | |
CN110765582B (en) | Self-organization center K-means microgrid scene division method based on Markov chain | |
CN106601235A (en) | Semi-supervision multitask characteristic selecting speech recognition method | |
CN104951306A (en) | Data processing method and system based on real-time calculation frame | |
CN113312177B (en) | Wireless edge computing system and optimizing method based on federal learning | |
CN110109658A (en) | A kind of ROS code generator and code generating method based on formalized model | |
CN116306769A (en) | Bayesian optimization method and system containing countermeasure generation network | |
CN115827189A (en) | Large-scale intelligent cluster task scheduling optimization method | |
CN104462329A (en) | Operation process digging method suitable for diversified environment | |
Czarnul et al. | Simulation of parallel similarity measure computations for large data sets | |
CN111695701B (en) | System for realizing data set construction processing based on federal learning and construction generation method thereof | |
CN102779025A (en) | Parallel PLSA (Probabilistic Latent Semantic Analysis) method based on Hadoop | |
CN115688495B (en) | Distributed LVC simulation system collaborative planning method, server and storage medium | |
CN116227647B (en) | Digital twin deduction optimization method and system for power system | |
Liu et al. | Research on a union algorithm of multiple concept lattices | |
CN114021977A (en) | Typical composite scene reduction method and system for energy storage planning | |
Wang et al. | An expansion planning approach for intelligent grids with speculative parallelism | |
CN103793745A (en) | Distributed particle swarm optimization method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190809 |
|
WD01 | Invention patent application deemed withdrawn after publication |