[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN110110863A - A kind of distributed machines study tune ginseng system based on celery - Google Patents

A kind of distributed machines study tune ginseng system based on celery Download PDF

Info

Publication number
CN110110863A
CN110110863A CN201910400730.3A CN201910400730A CN110110863A CN 110110863 A CN110110863 A CN 110110863A CN 201910400730 A CN201910400730 A CN 201910400730A CN 110110863 A CN110110863 A CN 110110863A
Authority
CN
China
Prior art keywords
task
node
parameter
celery
message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910400730.3A
Other languages
Chinese (zh)
Inventor
刘宇为
夏凡
饶雪云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Univeristy of Technology
Original Assignee
Chengdu Univeristy of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Univeristy of Technology filed Critical Chengdu Univeristy of Technology
Priority to CN201910400730.3A priority Critical patent/CN110110863A/en
Publication of CN110110863A publication Critical patent/CN110110863A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/547Remote procedure calls [RPC]; Web services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/54Indexing scheme relating to G06F9/54
    • G06F2209/547Messaging middleware

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

Automatic seeking is learnt as the distributed machines of core using Celery task queue frame join system the invention discloses a kind of, comprising the following steps: whole system includes a host node and several from node.Host node generates initial parameter task according to the hyper parameter of configuration and is put into message-oriented middleware, message-oriented middleware is monitored from node, when listening to task, task is obtained from node and is run, after end of run, host node is sent to from node by operation result, host node is generated parameter task using parameter generation algorithm according to the result run and places into message-oriented middleware, and so on, until reaching the number of iterations of setting, system stops, and eventually finds proper hyper parameter.

Description

A kind of distributed machines study tune ginseng system based on celery
Technical field
The invention belongs to machine learning fields, are related to automaton study aspect, design a kind of machine based on celery Study is distributed to adjust ginseng system.
Background technique
With the continuous development of IT application to our society, all trades and professions can all generate a large amount of basic data.Excavating number When according to value, machine learning modeling technique can be used.During modeling, in addition to preparing data, what is most taken time and effort is exactly Various super ginseng combinations are attempted, the process of model optimum efficiency is found.Even with experienced algorithm engineering teacher and data section Scholar is sometimes also difficult to hold rule therein, can only repeatedly attempt.Since data volume is big, one time cut-and-try process will consume Take for a long time, along with hyper parameter number of combinations is big, the number of trial is also very more, when can expend very long for single machine trial Between.The time is modeled to shorten, it would be desirable to be employed computer cluster, be joined using multiple host come distributed execution automatic seeking Journey.
Summary of the invention
Ginseng, which is sought, the technical problem to be solved by the present invention is to conventional machines study uses grid search and random mostly Search, parameter search require a great deal of time without directiveness to find suitable hyper parameter, this system uses distributed Mode search for hyper parameter, and the generation of hyper parameter is guidance with previous training result, greatly shorten parameter optimization Time.
In order to realize the target of foregoing invention, the present invention provides following technical schemes:
Machine learning distribution tune ginseng system based on celery, comprising the following steps:
S1, code and data is submitted to specified directory and to be distributed to distance host by web interface;
S2, page configuration hyper parameter type and range;
S3, system generate initial parameter task and are put into message-oriented middleware;
S4, task is obtained from node to message-oriented middleware and is executed;
S5, after having executed from node, result is returned into host node;
S6, the host node trained result before generate hyper parameter task using parameter generation algorithm and store to message Middleware supplies to consume from node.
S7, when frequency of training reaches specified value, whole system just stops.
By above step, suitable hyper parameter is finally found with the shorter time.
Main advantages of the present invention are using realizing distributed operation based on celery task queue frame, among message Part uses redis, and system coupling is good, and scalability is strong.After the completion of its subsystem ought be trained once when running, it can will transport Row result feeds back to host and the parameter of next round is instructed to generate, and generates parameter directional, shortens the parameter optimization time.In addition This system is operated using web page, and code configuration parameter need to be only submitted on the page, so that it may task is run, it is simple and convenient.
Detailed description of the invention
Fig. 1 system general frame.
Fig. 2 distributed structure/architecture figure.
Fig. 3 parameter task generation module.
Specific embodiment
Specific implementation design scheme of the invention is described in further detail below with reference to Fig. 1 Fig. 2 Fig. 3.
As shown in figure one, figure two, this system realizes distributed system using celery task queue frame, among message That part relies on is redis.Whole system includes a host node master and several from node work, and host node mainly manages Entire cluster and parameter task generate, and the parameter task of generation is put into message-oriented middleware, from node monitoring information middleware, When there is task, acquisition task is simultaneously executed, and thereby realizes distribution.
As shown in Figure 3, when having executed a task from node, implementing result can be sent to host node, host point root According to task of having completed as a result, using parameter optimization algorithm, the optimization algorithms such as Bayes, TPE can be used and generate hyper parameter Task realizes the directional guidance of generation of hyper parameter.

Claims (4)

1. a kind of machine learning distribution tune ginseng system based on celery, which comprises the following steps:
S1, code and data is submitted to specified directory and to be distributed to distance host by web interface;
S2, page configuration hyper parameter type and range;
S3, system generate initial parameter task according to configuration and are put into message-oriented middleware;
S4, task is obtained from node to message-oriented middleware and is executed;
S5, after having executed task from node, result is returned into host node;
S6, the host node trained result before continue generation hyper parameter task using parameter generation algorithm and store to message Middleware supplies to consume from node;
S7, when frequency of training reaches specified value, whole system just stops.
2. the machine learning distribution tune ginseng system according to claim 1 based on celery, which is characterized in that step S1 The management of middle cluster uses socket communication protocol to complete the mutual access between host, and efficiency of transmission is high, stability It is good.
3. the machine learning distribution tune ginseng system according to claim 1 based on celery, which is characterized in that step S3 , in S4, S5, generation and the consumption of task use celery to dispatch, and using redis as message-oriented middleware, facilitate decoupling It closes.
4. the machine learning distribution tune ginseng system according to claim 1 based on celery, which is characterized in that hyper parameter Generating with housebroken result is foundation guide parameters generation, while the part is integrated with many kinds of parameters optimizing algorithm, including Bayes, TPE, Anneal etc., can also artificial spreading parameter optimizing algorithm, shorten the parameter optimization time.
CN201910400730.3A 2019-05-15 2019-05-15 A kind of distributed machines study tune ginseng system based on celery Pending CN110110863A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910400730.3A CN110110863A (en) 2019-05-15 2019-05-15 A kind of distributed machines study tune ginseng system based on celery

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910400730.3A CN110110863A (en) 2019-05-15 2019-05-15 A kind of distributed machines study tune ginseng system based on celery

Publications (1)

Publication Number Publication Date
CN110110863A true CN110110863A (en) 2019-08-09

Family

ID=67490080

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910400730.3A Pending CN110110863A (en) 2019-05-15 2019-05-15 A kind of distributed machines study tune ginseng system based on celery

Country Status (1)

Country Link
CN (1) CN110110863A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112202858A (en) * 2020-09-22 2021-01-08 苏宁消费金融有限公司 Jenkins cluster management method and system based on cell distributed scheduling framework
CN113568757A (en) * 2021-09-22 2021-10-29 中建电子商务有限责任公司 Large-scale distributed inference engine and system based on deep learning
CN113608722A (en) * 2021-07-31 2021-11-05 云南电网有限责任公司信息中心 Algorithm packaging method based on distributed technology

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112202858A (en) * 2020-09-22 2021-01-08 苏宁消费金融有限公司 Jenkins cluster management method and system based on cell distributed scheduling framework
CN112202858B (en) * 2020-09-22 2022-06-17 苏宁消费金融有限公司 Jenkins cluster management method and system based on cell distributed scheduling framework
CN113608722A (en) * 2021-07-31 2021-11-05 云南电网有限责任公司信息中心 Algorithm packaging method based on distributed technology
CN113568757A (en) * 2021-09-22 2021-10-29 中建电子商务有限责任公司 Large-scale distributed inference engine and system based on deep learning

Similar Documents

Publication Publication Date Title
CN108932588B (en) Hydropower station group optimal scheduling system with separated front end and rear end and method
CN110110863A (en) A kind of distributed machines study tune ginseng system based on celery
CN104885078A (en) Method for two-stage query optimization in massively parallel processing database clusters
CN103488537A (en) Method and device for executing data ETL (Extraction, Transformation and Loading)
Xue et al. Optimizing ontology alignment through memetic algorithm based on partial reference alignment
Gagné et al. Distributed beagle: An environment for parallel and distributed evolutionary computations
CN103793467A (en) Method for optimizing real-time query on big data on basis of hyper-graphs and dynamic programming
CN104133891A (en) Method for storing massive structural data based on relational database
CN110765582B (en) Self-organization center K-means microgrid scene division method based on Markov chain
CN106601235A (en) Semi-supervision multitask characteristic selecting speech recognition method
CN104951306A (en) Data processing method and system based on real-time calculation frame
CN113312177B (en) Wireless edge computing system and optimizing method based on federal learning
CN110109658A (en) A kind of ROS code generator and code generating method based on formalized model
CN116306769A (en) Bayesian optimization method and system containing countermeasure generation network
CN115827189A (en) Large-scale intelligent cluster task scheduling optimization method
CN104462329A (en) Operation process digging method suitable for diversified environment
Czarnul et al. Simulation of parallel similarity measure computations for large data sets
CN111695701B (en) System for realizing data set construction processing based on federal learning and construction generation method thereof
CN102779025A (en) Parallel PLSA (Probabilistic Latent Semantic Analysis) method based on Hadoop
CN115688495B (en) Distributed LVC simulation system collaborative planning method, server and storage medium
CN116227647B (en) Digital twin deduction optimization method and system for power system
Liu et al. Research on a union algorithm of multiple concept lattices
CN114021977A (en) Typical composite scene reduction method and system for energy storage planning
Wang et al. An expansion planning approach for intelligent grids with speculative parallelism
CN103793745A (en) Distributed particle swarm optimization method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190809

WD01 Invention patent application deemed withdrawn after publication