[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN105959363A - Big data cluster deployment method capable of adapting to hardware configuration - Google Patents

Big data cluster deployment method capable of adapting to hardware configuration Download PDF

Info

Publication number
CN105959363A
CN105959363A CN201610264394.0A CN201610264394A CN105959363A CN 105959363 A CN105959363 A CN 105959363A CN 201610264394 A CN201610264394 A CN 201610264394A CN 105959363 A CN105959363 A CN 105959363A
Authority
CN
China
Prior art keywords
puppet
agent
master
configuration
data sets
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610264394.0A
Other languages
Chinese (zh)
Inventor
唐明
常梦楠
任红雷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Electronic Technology Cyber Security Co Ltd
Original Assignee
China Electronic Technology Cyber Security Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Electronic Technology Cyber Security Co Ltd filed Critical China Electronic Technology Cyber Security Co Ltd
Priority to CN201610264394.0A priority Critical patent/CN105959363A/en
Publication of CN105959363A publication Critical patent/CN105959363A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/08Network architectures or network communication protocols for network security for authentication of entities

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Stored Programmes (AREA)

Abstract

The invention discloses a big data cluster deployment method capable of adapting to hardware configuration. Synchronization requests are transmitted to a puppet-master by puppet-agents, and after the puppet-agents pass the authentication, the puppet-agents are used to detect server hardware configuration information, and are used to transmit the server hardware configuration information to the puppet-master; the puppet-master is used to generate pseudo codes, which are transmitted to the puppet-agents; the puppet-agents are used to execute the content of the pseudo codes belonging to the own nodes, and then automatic deployment of a cluster is completed, and synchronization is completed. By adopting the big data cluster deployment method capable of adapting to the hardware configuration, automatic batch deployment of a big data environment is realized, and the processes from the detection of the hardware information to the acquisition of the software installation and the generation of the configuration files are realized by programming, and therefore tedious operations such as manual software installation and configuration file changing are not required. The big data cluster deployment method is advantageous in that the workload of the big data cluster set-up deployment is relieved, the time cost of the cluster deployment is saved, and the subsequent unified management and the subsequent unified operation and maintenance are facilitated.

Description

A kind of large data sets group's dispositions method of adaptive hardware configuration
Technical field
The present invention relates to large data sets group's dispositions method of a kind of adaptive hardware configuration.
Background technology
Large data sets group includes data batch processing and real-time streaming processes framework, also includes certification and authority control The systems such as system, the assembly related to is various, such as hadoop, hive, kafka, storm, spark, zooke Relation of interdependence is there may be between eper, kerberos, sentry etc., and different assembly.Big number The installation of these assemblies and the configuration of dependence is mainly included according to the deployment of cluster.
Puppet is a kind of cluster configuration management instrument.This instrument typically uses main frame-broker architecture, uses Its built-in puppet modeling language is to cluster resource, including user account, specific file and file, soft End-state and the dependence of part bag and service etc. are described, to complete automatization's clustered deploy(ment) and to join Put.
The role served as due to server in distributed type assemblies differs, and the server brand that different clusters use is also Be not quite similar, so must account for hardware configuration when clustered deploy(ment) is installed, as cpu core number, internal memory, File system division etc..Traditional dispositions method manually, needs to carry out repeatedly manual amendment, efficiency Low and error-prone.
Summary of the invention
In order to overcome the shortcoming of prior art, the invention provides the large data sets of a kind of adaptive hardware configuration Group's dispositions method.
The technical solution adopted in the present invention is: the large data sets group side of deployment of a kind of adaptive hardware configuration Method, comprises the steps:
Step one, puppet-agent send synchronization request to puppet-master;
Puppet-agent is authenticated by step 2, puppet-master;
Step 3, puppet-agent detecting server hardware configuration information are also sent to puppet-master;
Step 4, puppet-master combine in the information analysis manifest that puppet-agent sends The configuration content of corresponding node, generates false code and is sent to puppet-agent;
Step 5, puppet-agent receive false code, and each puppet-agent performs to one's name to save The false code content of point, the automatization completing cluster disposes, and terminates this subsynchronous.
Compared with prior art, the positive effect of the present invention is: the present invention realizes the automatization of big data environment Batch is disposed, and getting software installation generate with configuration file from hardware information detection is all programming realization, nothing Need manual installation software and the tedious work of change configuration file, be the big data of a kind of adaptive hardware configuration Cluster Automation arranging method.The method alleviates large data sets group and builds the workload of deployment, has saved collection The time cost that group disposes, and facilitate follow-up unified management and O&M.
Accompanying drawing explanation
Examples of the present invention will be described by way of reference to the accompanying drawings, wherein:
Fig. 1 is large data sets group's dispositions method FB(flow block) of a kind of adaptive hardware configuration;
Fig. 2 is a kind of method flow block diagram of self-defined fact variable acquisition hardware configuration in puppet.
Detailed description of the invention
Large data sets group's dispositions method of a kind of adaptive hardware configuration, as it is shown in figure 1, comprise the steps:
Step one, puppet-agent send synchronization request to puppet-master;
Puppet-agent is authenticated by step 2, puppet-master:
If request synchronizes for the first time, need puppet-master that puppet-agent is authenticated.This Embodiment uses automatic authentication registration mode, is automatically performed certification after sending synchronization request.Automatically the side of registration Formula needs to delete the existing certificate of master and the agent end of puppet, then revises puppet-master The configuration file autosign.conf of middle puppet, content changes into: *.So after synchronization request sends, Certification can be automatically performed.
Step 3, puppet-agent detecting server hardware configuration information are also sent to puppet-master:
Puppet-agent calls facter, facter and detects some hardware informations of main frame, and saves as fa Ct variable, then transmits these information to puppet-master.Including at puppet-master end certainly The fact variable of definition, as a example by hadoop module, the file hadoop.rb of definition fact variable is synchronizing Before be saved in puppet-master end, this document can before puppet-agent calls facter detection hardware by Master end is sent to the puppet catalogue of agent end: in/var/lib/puppet/lib/facter/.This step Successful execution need to rely on a kind of method that self-defined fact variable obtains hardware configuration in puppet, the method Flow process as in figure 2 it is shown, specifically comprise the following steps that
1) creating .rb file in the hadoop module of puppet, position is /etc/puppet/modules/ha Doop/lib/facter/hadoop.rb, utilizes facter instrument to obtain server hardware configuration information, saves as variable. As a example by hadoop_hdfs_data_dirs variable defined in hadoop.rb file, herein below defines ha Doop_hdfs_data_dirs variable, if the catalogue of existence/data [0-9] * form in file system, then h The value of adoop_hdfs_data_dirs is /data [0-9] */hadoop/dfs/data, otherwise hadoop_hdfs_data The value of _ dirs is /data/hadoop/dfs/data.
In this embodiment, the value of cluster namenode node hadoop_hdfs_data_dirs is /data/hado Op/dfs/data, the value of all datanode node hadoop_hdfs_data_dirs is /data9/hadoop/dfs/ data,/data4/hadoop/dfs/data,/data8/hadoop/dfs/data,/data10/hadoop/dfs/data,/data7/h adoop/dfs/data,/data3/hadoop/dfs/data,/data2/hadoop/dfs/data,/data5/hadoop/dfs/dat a,/data1/hadoop/dfs/data,/data6/hadoop/dfs/data.It can be seen that by self-defined fact variable, Reach the effect of adaptive server hardware configuration.
2) in hadoop module, above-mentioned custom variable is used.Defined variable hadoop_hdfs_data_di The purpose of rs is for the value energy of dfs.datanode.data.dir in the configuration file hdfs-site.xml of hadoop Enough dynamically generate according to hardware configuration, then need the variable of definition to be used in the template file of this configuration file In hdfs-site.xml.erb, position is /etc/puppet/modules/hadoop/templates/hdfs-site.xml.e rb.The position using variable hadoop_hdfs_data_dirs in this template file is as follows.
3) puppet.conf in all nodes of cluster adds pluginsync, to support above making by oneself Justice fact variable mode.Puppet-master end, adds: pluginsyn in [main] of puppet.conf C=true;At puppet-agent end, add in [agent] of puppet.conf: pluginsync=t rue。
Step 4, puppet-master combine in the information analysis manifest that puppet-agent sends corresponding The configuration content of node, generate false code (catalog) and be sent to puppet-agent:
Puppet-master finds node configuration corresponding in manifest according to the host name of puppet-agent, And resolve configuring content by the node of puppet modeling language programming realization.Meanwhile, f Act variable replacement becomes the actual value that puppet-agent is transmitted through.Analysis result is puppet generation built-in for puppet Code (catalog), is then sent to corresponding puppet-agent false code.
Step 5, puppet-agent receive false code, and each puppet-agent performs to one's name node False code content, carries out configuration file transmission simultaneously, and the automatization completing cluster disposes.Terminate this subsynchronous.
Puppet-agent judges either with or without File file upon execution, if it has, then to puppet-master Request, completes the transmission of file.For different puppet-agent, in the template of puppet-master end The fact variate-value life that file hdfs-site.xml.erb can detect according to corresponding puppet-agent and send Become configuration file hdfs-site.xml, and be sent to the appointment position of corresponding puppet-agent.After being finished Terminate to synchronize, after all puppet-agent are with EOS, i.e. complete the deployment of large data sets group.
In above step, puppet-master communicates with all puppet-agent and uses ssl to connect, so Request first needs to be authenticated, to guarantee the safety of subsequent transmission data when connecting.
The present invention mainly uses puppet and facter programming realization large data sets group to dispose, it is not necessary to according to firmly Part configuration is revised manually, has reached the purpose of adaptive hardware configuration, has saved engineering construction Manpower and time cost.It should be noted that in above-described embodiment simply as a example by hadoop module for The bright present invention, when Practical Project is disposed, further relates to other modules, such as hive, hbase, storm etc., combines Closing these modules install and rely on configuration, can complete large data sets group builds deployment.

Claims (8)

1. large data sets group's dispositions method of an adaptive hardware configuration, it is characterised in that: include as follows Step:
Step one, puppet-agent send synchronization request to puppet-master;
Puppet-agent is authenticated by step 2, puppet-master;
Step 3, puppet-agent detecting server hardware configuration information are also sent to puppet-master;
Step 4, puppet-master combine in the information analysis manifest that puppet-agent sends The configuration content of corresponding node, generates false code and is sent to puppet-agent;
Step 5, puppet-agent receive false code, and each puppet-agent performs to one's name to save The false code content of point, the automatization completing cluster disposes, and terminates this subsynchronous.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 1 configuration, It is characterized in that: described certification is automatic authentication registration mode, automatically during registration, delete the maste of puppet The existing certificate of r and agent end, then the configuration file autosig of puppet in amendment puppet-master N.conf, content changes " * " into.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 1 configuration, It is characterized in that: puppet-agent, then will be hard by calling facter detecting server hardware configuration information Part configuration information saves as fact variable.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 3 configuration, It is characterized in that: the customizing method of described fact variable comprises the steps:
1) creating .rb file in the hadoop module of puppet, position is /etc/puppet/modules/ha Doop/lib/facter/hadoop.rb, utilizes facter instrument to obtain server hardware configuration information, saves as variable;
2) in hadoop module, above-mentioned variable is used;
3) puppet.conf in all nodes of cluster adds pluginsync.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 4 configuration, It is characterized in that: the method adding pluginsync in the puppet.conf in all nodes of cluster is: right In puppet-master end, in [main] of puppet.conf, add " pluginsync=true ";For Puppet-agent end, adds " pluginsync=true " in [agent] of puppet.conf.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 1 configuration, It is characterized in that: the puppet-master described in step 4 combines the information of puppet-agent transmission and enters The method that row resolves is: it is right that puppet-master finds in manifest according to the host name of puppet-agent The node configuration answered, and solve configuring content by the node of puppet modeling language programming realization Analysis;Meanwhile, fact variable replacement is become the actual value that puppet-agent is transmitted through.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 1 configuration, It is characterized in that: puppet-agent described in step 5 performs to sentence during the false code content of to one's name node Disconnected either with or without File file, if it has, then ask to puppet-master, complete the transmission of file.
Large data sets group's dispositions method of a kind of adaptive hardware the most according to claim 1 configuration, It is characterized in that: puppet-master and all puppet-agent communicate and all use ssl to connect.
CN201610264394.0A 2016-04-26 2016-04-26 Big data cluster deployment method capable of adapting to hardware configuration Pending CN105959363A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610264394.0A CN105959363A (en) 2016-04-26 2016-04-26 Big data cluster deployment method capable of adapting to hardware configuration

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610264394.0A CN105959363A (en) 2016-04-26 2016-04-26 Big data cluster deployment method capable of adapting to hardware configuration

Publications (1)

Publication Number Publication Date
CN105959363A true CN105959363A (en) 2016-09-21

Family

ID=56915790

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610264394.0A Pending CN105959363A (en) 2016-04-26 2016-04-26 Big data cluster deployment method capable of adapting to hardware configuration

Country Status (1)

Country Link
CN (1) CN105959363A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106533753A (en) * 2016-11-07 2017-03-22 广州视源电子科技股份有限公司 Role configuration method and device of distributed system
CN107634852A (en) * 2017-08-17 2018-01-26 新华三大数据技术有限公司 The method and apparatus for supervising big data cluster
CN108958744A (en) * 2018-06-21 2018-12-07 北京京东金融科技控股有限公司 Dispositions method, device, medium and the electronic equipment of big data distributed type assemblies
CN110784546A (en) * 2019-10-31 2020-02-11 浙江大华技术股份有限公司 Distributed cluster deployment method, server and storage device
CN111597536A (en) * 2020-05-19 2020-08-28 重庆第二师范学院 Hadoop cluster kerberos high-availability authentication method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678007A (en) * 2013-12-13 2014-03-26 浪潮(北京)电子信息产业有限公司 Method and system for deploying software in batches
CN104519100A (en) * 2013-09-29 2015-04-15 重庆新媒农信科技有限公司 Method for automatic heterogeneous platform file synchronization and puppet server

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104519100A (en) * 2013-09-29 2015-04-15 重庆新媒农信科技有限公司 Method for automatic heterogeneous platform file synchronization and puppet server
CN103678007A (en) * 2013-12-13 2014-03-26 浪潮(北京)电子信息产业有限公司 Method and system for deploying software in batches

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘宇: "《Puppet实战》", 31 January 2014 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106533753A (en) * 2016-11-07 2017-03-22 广州视源电子科技股份有限公司 Role configuration method and device of distributed system
CN106533753B (en) * 2016-11-07 2019-12-24 广州视源电子科技股份有限公司 Role configuration method and device of distributed system
CN107634852A (en) * 2017-08-17 2018-01-26 新华三大数据技术有限公司 The method and apparatus for supervising big data cluster
CN107634852B (en) * 2017-08-17 2018-12-11 新华三大数据技术有限公司 The method and apparatus for supervising big data cluster
CN108958744A (en) * 2018-06-21 2018-12-07 北京京东金融科技控股有限公司 Dispositions method, device, medium and the electronic equipment of big data distributed type assemblies
CN108958744B (en) * 2018-06-21 2022-08-09 京东科技控股股份有限公司 Deployment method, device, medium and electronic equipment of big data distributed cluster
CN110784546A (en) * 2019-10-31 2020-02-11 浙江大华技术股份有限公司 Distributed cluster deployment method, server and storage device
CN111597536A (en) * 2020-05-19 2020-08-28 重庆第二师范学院 Hadoop cluster kerberos high-availability authentication method

Similar Documents

Publication Publication Date Title
CN105959363A (en) Big data cluster deployment method capable of adapting to hardware configuration
CN103986738B (en) A kind of synchronous method between multiple terminals and system
US20190205315A1 (en) System and method for synchronizing data between communication devices in a networked environment without a central server
CN116472592A (en) System and method for remote monitoring and control of electrochromic glazing
CN112565415B (en) Cross-region resource management system and method based on cloud edge cooperation
CN106713391B (en) Session information sharing method and sharing system
CN103412768A (en) Zookeeper cluster automatic-deployment method based on script program
CN103873517A (en) Method, device and system for data synchronization
CN106020916A (en) Method for updating application, terminal equipment and server
CN103685530A (en) Automatic upgrade control method and system for WLAN current network APs
CN104991849A (en) Method for monitoring system resource occupation of Linux process through zabbix
CN104519100A (en) Method for automatic heterogeneous platform file synchronization and puppet server
CN102833092A (en) Method and system for managing cloud nodes and central server
CN105607606A (en) Data acquisition device and data acquisition method based on double-mainboard framework
CN113824801B (en) Intelligent integration terminal unified access management component system
CN105338037A (en) Dynamic scheduling method and system
CN109672731A (en) A kind of distributed node information monitoring method, system and application
CN110430231B (en) Management method and system for ubiquitous power Internet of things intelligent terminal APP
CN104113594A (en) Method and system for file uploading based on JS uploading assembly
CN106571943A (en) Distributed-type configuration cluster capacity-expanding method and device
CN104915291A (en) Terminal restart verification method and system
CN112491614B (en) Configuration information online automatic validation method and system for embedded equipment
Gong et al. Intelligent networking model at the edge of the power Internet of Things
CN114327556A (en) Device and method for realizing application configuration hot update based on Internet of things edge cloud cooperation
CN105490999A (en) Distributed storage system based on extensible messaging and presence protocol (XMPP)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160921

RJ01 Rejection of invention patent application after publication