[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN113805925A - Online upgrading method, device, equipment and medium for distributed cluster management software - Google Patents

Online upgrading method, device, equipment and medium for distributed cluster management software Download PDF

Info

Publication number
CN113805925A
CN113805925A CN202111138029.2A CN202111138029A CN113805925A CN 113805925 A CN113805925 A CN 113805925A CN 202111138029 A CN202111138029 A CN 202111138029A CN 113805925 A CN113805925 A CN 113805925A
Authority
CN
China
Prior art keywords
node
management software
upgrading
upgraded
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111138029.2A
Other languages
Chinese (zh)
Inventor
韩宾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Jinan data Technology Co ltd
Original Assignee
Inspur Jinan data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Jinan data Technology Co ltd filed Critical Inspur Jinan data Technology Co ltd
Priority to CN202111138029.2A priority Critical patent/CN113805925A/en
Publication of CN113805925A publication Critical patent/CN113805925A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/65Updates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • G06F8/71Version control; Configuration management

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Stored Programmes (AREA)

Abstract

The invention discloses a method, a device, equipment and a medium for online upgrading distributed cluster management software; in the scheme, before the management software of each node is upgraded, an upgrade package, a cluster environment and each node environment need to be checked, and the upgrade package, the cluster environment and each node environment are automatically repaired when the detection fails, so that the upgrade failure caused by the occurrence of problems in the upgrade process can be avoided, and the upgrade efficiency is improved; in addition, the upgrading mode of the main node can avoid the shutdown of the management software in the upgrading process, so that the normal operation of the management software is not influenced.

Description

Online upgrading method, device, equipment and medium for distributed cluster management software
Technical Field
The present invention relates to the field of software upgrading technologies, and in particular, to a method, an apparatus, a device, and a medium for online upgrading distributed cluster management software.
Background
In the internet era, distributed storage clusters carry massive computing and storage work of the internet. The cluster management software takes over the responsibility of managing the entire cluster. In a traditional upgrading mode, management software needs to be closed before upgrading, and check items before upgrading are too single, so that the whole upgrading process fails due to various problems which can be avoided before upgrading in the upgrading process. Therefore, how to improve the upgrading efficiency without affecting the function of the management software is a problem to be solved by those skilled in the art.
Disclosure of Invention
The invention aims to provide a method, a device, equipment and a medium for online upgrading of distributed cluster management software, so that the upgrading efficiency is improved on the premise of not influencing the functions of the management software.
In order to achieve the above object, the present invention provides an online upgrade method for distributed cluster management software, including:
before upgrading the management software, executing a checking task to obtain a checking result; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
judging whether a target check item which does not pass the check exists in the check result;
if yes, automatically repairing the target check item; if the slave nodes do not exist, the management software of each slave node is upgraded;
after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node; and upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node.
Wherein, the executing the checking task obtains the checking result, including:
adding the inspection task to a task manager to monitor execution progress of the inspection task through the task manager;
and executing the inspection task to obtain the inspection result of each inspection item, and displaying the inspection result of each inspection item through an interface.
When each node environment is checked, the executing of the checking task to obtain the checking result includes:
dividing nodes to be checked into different node groups;
and sequentially checking each node group, and when checking each node group, checking each node to be checked in the node group in a concurrent mode.
Wherein, the upgrading the management software of each slave node comprises:
and adding the management software upgrading task of each slave node to an online upgrading task manager, and upgrading the management software of each slave node in a concurrent mode through the online upgrading task manager.
When the management software of each slave node is upgraded, the method further comprises the following steps:
and if the management software of the slave node fails to be upgraded, returning the management software version of the slave node which fails to be upgraded to the management software version before being upgraded.
Before the original master node is upgraded, the online upgrading method further comprises the following steps:
and migrating the data information in the database of the original main node to the database of the target main node according to the updated database format.
After the upgraded original master node is used as the target standby master node of the target master node, the method further comprises the following steps:
checking each node in the distributed cluster;
if the checking result is abnormal, generating upgrading abnormal prompt information; the abnormal inspection result comprises at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service and the upgraded management software is abnormal in database.
In order to achieve the above object, the present invention further provides an online upgrade apparatus for distributed cluster management software, including:
the pre-upgrade check module is used for executing a check task to obtain a check result before the management software is upgraded; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
the judging module is used for judging whether a target checking item which cannot pass the checking exists in the checking result;
the restoration module is used for automatically restoring the target check item when the check result has the target check item which cannot pass the check;
the first upgrading module is used for upgrading the management software of each slave node when the checking result does not have a target checking item which cannot pass the checking;
the second upgrading module is used for upgrading the management software of the original standby main node after the management software of each slave node is upgraded, and the upgraded original standby main node is used as a target main node;
the IP drifting module is used for drifting the virtual IP of the management software of the original main node to the target main node;
and the third upgrading module is used for upgrading the management software of the original main node and taking the upgraded original main node as a target standby main node of the target main node.
To achieve the above object, the present invention further provides an electronic device comprising:
a memory for storing a computer program;
and the processor is used for realizing the steps of the online upgrading method of the distributed cluster management software when executing the computer program.
To achieve the above object, the present invention further provides a computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, realizes the steps of the above method for online upgrading of distributed cluster management software.
According to the above scheme, the online upgrade method for distributed cluster management software provided by the embodiment of the invention includes: before upgrading the management software, executing a checking task to obtain a checking result; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment; judging whether a target check item which fails in the check exists in the check result; if yes, automatically repairing the target check item; if the slave nodes do not exist, the management software of each slave node is upgraded; after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node; and upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node. Therefore, before the management software of each node is upgraded, the upgrade package, the cluster environment and the environment of each node need to be checked, and the nodes are automatically repaired when the detection fails, so that the problem in the upgrading process can be avoided, the upgrading failure can be avoided, and the upgrading efficiency can be improved; in addition, the upgrading mode of the main node can avoid the shutdown of the management software in the upgrading process, so that the normal operation of the management software is not influenced. The invention also discloses an online upgrading device, equipment and a medium of the distributed cluster management software, and the technical effects can be realized.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of an online upgrade method for distributed cluster management software according to an embodiment of the present invention;
FIG. 2 is a schematic view of a pre-upgrade inspection process disclosed in an embodiment of the present invention;
fig. 3 is a schematic structural diagram of an online upgrade apparatus for distributed cluster management software according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention discloses a method, a device, equipment and a medium for online upgrading distributed cluster management software, which aim to improve the upgrading efficiency on the premise of not influencing the functions of the management software.
Referring to fig. 1, a schematic flow chart of an online upgrade method for distributed cluster management software according to an embodiment of the present invention includes:
s101, before the management software is upgraded, executing a checking task to obtain a checking result; the inspection task is used for inspecting the upgrade package, the cluster environment and the environment of each node;
it should be noted that, in order to avoid the upgrade failure caused by the problem avoidable before the upgrade, in the present solution, before the upgrade, an inspection task is required to be executed, where the inspection task is used to inspect an upgrade package, a cluster environment, and each node environment; the step of checking the upgrade package refers to checking the correctness of the upgrade package, the step of checking the cluster environment refers to checking the service state, the service version, the network state and the like, and the step of checking the node environment refers to checking the hardware version, the software version, the hardware matching degree and the like of the node; such as: calculating an MD5 value of the upgrade package through an MD5(Message-Digest Algorithm 5) information Digest Algorithm, comparing the MD5 value with an original MD5 value of the upgrade package, if the MD5 value is the same as the original MD5 value of the upgrade package, indicating that the upgrade package is correct, verifying that the upgrade package passes, and then checking the next check item to obtain a final check result; the final inspection result includes the inspection results of the respective inspection items.
It is understood that, before upgrading, the user can customize the setting of the check items and the check item threshold through the interface, such as: if the check entry is the remaining memory space and the threshold of the check entry is 80%, the detection is not passed if the remaining memory space is less than 80% during the check of the remaining memory space. This kind of mode through interface setting, the flexibility that multiplicable inspection item set up promotes the richness of inspection content, and convenience of customers sets up and looks over, improves user and uses experience, improves upgrading efficiency.
S102, judging whether a target check item which cannot pass the check exists in the check result; if yes, executing S103; if not, executing S104;
s103, automatically repairing the target inspection item, and continuously executing S102;
specifically, when the inspection task is executed to obtain the inspection result, the inspection task can be added to the task manager, and the execution progress of the inspection task is monitored through the task manager; and executing the inspection task to obtain the inspection result of each inspection item, and displaying the inspection result of each inspection item through an interface. That is to say: after the inspection task is added into the task manager, the execution progress of the inspection task can be monitored by the task manager, wherein: the monitoring of the execution progress of the inspection task comprises the following aspects: abnormal condition processing, breakpoint execution, automatic task pull-up and the like; the abnormal condition processing means that automatic repair is carried out according to the recommendation of a repair system aiming at the inspection items which are not passed by the inspection, such as: if the check item of the residual memory space does not pass, the residual memory space can be improved in a memory cleaning mode, so that the aim of automatic repair is fulfilled; the breakpoint execution and task automatic pull-up are as follows: when the cluster is powered off or the background task is interrupted due to other conditions, the unexecuted checking task can be automatically pulled up to continue to be executed after the cluster is recovered. The task manager can also display the inspection progress and the inspection result through an interface, such as: and outputting and displaying the upgrade package inspection result, the cluster environment inspection result and each node inspection result, displaying the inspection failed items in red font, and repairing the inspection failed items through a system or manually.
In this embodiment, for an ultra-large scale cluster, a packet inspection method may be adopted to improve inspection efficiency. Specifically, when each node environment is inspected, the node to be inspected can be divided into different node groups, each node group is inspected in sequence, and when each node group is inspected, each node to be inspected in the node group is inspected in a concurrent mode. Namely: the node checking flow in the scheme is as follows: the method has the advantages that the groups are executed in series, and the groups are concurrent, so that the inspection efficiency is improved, and meanwhile, the consumption of cluster resources is reduced. Referring to fig. 2, it can be seen from fig. 2 that in the present disclosure, during inspection, first, an inspection item and an inspection item threshold are set, then an inspection task is added to a task manager, the task manager executes upgrade package inspection, cluster environment inspection, and node environment inspection, when executing node environment inspection, first, nodes need to be grouped, then node environment inspection instructions are issued according to groups, and if all node groups execute the inspection task, an inspection result is recorded, and inspection before upgrade is completed.
It should be noted that, in the present solution, only if all the check items in the check result before the upgrade pass the check, the upgrade of the management software is allowed, and if there is a case that the target check item fails to pass the check, at this time, even if the upgrade is directly performed, the upgrade may fail due to an abnormal condition, so that when there is a target check item that fails to pass the check in the check result, the upgrade is not allowed. In addition, according to the scheme, after the examination is finished before the upgrade, a user can download the examination report, and can select to generate word and html reports during the downloading, so that the information is conveniently filed.
S104, upgrading the management software of each slave node;
it should be noted that, in the process of online upgrading management software, the management software upgrade tasks of the slave nodes need to be added to the online upgrade task manager, and the management software of the slave nodes is upgraded in a concurrent manner through the online upgrade task manager. Namely: according to the scheme, firstly, a slave node list is required to be obtained, a slave node upgrading task is added into an online upgrading task manager, and the online upgrading task manager sends the upgrading task to each slave node and receives an upgrading result. According to the scheme, when the slave node is upgraded through the online upgrade task manager, each upgrade task can be managed and monitored. The online upgrade task manager can monitor the use condition of the cluster resources in real time, and can ensure that the node upgrade tasks are executed maximally and parallelly under the condition of not excessively consuming the cluster resources when controlling all the slave nodes to execute the upgrade tasks concurrently; the online upgrade task manager can also implement a breakpoint execution function, that is: when the cluster is powered off or the background task is interrupted due to other conditions, the upgrading task which is not completed by execution can be automatically pulled up to be continuously executed after the cluster is recovered.
The online upgrade task manager may also implement a version rollback function, that is: and if the upgrade failure of the management software of the slave node is detected, returning the management software version of the slave node with the upgrade failure to the management software version before the upgrade, and reporting the upgrade failure information. When the management software is upgraded, each slave node can perform exception handling through the exception handling module if the execution exception condition is met, the upgrade is continuously performed if the processing is successful, the node upgrade is quitted if the processing is failed, and the exception information is reported. Such as: in the upgrading process, the upgrading is failed due to the occurrence of network abnormality, automatic repair can be performed at the moment, if the repair is successful, the upgrading is continued, and if the upgrading is failed, the abnormal information is reported.
S105, after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node;
and S106, upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node.
In this embodiment, after the management software of each slave node is upgraded, the master node needs to be upgraded, where the master node in this scheme includes an original master node and an original standby master node, the original master node is a virtual IP node, that is, a node where the virtual IP of the management software is located, and the original standby master node is a master node without a virtual IP mounted thereon. In order to ensure the normal operation of management software, the original standby main node can be upgraded to obtain a target main node, the virtual IP of the management software of the original main node is drifted to the target main node which is upgraded successfully, then the original main node is upgraded, and the original main node which is upgraded successfully is used as the target standby main node of the target main node. The process is specifically described here, and includes two parts, namely, backup master node upgrade and master node upgrade, where the backup master node upgrade process is as follows: removing the standby main nodes to be upgraded from the main node queue, adding the standby main nodes to the newly-built main node queue, then executing the upgrade of the standby main nodes, wherein the upgrade of all the standby main nodes is serial upgrade until all the standby main nodes are upgraded; the upgrading process of the main node comprises the following steps: and drifting the virtual IP node to the standby main node which is upgraded, wherein the standby main node becomes the node where the virtual IP of the new management software is located, then executing the upgrade of the original main node, and adding the original main node into a standby node queue after the upgrade of the original main node is finished.
It should be noted that, in the present solution, before the original master node is upgraded, the data information in the database of the original master node needs to be migrated to the database of the target master node according to the upgraded database format. Therefore, according to the database data migration mode, before the node where the virtual IP is located is upgraded, the database information on the node where the virtual IP is located can be migrated to the database of the new node where the virtual IP is located according to the upgraded database format, and the upgraded target host node can directly access the data in the database.
Furthermore, after the upgrade is finished, each node in the distributed cluster needs to be checked; if the checking result is abnormal, generating upgrading abnormal prompt information; the abnormal detection result includes at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service, the database is abnormal and the like. And then generating a final upgrade report, wherein the upgrade report can be an html upgrade report, the detailed upgrade process and the inspection result are recorded in the report, and the report has portability and is convenient for downloading and archiving.
In conclusion, according to the scheme, a user can automatically set the inspection items and the inspection item threshold value through the interface before upgrading, upgrading inspection contents are rich, and upgrading efficiency is improved; aiming at the super-large scale cluster, a grouping inspection mode can be adopted, and high concurrent execution capacity is provided while the occupation of cluster resources is reduced. In the checking process, the automatic repairing function is also provided, and the repairing processing is carried out aiming at the abnormal executable of foreseeable or known reasons, so as to submit the checking efficiency; the scheme can also realize the online upgrade of the management software in a virtual IP drift mode, and realize the self upgrade of the management software under the condition of ensuring the page function. The online upgrade task manager can also realize functions of upgrade scheduling, breakpoint execution, version rollback and the like under a large-scale cluster, and can efficiently execute a managed software upgrade task under the condition of ensuring that the original cluster service is not influenced.
The upgrade apparatus, device, and medium provided in the embodiments of the present invention are introduced below, and the upgrade apparatus, device, and medium described below and the upgrade method described above may be referred to each other.
Referring to fig. 3, a schematic structural diagram of an online upgrade apparatus for distributed cluster management software provided in an embodiment of the present invention includes:
the pre-upgrade check module 11 is configured to execute a check task to obtain a check result before the management software is upgraded; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
a judging module 12, configured to judge whether a target inspection item failing to be inspected exists in the inspection result;
a repairing module 13, configured to automatically repair a target inspection item when the inspection result has the target inspection item that fails to be inspected;
the first upgrading module 14 is configured to upgrade the management software of each slave node when the checking result does not have a target checking item for which checking fails;
the second upgrading module 15 is configured to upgrade the management software of the original standby master node after the management software of each slave node is upgraded, and use the upgraded original standby master node as a target master node;
the IP drifting module 16 is used for drifting the virtual IP of the management software of the original main node to the target main node;
and a third upgrading module 17, configured to upgrade the management software of the original master node, and use the upgraded original master node as a target standby master node of the target master node.
Wherein the pre-upgrade check module comprises:
the monitoring unit is used for adding the inspection task to a task manager so as to monitor the execution progress of the inspection task through the task manager;
the execution unit is used for executing the inspection task to obtain the inspection result of each inspection item;
and the display unit is used for displaying the inspection result of each inspection item through an interface.
Wherein the pre-upgrade check module comprises:
the dividing unit is used for dividing the nodes to be checked into different node groups;
and the checking unit is used for checking each node group in sequence, and when checking each node group, all nodes to be checked in the node group are checked in a concurrent mode.
The first upgrading module is specifically configured to: and adding the management software upgrading task of each slave node to an online upgrading task manager, and upgrading the management software of each slave node in a concurrent mode through the online upgrading task manager.
Wherein the first upgrade module is further configured to: and if the management software of the slave node fails to be upgraded, returning the management software version of the slave node which fails to be upgraded to the management software version before being upgraded.
Wherein, the device still includes:
and the data migration module is used for migrating the data information in the database of the original main node to the database of the target main node according to the updated database format before the original main node is updated.
Wherein, the device still includes:
the upgraded inspection module is used for inspecting each node in the distributed cluster after the upgraded original main node is used as a target standby main node of the target main node;
the information generation module is used for generating upgrading abnormity prompt information when the inspection result is abnormal; the abnormal inspection result comprises at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service and the upgraded management software is abnormal in database.
Referring to fig. 4, an electronic device according to an embodiment of the present invention includes:
a memory 21 for storing a computer program;
the processor 22 is configured to implement the steps of the online upgrade method for distributed cluster management software according to the foregoing method embodiments when executing the computer program.
In this embodiment, the device may be a PC (Personal Computer), or may be a terminal device such as a smart phone, a tablet Computer, a palmtop Computer, or a portable Computer.
The device may include a memory 21, a processor 22, and a bus 23.
The memory 21 includes at least one type of readable storage medium, which includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 21 may in some embodiments be an internal storage unit of the device, for example a hard disk of the device. The memory 21 may also be an external storage device of the device in other embodiments, such as a plug-in hard disk, Smart Media Card (SMC), Secure Digital (SD) Card, Flash memory Card (Flash Card), etc. provided on the device. Further, the memory 21 may also include both an internal storage unit of the device and an external storage device. The memory 21 may be used not only to store application software installed in the device and various types of data such as program codes for performing an upgrade method, etc., but also to temporarily store data that has been output or is to be output.
Processor 22, which in some embodiments may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor or other data Processing chip, executes program code or processes data stored in memory 21, such as program code for performing upgrade methods.
The bus 23 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 4, but this does not indicate only one bus or one type of bus.
Further, the device may further include a network interface 24, and the network interface 24 may optionally include a wired interface and/or a wireless interface (e.g., WI-FI interface, bluetooth interface, etc.), which are generally used to establish a communication connection between the device and other electronic devices.
Optionally, the device may further comprise a user interface 25, the user interface 25 may comprise a Display (Display), an input unit such as a Keyboard (Keyboard), and the optional user interface 25 may also comprise a standard wired interface, a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or display unit, is suitable for displaying information processed in the device and for displaying a visualized user interface.
Fig. 4 shows only the device with the components 21-25, and it will be understood by those skilled in the art that the structure shown in fig. 4 does not constitute a limitation of the device, and may comprise fewer or more components than those shown, or some components may be combined, or a different arrangement of components.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the online upgrade method for distributed cluster management software described in the above method embodiment are implemented.
Wherein the storage medium may include: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
In summary, in the scheme, an online upgrade scheme for large-scale distributed cluster management software is provided, which aims to quickly and simply complete the version upgrade of the cluster management software on the premise of not influencing the management software and the cluster function; specifically, before the management software is upgraded, the upgrade package, the cluster environment and each node environment can be checked, the check result is output, the check item which cannot pass the check is repaired, and the upgrade failure caused by avoidable problems in the upgrade process is avoided; when the management software is upgraded, the management software can be upgraded without interrupting the service of the management software, and after the management software is upgraded, the upgraded cluster environment can be checked to judge whether the cluster nodes are upgraded successfully or not and output abnormal cluster contents.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. An online upgrade method for distributed cluster management software is characterized by comprising the following steps:
before upgrading the management software, executing a checking task to obtain a checking result; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
judging whether a target check item which does not pass the check exists in the check result;
if yes, automatically repairing the target check item; if the slave nodes do not exist, the management software of each slave node is upgraded;
after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node; and upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node.
2. The online upgrade method according to claim 1, wherein the performing of the inspection task obtains an inspection result, including:
adding the inspection task to a task manager to monitor execution progress of the inspection task through the task manager;
and executing the inspection task to obtain the inspection result of each inspection item, and displaying the inspection result of each inspection item through an interface.
3. The online upgrade method according to claim 1, wherein, when checking the node environments, the performing the check task to obtain the check result includes:
dividing nodes to be checked into different node groups;
and sequentially checking each node group, and when checking each node group, checking each node to be checked in the node group in a concurrent mode.
4. The online upgrading method according to claim 1, wherein the upgrading the management software of each slave node includes:
and adding the management software upgrading task of each slave node to an online upgrading task manager, and upgrading the management software of each slave node in a concurrent mode through the online upgrading task manager.
5. The online upgrade method according to claim 4, wherein, when upgrading the management software of each slave node, the method further comprises:
and if the management software of the slave node fails to be upgraded, returning the management software version of the slave node which fails to be upgraded to the management software version before being upgraded.
6. The online upgrade method according to claim 1, wherein before the upgrade of the original master node, the online upgrade method further comprises:
and migrating the data information in the database of the original main node to the database of the target main node according to the updated database format.
7. The online upgrade method according to any one of claims 1 to 6, wherein after the upgraded original master node is used as a target standby master node of the target master node, the method further includes:
checking each node in the distributed cluster;
if the checking result is abnormal, generating upgrading abnormal prompt information; the abnormal inspection result comprises at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service and the upgraded management software is abnormal in database.
8. An online upgrade device for distributed cluster management software, comprising:
the pre-upgrade check module is used for executing a check task to obtain a check result before the management software is upgraded; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
the judging module is used for judging whether a target checking item which cannot pass the checking exists in the checking result;
the restoration module is used for automatically restoring the target check item when the check result has the target check item which cannot pass the check;
the first upgrading module is used for upgrading the management software of each slave node when the checking result does not have a target checking item which cannot pass the checking;
the second upgrading module is used for upgrading the management software of the original standby main node after the management software of each slave node is upgraded, and the upgraded original standby main node is used as a target main node;
the IP drifting module is used for drifting the virtual IP of the management software of the original main node to the target main node;
and the third upgrading module is used for upgrading the management software of the original main node and taking the upgraded original main node as a target standby main node of the target main node.
9. An electronic device, comprising:
a memory for storing a computer program;
a processor for implementing the steps of the method for online upgrade of distributed cluster management software according to any of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, characterized in that a computer program is stored thereon, which computer program, when being executed by a processor, carries out the steps of the method for online upgrade of distributed cluster management software according to any of the claims 1 to 7.
CN202111138029.2A 2021-09-27 2021-09-27 Online upgrading method, device, equipment and medium for distributed cluster management software Pending CN113805925A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111138029.2A CN113805925A (en) 2021-09-27 2021-09-27 Online upgrading method, device, equipment and medium for distributed cluster management software

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111138029.2A CN113805925A (en) 2021-09-27 2021-09-27 Online upgrading method, device, equipment and medium for distributed cluster management software

Publications (1)

Publication Number Publication Date
CN113805925A true CN113805925A (en) 2021-12-17

Family

ID=78938745

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111138029.2A Pending CN113805925A (en) 2021-09-27 2021-09-27 Online upgrading method, device, equipment and medium for distributed cluster management software

Country Status (1)

Country Link
CN (1) CN113805925A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114500525A (en) * 2021-12-24 2022-05-13 天翼云科技有限公司 Method, device, computer equipment and medium for updating nodes in distributed system
CN114640586A (en) * 2022-05-19 2022-06-17 飞狐信息技术(天津)有限公司 Cluster version upgrading method and device, storage medium and equipment
CN117422418A (en) * 2023-11-07 2024-01-19 深圳市鹏中科技有限公司 Method, device, equipment and storage medium for developing operation link visualization

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107193611A (en) * 2017-05-31 2017-09-22 郑州云海信息技术有限公司 The method for upgrading software and system of a kind of group system
CN108667641A (en) * 2017-03-30 2018-10-16 杭州海康威视数字技术股份有限公司 Distributed system upgrade method, distributed system and updating and management equipment
CN108829420A (en) * 2018-06-12 2018-11-16 郑州云海信息技术有限公司 A kind of distributed memory system method for upgrading software based on RPM packet
CN109101370A (en) * 2018-08-30 2018-12-28 重庆富民银行股份有限公司 The date storage method of principal and subordinate's bi-directional synchronization
CN110879718A (en) * 2019-11-15 2020-03-13 北京浪潮数据技术有限公司 maridb upgrading method and device, electronic equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108667641A (en) * 2017-03-30 2018-10-16 杭州海康威视数字技术股份有限公司 Distributed system upgrade method, distributed system and updating and management equipment
CN107193611A (en) * 2017-05-31 2017-09-22 郑州云海信息技术有限公司 The method for upgrading software and system of a kind of group system
CN108829420A (en) * 2018-06-12 2018-11-16 郑州云海信息技术有限公司 A kind of distributed memory system method for upgrading software based on RPM packet
CN109101370A (en) * 2018-08-30 2018-12-28 重庆富民银行股份有限公司 The date storage method of principal and subordinate's bi-directional synchronization
CN110879718A (en) * 2019-11-15 2020-03-13 北京浪潮数据技术有限公司 maridb upgrading method and device, electronic equipment and storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114500525A (en) * 2021-12-24 2022-05-13 天翼云科技有限公司 Method, device, computer equipment and medium for updating nodes in distributed system
CN114500525B (en) * 2021-12-24 2024-04-26 天翼云科技有限公司 Method, device, computer equipment and medium for updating nodes in distributed system
CN114640586A (en) * 2022-05-19 2022-06-17 飞狐信息技术(天津)有限公司 Cluster version upgrading method and device, storage medium and equipment
CN117422418A (en) * 2023-11-07 2024-01-19 深圳市鹏中科技有限公司 Method, device, equipment and storage medium for developing operation link visualization

Similar Documents

Publication Publication Date Title
KR102493449B1 (en) Edge computing test methods, devices, electronic devices and computer-readable media
CN113805925A (en) Online upgrading method, device, equipment and medium for distributed cluster management software
US20120159443A1 (en) System and method for reducing test effort by object risk analysis
US10042744B2 (en) Adopting an existing automation script to a new framework
US10795793B1 (en) Method and system for simulating system failures using domain-specific language constructs
CN111338656A (en) Method and device for installing software package to target host and computer equipment
CN110851471A (en) Distributed log data processing method, device and system
CN112256593B (en) Program processing method and device, computer equipment and readable storage medium
CN111417019A (en) Method and device for processing plug-in abnormity, computer equipment and storage medium
CN111694684A (en) Abnormal construction method and device of storage equipment, electronic equipment and storage medium
CN114116330A (en) Server performance test method, system, terminal and storage medium
CN109002317B (en) PCBA firmware upgrading method and system and PCBA
CN115757099A (en) Automatic test method and device for platform firmware protection recovery function
CN106528248B (en) Downloading method and system of mobile terminal based on software integrity
JP2023044720A (en) Computer implemented method for recovering crashed application, computer program product, and remote computer server (remote recovery of crashed process)
CN115729679A (en) Task processing method and device, computer readable storage medium and electronic device
CN114124835A (en) Interface-based data transmission method, device, equipment and medium
CN113505079A (en) Application program testing method, device, equipment and medium
JP2012181737A (en) Computer system
CN112379973A (en) Heavy loading method and device
CN114327588A (en) Method and device for processing code submission log
Lim et al. TAOS-CI: lightweight & modular continuous integration system for edge computing
CN117971564B (en) Data recovery method, device, computer equipment and storage medium
CN110908725A (en) Application program starting method and device, electronic equipment and readable medium
US20240330477A1 (en) System and method for evaluating active backups using penetration testing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination