CN113805925A - Online upgrading method, device, equipment and medium for distributed cluster management software - Google Patents
Online upgrading method, device, equipment and medium for distributed cluster management software Download PDFInfo
- Publication number
- CN113805925A CN113805925A CN202111138029.2A CN202111138029A CN113805925A CN 113805925 A CN113805925 A CN 113805925A CN 202111138029 A CN202111138029 A CN 202111138029A CN 113805925 A CN113805925 A CN 113805925A
- Authority
- CN
- China
- Prior art keywords
- node
- management software
- upgrading
- upgraded
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 52
- 238000007689 inspection Methods 0.000 claims description 90
- 230000002159 abnormal effect Effects 0.000 claims description 25
- 238000004590 computer program Methods 0.000 claims description 11
- 230000008569 process Effects 0.000 abstract description 16
- 238000001514 detection method Methods 0.000 abstract description 4
- 230000006870 function Effects 0.000 description 9
- 238000012545 processing Methods 0.000 description 7
- 230000008439 repair process Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 3
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/60—Software deployment
- G06F8/65—Updates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
- G06F8/71—Version control; Configuration management
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Stored Programmes (AREA)
Abstract
The invention discloses a method, a device, equipment and a medium for online upgrading distributed cluster management software; in the scheme, before the management software of each node is upgraded, an upgrade package, a cluster environment and each node environment need to be checked, and the upgrade package, the cluster environment and each node environment are automatically repaired when the detection fails, so that the upgrade failure caused by the occurrence of problems in the upgrade process can be avoided, and the upgrade efficiency is improved; in addition, the upgrading mode of the main node can avoid the shutdown of the management software in the upgrading process, so that the normal operation of the management software is not influenced.
Description
Technical Field
The present invention relates to the field of software upgrading technologies, and in particular, to a method, an apparatus, a device, and a medium for online upgrading distributed cluster management software.
Background
In the internet era, distributed storage clusters carry massive computing and storage work of the internet. The cluster management software takes over the responsibility of managing the entire cluster. In a traditional upgrading mode, management software needs to be closed before upgrading, and check items before upgrading are too single, so that the whole upgrading process fails due to various problems which can be avoided before upgrading in the upgrading process. Therefore, how to improve the upgrading efficiency without affecting the function of the management software is a problem to be solved by those skilled in the art.
Disclosure of Invention
The invention aims to provide a method, a device, equipment and a medium for online upgrading of distributed cluster management software, so that the upgrading efficiency is improved on the premise of not influencing the functions of the management software.
In order to achieve the above object, the present invention provides an online upgrade method for distributed cluster management software, including:
before upgrading the management software, executing a checking task to obtain a checking result; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
judging whether a target check item which does not pass the check exists in the check result;
if yes, automatically repairing the target check item; if the slave nodes do not exist, the management software of each slave node is upgraded;
after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node; and upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node.
Wherein, the executing the checking task obtains the checking result, including:
adding the inspection task to a task manager to monitor execution progress of the inspection task through the task manager;
and executing the inspection task to obtain the inspection result of each inspection item, and displaying the inspection result of each inspection item through an interface.
When each node environment is checked, the executing of the checking task to obtain the checking result includes:
dividing nodes to be checked into different node groups;
and sequentially checking each node group, and when checking each node group, checking each node to be checked in the node group in a concurrent mode.
Wherein, the upgrading the management software of each slave node comprises:
and adding the management software upgrading task of each slave node to an online upgrading task manager, and upgrading the management software of each slave node in a concurrent mode through the online upgrading task manager.
When the management software of each slave node is upgraded, the method further comprises the following steps:
and if the management software of the slave node fails to be upgraded, returning the management software version of the slave node which fails to be upgraded to the management software version before being upgraded.
Before the original master node is upgraded, the online upgrading method further comprises the following steps:
and migrating the data information in the database of the original main node to the database of the target main node according to the updated database format.
After the upgraded original master node is used as the target standby master node of the target master node, the method further comprises the following steps:
checking each node in the distributed cluster;
if the checking result is abnormal, generating upgrading abnormal prompt information; the abnormal inspection result comprises at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service and the upgraded management software is abnormal in database.
In order to achieve the above object, the present invention further provides an online upgrade apparatus for distributed cluster management software, including:
the pre-upgrade check module is used for executing a check task to obtain a check result before the management software is upgraded; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
the judging module is used for judging whether a target checking item which cannot pass the checking exists in the checking result;
the restoration module is used for automatically restoring the target check item when the check result has the target check item which cannot pass the check;
the first upgrading module is used for upgrading the management software of each slave node when the checking result does not have a target checking item which cannot pass the checking;
the second upgrading module is used for upgrading the management software of the original standby main node after the management software of each slave node is upgraded, and the upgraded original standby main node is used as a target main node;
the IP drifting module is used for drifting the virtual IP of the management software of the original main node to the target main node;
and the third upgrading module is used for upgrading the management software of the original main node and taking the upgraded original main node as a target standby main node of the target main node.
To achieve the above object, the present invention further provides an electronic device comprising:
a memory for storing a computer program;
and the processor is used for realizing the steps of the online upgrading method of the distributed cluster management software when executing the computer program.
To achieve the above object, the present invention further provides a computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, realizes the steps of the above method for online upgrading of distributed cluster management software.
According to the above scheme, the online upgrade method for distributed cluster management software provided by the embodiment of the invention includes: before upgrading the management software, executing a checking task to obtain a checking result; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment; judging whether a target check item which fails in the check exists in the check result; if yes, automatically repairing the target check item; if the slave nodes do not exist, the management software of each slave node is upgraded; after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node; and upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node. Therefore, before the management software of each node is upgraded, the upgrade package, the cluster environment and the environment of each node need to be checked, and the nodes are automatically repaired when the detection fails, so that the problem in the upgrading process can be avoided, the upgrading failure can be avoided, and the upgrading efficiency can be improved; in addition, the upgrading mode of the main node can avoid the shutdown of the management software in the upgrading process, so that the normal operation of the management software is not influenced. The invention also discloses an online upgrading device, equipment and a medium of the distributed cluster management software, and the technical effects can be realized.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of an online upgrade method for distributed cluster management software according to an embodiment of the present invention;
FIG. 2 is a schematic view of a pre-upgrade inspection process disclosed in an embodiment of the present invention;
fig. 3 is a schematic structural diagram of an online upgrade apparatus for distributed cluster management software according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention discloses a method, a device, equipment and a medium for online upgrading distributed cluster management software, which aim to improve the upgrading efficiency on the premise of not influencing the functions of the management software.
Referring to fig. 1, a schematic flow chart of an online upgrade method for distributed cluster management software according to an embodiment of the present invention includes:
s101, before the management software is upgraded, executing a checking task to obtain a checking result; the inspection task is used for inspecting the upgrade package, the cluster environment and the environment of each node;
it should be noted that, in order to avoid the upgrade failure caused by the problem avoidable before the upgrade, in the present solution, before the upgrade, an inspection task is required to be executed, where the inspection task is used to inspect an upgrade package, a cluster environment, and each node environment; the step of checking the upgrade package refers to checking the correctness of the upgrade package, the step of checking the cluster environment refers to checking the service state, the service version, the network state and the like, and the step of checking the node environment refers to checking the hardware version, the software version, the hardware matching degree and the like of the node; such as: calculating an MD5 value of the upgrade package through an MD5(Message-Digest Algorithm 5) information Digest Algorithm, comparing the MD5 value with an original MD5 value of the upgrade package, if the MD5 value is the same as the original MD5 value of the upgrade package, indicating that the upgrade package is correct, verifying that the upgrade package passes, and then checking the next check item to obtain a final check result; the final inspection result includes the inspection results of the respective inspection items.
It is understood that, before upgrading, the user can customize the setting of the check items and the check item threshold through the interface, such as: if the check entry is the remaining memory space and the threshold of the check entry is 80%, the detection is not passed if the remaining memory space is less than 80% during the check of the remaining memory space. This kind of mode through interface setting, the flexibility that multiplicable inspection item set up promotes the richness of inspection content, and convenience of customers sets up and looks over, improves user and uses experience, improves upgrading efficiency.
S102, judging whether a target check item which cannot pass the check exists in the check result; if yes, executing S103; if not, executing S104;
s103, automatically repairing the target inspection item, and continuously executing S102;
specifically, when the inspection task is executed to obtain the inspection result, the inspection task can be added to the task manager, and the execution progress of the inspection task is monitored through the task manager; and executing the inspection task to obtain the inspection result of each inspection item, and displaying the inspection result of each inspection item through an interface. That is to say: after the inspection task is added into the task manager, the execution progress of the inspection task can be monitored by the task manager, wherein: the monitoring of the execution progress of the inspection task comprises the following aspects: abnormal condition processing, breakpoint execution, automatic task pull-up and the like; the abnormal condition processing means that automatic repair is carried out according to the recommendation of a repair system aiming at the inspection items which are not passed by the inspection, such as: if the check item of the residual memory space does not pass, the residual memory space can be improved in a memory cleaning mode, so that the aim of automatic repair is fulfilled; the breakpoint execution and task automatic pull-up are as follows: when the cluster is powered off or the background task is interrupted due to other conditions, the unexecuted checking task can be automatically pulled up to continue to be executed after the cluster is recovered. The task manager can also display the inspection progress and the inspection result through an interface, such as: and outputting and displaying the upgrade package inspection result, the cluster environment inspection result and each node inspection result, displaying the inspection failed items in red font, and repairing the inspection failed items through a system or manually.
In this embodiment, for an ultra-large scale cluster, a packet inspection method may be adopted to improve inspection efficiency. Specifically, when each node environment is inspected, the node to be inspected can be divided into different node groups, each node group is inspected in sequence, and when each node group is inspected, each node to be inspected in the node group is inspected in a concurrent mode. Namely: the node checking flow in the scheme is as follows: the method has the advantages that the groups are executed in series, and the groups are concurrent, so that the inspection efficiency is improved, and meanwhile, the consumption of cluster resources is reduced. Referring to fig. 2, it can be seen from fig. 2 that in the present disclosure, during inspection, first, an inspection item and an inspection item threshold are set, then an inspection task is added to a task manager, the task manager executes upgrade package inspection, cluster environment inspection, and node environment inspection, when executing node environment inspection, first, nodes need to be grouped, then node environment inspection instructions are issued according to groups, and if all node groups execute the inspection task, an inspection result is recorded, and inspection before upgrade is completed.
It should be noted that, in the present solution, only if all the check items in the check result before the upgrade pass the check, the upgrade of the management software is allowed, and if there is a case that the target check item fails to pass the check, at this time, even if the upgrade is directly performed, the upgrade may fail due to an abnormal condition, so that when there is a target check item that fails to pass the check in the check result, the upgrade is not allowed. In addition, according to the scheme, after the examination is finished before the upgrade, a user can download the examination report, and can select to generate word and html reports during the downloading, so that the information is conveniently filed.
S104, upgrading the management software of each slave node;
it should be noted that, in the process of online upgrading management software, the management software upgrade tasks of the slave nodes need to be added to the online upgrade task manager, and the management software of the slave nodes is upgraded in a concurrent manner through the online upgrade task manager. Namely: according to the scheme, firstly, a slave node list is required to be obtained, a slave node upgrading task is added into an online upgrading task manager, and the online upgrading task manager sends the upgrading task to each slave node and receives an upgrading result. According to the scheme, when the slave node is upgraded through the online upgrade task manager, each upgrade task can be managed and monitored. The online upgrade task manager can monitor the use condition of the cluster resources in real time, and can ensure that the node upgrade tasks are executed maximally and parallelly under the condition of not excessively consuming the cluster resources when controlling all the slave nodes to execute the upgrade tasks concurrently; the online upgrade task manager can also implement a breakpoint execution function, that is: when the cluster is powered off or the background task is interrupted due to other conditions, the upgrading task which is not completed by execution can be automatically pulled up to be continuously executed after the cluster is recovered.
The online upgrade task manager may also implement a version rollback function, that is: and if the upgrade failure of the management software of the slave node is detected, returning the management software version of the slave node with the upgrade failure to the management software version before the upgrade, and reporting the upgrade failure information. When the management software is upgraded, each slave node can perform exception handling through the exception handling module if the execution exception condition is met, the upgrade is continuously performed if the processing is successful, the node upgrade is quitted if the processing is failed, and the exception information is reported. Such as: in the upgrading process, the upgrading is failed due to the occurrence of network abnormality, automatic repair can be performed at the moment, if the repair is successful, the upgrading is continued, and if the upgrading is failed, the abnormal information is reported.
S105, after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node;
and S106, upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node.
In this embodiment, after the management software of each slave node is upgraded, the master node needs to be upgraded, where the master node in this scheme includes an original master node and an original standby master node, the original master node is a virtual IP node, that is, a node where the virtual IP of the management software is located, and the original standby master node is a master node without a virtual IP mounted thereon. In order to ensure the normal operation of management software, the original standby main node can be upgraded to obtain a target main node, the virtual IP of the management software of the original main node is drifted to the target main node which is upgraded successfully, then the original main node is upgraded, and the original main node which is upgraded successfully is used as the target standby main node of the target main node. The process is specifically described here, and includes two parts, namely, backup master node upgrade and master node upgrade, where the backup master node upgrade process is as follows: removing the standby main nodes to be upgraded from the main node queue, adding the standby main nodes to the newly-built main node queue, then executing the upgrade of the standby main nodes, wherein the upgrade of all the standby main nodes is serial upgrade until all the standby main nodes are upgraded; the upgrading process of the main node comprises the following steps: and drifting the virtual IP node to the standby main node which is upgraded, wherein the standby main node becomes the node where the virtual IP of the new management software is located, then executing the upgrade of the original main node, and adding the original main node into a standby node queue after the upgrade of the original main node is finished.
It should be noted that, in the present solution, before the original master node is upgraded, the data information in the database of the original master node needs to be migrated to the database of the target master node according to the upgraded database format. Therefore, according to the database data migration mode, before the node where the virtual IP is located is upgraded, the database information on the node where the virtual IP is located can be migrated to the database of the new node where the virtual IP is located according to the upgraded database format, and the upgraded target host node can directly access the data in the database.
Furthermore, after the upgrade is finished, each node in the distributed cluster needs to be checked; if the checking result is abnormal, generating upgrading abnormal prompt information; the abnormal detection result includes at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service, the database is abnormal and the like. And then generating a final upgrade report, wherein the upgrade report can be an html upgrade report, the detailed upgrade process and the inspection result are recorded in the report, and the report has portability and is convenient for downloading and archiving.
In conclusion, according to the scheme, a user can automatically set the inspection items and the inspection item threshold value through the interface before upgrading, upgrading inspection contents are rich, and upgrading efficiency is improved; aiming at the super-large scale cluster, a grouping inspection mode can be adopted, and high concurrent execution capacity is provided while the occupation of cluster resources is reduced. In the checking process, the automatic repairing function is also provided, and the repairing processing is carried out aiming at the abnormal executable of foreseeable or known reasons, so as to submit the checking efficiency; the scheme can also realize the online upgrade of the management software in a virtual IP drift mode, and realize the self upgrade of the management software under the condition of ensuring the page function. The online upgrade task manager can also realize functions of upgrade scheduling, breakpoint execution, version rollback and the like under a large-scale cluster, and can efficiently execute a managed software upgrade task under the condition of ensuring that the original cluster service is not influenced.
The upgrade apparatus, device, and medium provided in the embodiments of the present invention are introduced below, and the upgrade apparatus, device, and medium described below and the upgrade method described above may be referred to each other.
Referring to fig. 3, a schematic structural diagram of an online upgrade apparatus for distributed cluster management software provided in an embodiment of the present invention includes:
the pre-upgrade check module 11 is configured to execute a check task to obtain a check result before the management software is upgraded; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
a judging module 12, configured to judge whether a target inspection item failing to be inspected exists in the inspection result;
a repairing module 13, configured to automatically repair a target inspection item when the inspection result has the target inspection item that fails to be inspected;
the first upgrading module 14 is configured to upgrade the management software of each slave node when the checking result does not have a target checking item for which checking fails;
the second upgrading module 15 is configured to upgrade the management software of the original standby master node after the management software of each slave node is upgraded, and use the upgraded original standby master node as a target master node;
the IP drifting module 16 is used for drifting the virtual IP of the management software of the original main node to the target main node;
and a third upgrading module 17, configured to upgrade the management software of the original master node, and use the upgraded original master node as a target standby master node of the target master node.
Wherein the pre-upgrade check module comprises:
the monitoring unit is used for adding the inspection task to a task manager so as to monitor the execution progress of the inspection task through the task manager;
the execution unit is used for executing the inspection task to obtain the inspection result of each inspection item;
and the display unit is used for displaying the inspection result of each inspection item through an interface.
Wherein the pre-upgrade check module comprises:
the dividing unit is used for dividing the nodes to be checked into different node groups;
and the checking unit is used for checking each node group in sequence, and when checking each node group, all nodes to be checked in the node group are checked in a concurrent mode.
The first upgrading module is specifically configured to: and adding the management software upgrading task of each slave node to an online upgrading task manager, and upgrading the management software of each slave node in a concurrent mode through the online upgrading task manager.
Wherein the first upgrade module is further configured to: and if the management software of the slave node fails to be upgraded, returning the management software version of the slave node which fails to be upgraded to the management software version before being upgraded.
Wherein, the device still includes:
and the data migration module is used for migrating the data information in the database of the original main node to the database of the target main node according to the updated database format before the original main node is updated.
Wherein, the device still includes:
the upgraded inspection module is used for inspecting each node in the distributed cluster after the upgraded original main node is used as a target standby main node of the target main node;
the information generation module is used for generating upgrading abnormity prompt information when the inspection result is abnormal; the abnormal inspection result comprises at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service and the upgraded management software is abnormal in database.
Referring to fig. 4, an electronic device according to an embodiment of the present invention includes:
a memory 21 for storing a computer program;
the processor 22 is configured to implement the steps of the online upgrade method for distributed cluster management software according to the foregoing method embodiments when executing the computer program.
In this embodiment, the device may be a PC (Personal Computer), or may be a terminal device such as a smart phone, a tablet Computer, a palmtop Computer, or a portable Computer.
The device may include a memory 21, a processor 22, and a bus 23.
The memory 21 includes at least one type of readable storage medium, which includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 21 may in some embodiments be an internal storage unit of the device, for example a hard disk of the device. The memory 21 may also be an external storage device of the device in other embodiments, such as a plug-in hard disk, Smart Media Card (SMC), Secure Digital (SD) Card, Flash memory Card (Flash Card), etc. provided on the device. Further, the memory 21 may also include both an internal storage unit of the device and an external storage device. The memory 21 may be used not only to store application software installed in the device and various types of data such as program codes for performing an upgrade method, etc., but also to temporarily store data that has been output or is to be output.
The bus 23 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 4, but this does not indicate only one bus or one type of bus.
Further, the device may further include a network interface 24, and the network interface 24 may optionally include a wired interface and/or a wireless interface (e.g., WI-FI interface, bluetooth interface, etc.), which are generally used to establish a communication connection between the device and other electronic devices.
Optionally, the device may further comprise a user interface 25, the user interface 25 may comprise a Display (Display), an input unit such as a Keyboard (Keyboard), and the optional user interface 25 may also comprise a standard wired interface, a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or display unit, is suitable for displaying information processed in the device and for displaying a visualized user interface.
Fig. 4 shows only the device with the components 21-25, and it will be understood by those skilled in the art that the structure shown in fig. 4 does not constitute a limitation of the device, and may comprise fewer or more components than those shown, or some components may be combined, or a different arrangement of components.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the online upgrade method for distributed cluster management software described in the above method embodiment are implemented.
Wherein the storage medium may include: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
In summary, in the scheme, an online upgrade scheme for large-scale distributed cluster management software is provided, which aims to quickly and simply complete the version upgrade of the cluster management software on the premise of not influencing the management software and the cluster function; specifically, before the management software is upgraded, the upgrade package, the cluster environment and each node environment can be checked, the check result is output, the check item which cannot pass the check is repaired, and the upgrade failure caused by avoidable problems in the upgrade process is avoided; when the management software is upgraded, the management software can be upgraded without interrupting the service of the management software, and after the management software is upgraded, the upgraded cluster environment can be checked to judge whether the cluster nodes are upgraded successfully or not and output abnormal cluster contents.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims (10)
1. An online upgrade method for distributed cluster management software is characterized by comprising the following steps:
before upgrading the management software, executing a checking task to obtain a checking result; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
judging whether a target check item which does not pass the check exists in the check result;
if yes, automatically repairing the target check item; if the slave nodes do not exist, the management software of each slave node is upgraded;
after the management software of each slave node is upgraded, the management software of the original master node is upgraded, the upgraded original master node is used as a target master node, and the virtual IP of the management software of the original master node is drifted to the target master node; and upgrading the management software of the original main node, and taking the upgraded original main node as a target standby main node of the target main node.
2. The online upgrade method according to claim 1, wherein the performing of the inspection task obtains an inspection result, including:
adding the inspection task to a task manager to monitor execution progress of the inspection task through the task manager;
and executing the inspection task to obtain the inspection result of each inspection item, and displaying the inspection result of each inspection item through an interface.
3. The online upgrade method according to claim 1, wherein, when checking the node environments, the performing the check task to obtain the check result includes:
dividing nodes to be checked into different node groups;
and sequentially checking each node group, and when checking each node group, checking each node to be checked in the node group in a concurrent mode.
4. The online upgrading method according to claim 1, wherein the upgrading the management software of each slave node includes:
and adding the management software upgrading task of each slave node to an online upgrading task manager, and upgrading the management software of each slave node in a concurrent mode through the online upgrading task manager.
5. The online upgrade method according to claim 4, wherein, when upgrading the management software of each slave node, the method further comprises:
and if the management software of the slave node fails to be upgraded, returning the management software version of the slave node which fails to be upgraded to the management software version before being upgraded.
6. The online upgrade method according to claim 1, wherein before the upgrade of the original master node, the online upgrade method further comprises:
and migrating the data information in the database of the original main node to the database of the target main node according to the updated database format.
7. The online upgrade method according to any one of claims 1 to 6, wherein after the upgraded original master node is used as a target standby master node of the target master node, the method further includes:
checking each node in the distributed cluster;
if the checking result is abnormal, generating upgrading abnormal prompt information; the abnormal inspection result comprises at least one of the following conditions: the upgraded management software versions of the nodes are inconsistent, the upgraded management software is abnormal in service and the upgraded management software is abnormal in database.
8. An online upgrade device for distributed cluster management software, comprising:
the pre-upgrade check module is used for executing a check task to obtain a check result before the management software is upgraded; the inspection task is used for inspecting an upgrade package, a cluster environment and each node environment;
the judging module is used for judging whether a target checking item which cannot pass the checking exists in the checking result;
the restoration module is used for automatically restoring the target check item when the check result has the target check item which cannot pass the check;
the first upgrading module is used for upgrading the management software of each slave node when the checking result does not have a target checking item which cannot pass the checking;
the second upgrading module is used for upgrading the management software of the original standby main node after the management software of each slave node is upgraded, and the upgraded original standby main node is used as a target main node;
the IP drifting module is used for drifting the virtual IP of the management software of the original main node to the target main node;
and the third upgrading module is used for upgrading the management software of the original main node and taking the upgraded original main node as a target standby main node of the target main node.
9. An electronic device, comprising:
a memory for storing a computer program;
a processor for implementing the steps of the method for online upgrade of distributed cluster management software according to any of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, characterized in that a computer program is stored thereon, which computer program, when being executed by a processor, carries out the steps of the method for online upgrade of distributed cluster management software according to any of the claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111138029.2A CN113805925A (en) | 2021-09-27 | 2021-09-27 | Online upgrading method, device, equipment and medium for distributed cluster management software |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111138029.2A CN113805925A (en) | 2021-09-27 | 2021-09-27 | Online upgrading method, device, equipment and medium for distributed cluster management software |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113805925A true CN113805925A (en) | 2021-12-17 |
Family
ID=78938745
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111138029.2A Pending CN113805925A (en) | 2021-09-27 | 2021-09-27 | Online upgrading method, device, equipment and medium for distributed cluster management software |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113805925A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114500525A (en) * | 2021-12-24 | 2022-05-13 | 天翼云科技有限公司 | Method, device, computer equipment and medium for updating nodes in distributed system |
CN114640586A (en) * | 2022-05-19 | 2022-06-17 | 飞狐信息技术(天津)有限公司 | Cluster version upgrading method and device, storage medium and equipment |
CN117422418A (en) * | 2023-11-07 | 2024-01-19 | 深圳市鹏中科技有限公司 | Method, device, equipment and storage medium for developing operation link visualization |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107193611A (en) * | 2017-05-31 | 2017-09-22 | 郑州云海信息技术有限公司 | The method for upgrading software and system of a kind of group system |
CN108667641A (en) * | 2017-03-30 | 2018-10-16 | 杭州海康威视数字技术股份有限公司 | Distributed system upgrade method, distributed system and updating and management equipment |
CN108829420A (en) * | 2018-06-12 | 2018-11-16 | 郑州云海信息技术有限公司 | A kind of distributed memory system method for upgrading software based on RPM packet |
CN109101370A (en) * | 2018-08-30 | 2018-12-28 | 重庆富民银行股份有限公司 | The date storage method of principal and subordinate's bi-directional synchronization |
CN110879718A (en) * | 2019-11-15 | 2020-03-13 | 北京浪潮数据技术有限公司 | maridb upgrading method and device, electronic equipment and storage medium |
-
2021
- 2021-09-27 CN CN202111138029.2A patent/CN113805925A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108667641A (en) * | 2017-03-30 | 2018-10-16 | 杭州海康威视数字技术股份有限公司 | Distributed system upgrade method, distributed system and updating and management equipment |
CN107193611A (en) * | 2017-05-31 | 2017-09-22 | 郑州云海信息技术有限公司 | The method for upgrading software and system of a kind of group system |
CN108829420A (en) * | 2018-06-12 | 2018-11-16 | 郑州云海信息技术有限公司 | A kind of distributed memory system method for upgrading software based on RPM packet |
CN109101370A (en) * | 2018-08-30 | 2018-12-28 | 重庆富民银行股份有限公司 | The date storage method of principal and subordinate's bi-directional synchronization |
CN110879718A (en) * | 2019-11-15 | 2020-03-13 | 北京浪潮数据技术有限公司 | maridb upgrading method and device, electronic equipment and storage medium |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114500525A (en) * | 2021-12-24 | 2022-05-13 | 天翼云科技有限公司 | Method, device, computer equipment and medium for updating nodes in distributed system |
CN114500525B (en) * | 2021-12-24 | 2024-04-26 | 天翼云科技有限公司 | Method, device, computer equipment and medium for updating nodes in distributed system |
CN114640586A (en) * | 2022-05-19 | 2022-06-17 | 飞狐信息技术(天津)有限公司 | Cluster version upgrading method and device, storage medium and equipment |
CN117422418A (en) * | 2023-11-07 | 2024-01-19 | 深圳市鹏中科技有限公司 | Method, device, equipment and storage medium for developing operation link visualization |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102493449B1 (en) | Edge computing test methods, devices, electronic devices and computer-readable media | |
CN113805925A (en) | Online upgrading method, device, equipment and medium for distributed cluster management software | |
US20120159443A1 (en) | System and method for reducing test effort by object risk analysis | |
US10042744B2 (en) | Adopting an existing automation script to a new framework | |
US10795793B1 (en) | Method and system for simulating system failures using domain-specific language constructs | |
CN111338656A (en) | Method and device for installing software package to target host and computer equipment | |
CN110851471A (en) | Distributed log data processing method, device and system | |
CN112256593B (en) | Program processing method and device, computer equipment and readable storage medium | |
CN111417019A (en) | Method and device for processing plug-in abnormity, computer equipment and storage medium | |
CN111694684A (en) | Abnormal construction method and device of storage equipment, electronic equipment and storage medium | |
CN114116330A (en) | Server performance test method, system, terminal and storage medium | |
CN109002317B (en) | PCBA firmware upgrading method and system and PCBA | |
CN115757099A (en) | Automatic test method and device for platform firmware protection recovery function | |
CN106528248B (en) | Downloading method and system of mobile terminal based on software integrity | |
JP2023044720A (en) | Computer implemented method for recovering crashed application, computer program product, and remote computer server (remote recovery of crashed process) | |
CN115729679A (en) | Task processing method and device, computer readable storage medium and electronic device | |
CN114124835A (en) | Interface-based data transmission method, device, equipment and medium | |
CN113505079A (en) | Application program testing method, device, equipment and medium | |
JP2012181737A (en) | Computer system | |
CN112379973A (en) | Heavy loading method and device | |
CN114327588A (en) | Method and device for processing code submission log | |
Lim et al. | TAOS-CI: lightweight & modular continuous integration system for edge computing | |
CN117971564B (en) | Data recovery method, device, computer equipment and storage medium | |
CN110908725A (en) | Application program starting method and device, electronic equipment and readable medium | |
US20240330477A1 (en) | System and method for evaluating active backups using penetration testing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |