[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN106301853A - The fault detection method of group system interior joint and device - Google Patents

The fault detection method of group system interior joint and device Download PDF

Info

Publication number
CN106301853A
CN106301853A CN201510306800.0A CN201510306800A CN106301853A CN 106301853 A CN106301853 A CN 106301853A CN 201510306800 A CN201510306800 A CN 201510306800A CN 106301853 A CN106301853 A CN 106301853A
Authority
CN
China
Prior art keywords
node
nodal point
neighbor
neighbor nodes
heartbeat message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510306800.0A
Other languages
Chinese (zh)
Other versions
CN106301853B (en
Inventor
胡琳
伍湘平
彭佩星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201510306800.0A priority Critical patent/CN106301853B/en
Priority to PCT/CN2016/073606 priority patent/WO2016192408A1/en
Publication of CN106301853A publication Critical patent/CN106301853A/en
Application granted granted Critical
Publication of CN106301853B publication Critical patent/CN106301853B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Cardiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Hardware Redundancy (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)

Abstract

The embodiment of the present invention provides fault detection method and the device of a kind of group system interior joint, the method includes: primary nodal point judges whether receive the first heartbeat message that secondary nodal point sends in Preset Time, primary nodal point is the neighbor node of secondary nodal point, and the first heartbeat message is the heartbeat message that secondary nodal point sends to each neighbor node of secondary nodal point concurrently;In the case of primary nodal point does not receives the heartbeat message that secondary nodal point sends, in all neighbor nodes of secondary nodal point, other neighbor nodes in addition to primary nodal point send request message;Primary nodal point receives the response message carrying reception state that other neighbor nodes send;In the case of according to reception state, primary nodal point determines that other neighbor nodes all do not receive heartbeat message, primary nodal point determines that secondary nodal point breaks down.The fault detection method of the group system interior joint that the embodiment of the present invention provides and device can improve the efficiency of node failure detection.

Description

The fault detection method of group system interior joint and device
Technical field
The present embodiments relate to communication technology, particularly relate to the fault detect of a kind of group system interior joint Method and apparatus.
Background technology
In distributed cluster system, generally include a Centroid and multiple ordinary node, work as center After node or ordinary node break down, the reliability of distributed cluster system will be caused the biggest shadow Ring, therefore, the most effectively carry out the fault detect of node, be very important.
Fig. 1 is the schematic diagram of the fault detection method of prior art interior joint, as it is shown in figure 1, commonly save Point (B, C, D, E) sends heartbeat message, Centroid (M) according to heart beat cycle to Centroid (M) According to the situation of the successive heartbeat message received in the detection cycle, detect ordinary node whether fault, its In, a detection cycle can comprise multiple heart beat cycle.Meanwhile, Centroid (M) can also be periodically To ordinary node (B, C, D, E) send heartbeat message, to notify that ordinary node Centroid is served as Role and whether be in normal condition, once ordinary node (B, C, D, E) within the detection cycle not Receive the heartbeat message that Centroid (M) sends, then can judge that Centroid (M) breaks down, this Time, ordinary node can initiate to re-elect the operation of Centroid, if electing successfully, ordinary node will sense Knowing new Centroid, and heartbeat message is sent to new Centroid, cluster carries out fault detect again.
But, in the prior art, by judging whether to receive the side of heartbeat message within the detection cycle Formula detects node when whether breaking down, owing to, in the case of cluster scale is fixing, sending heart beating report The heart beat cycle of literary composition cannot change, and therefore detects the time in cycle and also cannot change so that node failure is examined Survey needs just can be detected by multiple heart beat cycles, and the cycle causing node failure to detect is longer, leads Cause the inefficient of node failure detection.
Summary of the invention
The embodiment of the present invention provides fault detection method and the device of a kind of group system interior joint, is used for solving The node failure detection that certainly prior art also exists needs just can be detected by multiple heart beat cycles, makes Become the problem that the cycle of node failure detection is longer, thus improve the efficiency of node failure detection.
First aspect, the embodiment of the present invention provides the fault detection method of a kind of group system interior joint, bag Include:
Primary nodal point judges whether receive the first heartbeat message that secondary nodal point sends in Preset Time; Described primary nodal point is the neighbor node of described secondary nodal point, and described first heartbeat message is described second section The heartbeat message that point sends to each neighbor node of described secondary nodal point concurrently, described second section The number of all neighbor nodes of point is two or more;Described Preset Time is more than or equal to a heart beating week Phase, and less than two heart beat cycles;
In the case of described primary nodal point does not receives the first heartbeat message that described secondary nodal point sends, Described primary nodal point other in addition to described primary nodal point in all neighbor nodes of described secondary nodal point Neighbor node sends request message, and described request message is used for inquiring whether other neighbor nodes described receive To described first heartbeat message;
Described primary nodal point receives the response carrying reception state of described other neighbor nodes transmission and disappears Breath, described reception state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report In the case of literary composition, described primary nodal point determines that described secondary nodal point breaks down.
In conjunction with first aspect, in the first possible implementation of first aspect, described primary nodal point After determining that described secondary nodal point breaks down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding Mark;Described second vote information includes the node sending the neighbor node election of described second vote information Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described Node identification in the second vote information that node sends, each node in all nodes that statistics is elected The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and The neighbor node of described secondary nodal point.
In conjunction with the first possible implementation of first aspect or first aspect, in the second of first aspect Plant in possible implementation, also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that at least one other neighbor node described receives described first heart In the case of jumping message, described primary nodal point determines node and the institute not receiving described first heartbeat message The link stated between secondary nodal point breaks down;The described node bag not receiving described first heartbeat message Include the node not receiving described first heartbeat message in described primary nodal point and other neighbor nodes described.
In conjunction with first aspect, first aspect the first to any one possible reality of the second of first aspect Existing mode, in the third possible implementation of first aspect, also includes:
Described primary nodal point according in the neighbor node of described 3rd node and other neighbor nodes described except institute State the node outside the 3rd node, redefine the neighbor node of described primary nodal point.
Second aspect, the embodiment of the present invention provides the fault detection method of a kind of group system interior joint, institute The method of stating includes:
Secondary nodal point sends the first heartbeat message to primary nodal point and other neighbor nodes concurrently;Described One node is the neighbor node of described secondary nodal point, and other neighbor nodes described are the institute of described secondary nodal point Having the node in addition to described primary nodal point in neighbor node, the number of other neighbor nodes described is one Above;
Described primary nodal point judges whether receive described first heartbeat message in Preset Time;Described pre- If the time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point to Each other neighbor nodes described send request message respectively, and described request message is used for inquiring each described Whether other neighbor nodes receive described first heartbeat message;
Described primary nodal point receives the response carrying reception state that each other neighbor nodes described send Message, described reception state is used for indicating whether to receive described first heartbeat message;
At described primary nodal point according to the reception state carried in the described response message received, determine In the case of other neighbor nodes described all do not receive described first heartbeat message, described primary nodal point is true Fixed described secondary nodal point breaks down.
In conjunction with second aspect, in the first possible implementation of second aspect, described primary nodal point After determining that described secondary nodal point breaks down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding Mark;Described second vote information includes the node sending the neighbor node election of described second vote information Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described Node identification in the second vote information that node sends, each node in all nodes that statistics is elected The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and The neighbor node of described secondary nodal point.
In conjunction with the first possible implementation of second aspect or second aspect, in the second of second aspect Plant in possible implementation, also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that at least one other neighbor node described receives described first heart Jump in the case of message, the most described primary nodal point determine the node not receiving described first heartbeat message with Link between described secondary nodal point breaks down;The described node not receiving described first heartbeat message Including the node not receiving the first heartbeat message in described primary nodal point and other neighbor nodes described.
In conjunction with second aspect, second aspect the first to any one possible reality of the second of second aspect Existing mode, in the third possible implementation of second aspect, also includes:
Described primary nodal point according in the neighbor node of described 3rd node and other neighbor nodes described except institute State the node outside the 3rd node, redefine the neighbor node of described primary nodal point.
The third aspect, the embodiment of the present invention provides the failure detector of a kind of group system interior joint, bag Include:
Judge module, for judging whether receive the first heart beating that secondary nodal point sends in Preset Time Message;Described primary nodal point is the neighbor node of described secondary nodal point, and described first heartbeat message is described The heartbeat message that secondary nodal point sends to each neighbor node of described secondary nodal point concurrently, described The number of all neighbor nodes of secondary nodal point is two or more;Described Preset Time is more than or equal to one Heart beat cycle, and less than two heart beat cycles;
Judge that receiver module does not receives the first heart beating that described secondary nodal point sends at described judge module In the case of message,
Sending module, in all neighbor nodes of described secondary nodal point in addition to described primary nodal point Other neighbor nodes send request message, described request message is used for inquiring that other neighbor nodes described are No receive described first heartbeat message;
Described receiver module, is additionally operable to the reception state that carries of other neighbor nodes transmission described in receiving Response message, described reception state is used for indicating whether to receive described first heartbeat message;
Determine module, send for other neighbor nodes each described received according to described receiver module Described response message in the reception state carried, it is determined whether other neighbor nodes described all do not receive Described first heartbeat message;
Determine that module determines that other neighbor nodes described all do not receive described first heartbeat message described In the case of, described determine module, be additionally operable to determine that described secondary nodal point breaks down.
In conjunction with the third aspect, in the first possible implementation of the third aspect, determine mould described After block determines that described secondary nodal point breaks down, also include:
Generation module, is additionally operable to generate the first vote information, and described first vote information includes described first The node identification corresponding to node of node election;
Described receiver module, is additionally operable to receive the second vote information that each other neighbor nodes described send, Described second vote information includes that the node sending the neighbor node election of described second vote information is corresponding Node identification;
Described determine module, be additionally operable to according to the node identification in described first vote information and each described Node identification in the second vote information that other neighbor nodes send, in all nodes that statistics is elected The ballot quantity that each node obtains, and using nodes most for quantity of voting as the 3rd node;Described Three nodes are to substitute described secondary nodal point and send to all neighbor nodes of described 3rd node concurrently The node of heartbeat message;All neighbor nodes of described 3rd node include the neighbour of described 3rd node self Occupy node and the neighbor node of described secondary nodal point.
In conjunction with the first possible implementation of the third aspect or the third aspect, in the second of the third aspect Plant in possible implementation,
Send at described other neighbor nodes each described determining that module receives according to described receiver module Described response message in the reception state carried, determine that at least one other neighbor node described receives In the case of described first heartbeat message,
Described determine that module is additionally operable to determine the node and described second not receiving described first heartbeat message Link between node breaks down;The described node not receiving described first heartbeat message includes described Primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
In conjunction with the third aspect, the third aspect the first to any one possible reality of the second of the third aspect Existing mode, in the third possible implementation of the third aspect,
Described determine that module is additionally operable to the neighbor node according to described 3rd node and other neighbor nodes described In node in addition to described 3rd node, redefine the neighbor node of described primary nodal point.
Fourth aspect, the embodiment of the present invention provides the fault detection system of a kind of group system interior joint, bag Including primary nodal point, secondary nodal point and other neighbor nodes, described primary nodal point is the neighbour of described secondary nodal point Occupy node, other neighbor nodes described be described secondary nodal point all neighbor nodes in except described first segment Node outside Dian, the number of other neighbor nodes described is more than one, including:
Described secondary nodal point, for sending the to described primary nodal point and other neighbor nodes described concurrently One heartbeat message;
Described primary nodal point, for judging whether receive described first heartbeat message in Preset Time; Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point is also For sending request message respectively to each other neighbor nodes described, described request message is used for inquiring often Whether other neighbor nodes described in receive described first heartbeat message;And, described primary nodal point is also For receiving the response message carrying reception state that each other neighbor nodes described send, described in connect Receipts state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report In the case of literary composition, described primary nodal point is additionally operable to determine that described secondary nodal point breaks down.
In conjunction with fourth aspect, in the first possible implementation of fourth aspect, described primary nodal point After determining that described secondary nodal point breaks down, also include:
Described primary nodal point is additionally operable to:
Generate the first vote information, and receive the second vote information that each other neighbor nodes described send, Described first vote information includes the node identification that node that described primary nodal point is elected is corresponding, described second The node mark that the node of the neighbor node election that vote information includes sending described second vote information is corresponding Know;
And, send out according to the node identification in described first vote information and each other neighbor nodes described Node identification in the second vote information sent, in all nodes that statistics is elected, each node obtains Ballot quantity, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is for substituting institute State secondary nodal point and send the node of heartbeat message concurrently to all neighbor nodes of described 3rd node; All neighbor nodes of described 3rd node include the neighbor node and described second of described 3rd node self The neighbor node of node.
In conjunction with the first possible implementation of fourth aspect or fourth aspect, in the second of fourth aspect Plant in possible implementation,
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that at least one other neighbor node described receives described first heart In the case of jumping message,
Described primary nodal point is additionally operable to determine the node and described second not receiving described first heartbeat message Link between node breaks down;The described node not receiving described first heartbeat message includes described Primary nodal point and other neighbor nodes described do not receive the neighbor node of described first heartbeat message.
In conjunction with fourth aspect, fourth aspect the first to any one possible reality of the second of fourth aspect Existing mode, in the third possible implementation of fourth aspect,
Described primary nodal point is additionally operable to the neighbor node according to described 3rd node and other neighbor nodes described In node in addition to described 3rd node, redefine the neighbor node of described primary nodal point.
In the fault detection method of the group system interior joint that the embodiment of the present invention provides and device, first segment Point judges whether receive the first heartbeat message that secondary nodal point sends, wherein, first in Preset Time Node is the neighbor node of secondary nodal point, and the first heartbeat message is that secondary nodal point is concurrently to secondary nodal point The heartbeat message that each neighbor node sends, the number of all neighbor nodes of secondary nodal point is two Above;This Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;First segment Point, in the case of self not receiving the first heartbeat message, inquires other neighbor nodes of this secondary nodal point Whether receive the first heartbeat message, and the most all do not receive at other neighbor nodes determining this secondary nodal point In the case of this first heartbeat message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than Or equal to a heart beat cycle, and less than two heart beat cycles, so using the technical side that the present invention provides When case carries out fault detect, it is to avoid prior art needs just can be detected joint by multiple heart beat cycles The phenomenon whether point breaks down, can shorten the cycle of fault detect, thus improves node failure inspection The efficiency surveyed.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that under, Accompanying drawing during face describes is some embodiments of the present invention, for those of ordinary skill in the art, On the premise of not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the structural representation of the fault detection method of group system interior joint in prior art;
The flow process of the fault detection method embodiment one of the group system interior joint that Fig. 2 provides for the present invention Schematic diagram;
Fig. 3 is the schematic diagram one of neighbouring relations between group system interior joint;
Fig. 4 is the schematic diagram two of neighbouring relations between group system interior joint;
The flow process of the fault detection method embodiment two of the group system interior joint that Fig. 5 provides for the present invention Schematic diagram;
Fig. 6 A is the signal of neighbouring relations between front nodal point node failure being detected in group system Figure;
Fig. 6 B be group system detects node failure after redefine neighbouring relations between node Schematic diagram;
The flow process of the fault detection method embodiment three of the group system interior joint that Fig. 7 provides for the present invention Schematic diagram;
The flow process of the fault detection method embodiment four of the group system interior joint that Fig. 8 provides for the present invention Schematic diagram;
Fig. 9 is the structural representation of the failure detector embodiment one of group system interior joint of the present invention Figure;
Figure 10 is the structural representation of the fault detection system embodiment one of group system interior joint of the present invention Figure Figure 10 is the structural representation of node embodiment one of the present invention;
Figure 11 is the structural representation of node embodiment one of the present invention.
Detailed description of the invention
For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with this Accompanying drawing in bright embodiment, is clearly and completely described the technical scheme in the embodiment of the present invention, Obviously, described embodiment is a part of embodiment of the present invention rather than whole embodiments.Based on Embodiment in the present invention, those of ordinary skill in the art are obtained under not making creative work premise The every other embodiment obtained, broadly falls into the scope of protection of the invention.
The embodiment of the present invention is be applicable to group system, and it it is particularly applicable to distributed cluster system interior joint The scene of fault detect.This distributed cluster system includes at least two node, and this node is the most permissible It it is computer.Optionally, the node in the group system in the present embodiment is with existing group system not It is with part: in the group system of the present embodiment, all of node is all given identical function, i.e. All of node all has identical receiving heartbeat message and send the ability of heartbeat message, therefore, at this In the group system of embodiment, there is not the differentiation of Centroid and ordinary node, it is not required that center Node administration ordinary node.Optionally, the technical scheme of following embodiment is all using computer as performing master Body is introduced.
The flow process of the fault detection method embodiment one of the group system interior joint that Fig. 2 provides for the present invention is shown It is intended to.The method that the present embodiments relate to is applicable to distributed cluster system.The present embodiment is with computer Introduce as a example by executive agent.As in figure 2 it is shown, the method for the present embodiment may include that
Step 201, primary nodal point judge whether receive first that secondary nodal point sends in Preset Time Heartbeat message;Primary nodal point is the neighbor node of secondary nodal point, and the first heartbeat message is that secondary nodal point is parallel The heartbeat message that ground sends to each neighbor node of secondary nodal point, all neighbours joint of secondary nodal point The number of point is two or more;Preset Time is more than or equal to a heart beat cycle, and less than two heart beatings Cycle.
In the present embodiment, secondary nodal point is according to the information of nodes all in group system, according to cluster system The rule preset in system determines primary nodal point, and wherein, primary nodal point is any one neighbour of secondary nodal point Occupying node, the neighbor node of secondary nodal point is the node relevant with secondary nodal point.Fig. 3 is cluster The schematic diagram one of neighbouring relations between system interior joint, as it is shown on figure 3, in group system, node E According to the information of all nodes, can determine that it has four neighbours according to the rule preset in group system Node, is node A, B, C and D respectively.Wherein, primary nodal point can be node A, B, C and Any one in D.Primary nodal point is by judging that whether receiving secondary nodal point in Preset Time sends The first heartbeat message, detect whether secondary nodal point breaks down.Need to illustrate, second Node is by sending heartbeat message, therefore, the first heart beating report to its all neighbor nodes concurrently Literary composition is the heart that secondary nodal point sends to each neighbor node of secondary nodal point in the same moment concurrently Jump message.It addition, secondary nodal point can send to its all neighbor nodes concurrently according to heart beat cycle First heartbeat message, therefore, primary nodal point may determine that more than or equal to a heart beat cycle and little The first heartbeat message that this secondary nodal point sends whether is received within the time of two heart beat cycles.Such as: Assume that heart beat cycle is 5s, i.e. secondary nodal point every 5s, will send to its all neighbor nodes concurrently Heartbeat message, the first heartbeat message sent at 5s for secondary nodal point, primary nodal point will determine that More than or equal to 5s, and within less than the time of 10s, whether receive first heart that secondary nodal point sends Jump message.Wherein, heart beat cycle can rule of thumb or practical situation is configured, for heart beating week The concrete value of phase, this is not restricted for the present embodiment.
It addition, secondary nodal point periodically can send first heart to primary nodal point by a physical network Jump message, but during owing to carrying out fault detect based on single physical network, at network failure, such as: Management plane network failure, and when service plane network is normal, often cannot define is group system Middle secondary nodal point there occurs that the link between fault or secondary nodal point and primary nodal point there occurs fault, or Person's secondary nodal point and primary nodal point there occurs fault simultaneously, and thus, the testing result of causing trouble is inaccurate. In order to solve this problem, it is preferable that the present embodiment can also send first by least two network Heartbeat message, for example, the first heartbeat message can be sent by biplane, such as: management plane And service plane, it is also possible to send the first heartbeat message by three planes, such as: management plane, business Plane and signaling plane.The mode using many physical networks sends the first heartbeat message, detects node and is No break down, the accuracy of detection can be improved.Need to illustrate, if the number of physical network When amount is at least two, mutually isolated between this at least two physical network, so can avoid due to many When there is some equipment shared between network, if shared device breaks down, thus cause nothing between node The phenomenon of method proper communication, is conducive to improving the accuracy of detection.
Step 202, primary nodal point do not receive secondary nodal point send the first heartbeat message in the case of, Primary nodal point other neighbor nodes in addition to primary nodal point in all neighbor nodes of secondary nodal point send Request message, request message is used for inquiring whether other neighbor nodes receive the first heartbeat message.
In the prior art, ordinary node be sent to the heart beat cycle of Centroid fixing in the case of, Because the restriction of the performance of Centroid, group system cannot infinitely increase ordinary node so that cluster system The autgmentability of system is affected.For this problem, in the embodiment of the present invention, if primary nodal point does not exist Receive the first heartbeat message that secondary nodal point sends in Preset Time, can primarily determine that secondary nodal point has May have occurred fault.Owing to secondary nodal point is first heart sent to its all neighbor nodes concurrently Jumping message, therefore, primary nodal point is by the neighbor node of secondary nodal point, and other in addition to self are adjacent Occupy node and send request message, to inquire whether other neighbor nodes receive first that secondary nodal point sends Heartbeat message.As can be seen here, when primary nodal point does not receives the first heartbeat message that secondary nodal point sends, Primary nodal point can send request message to other neighbor nodes of secondary nodal point, and secondary nodal point is non- Neighbor node also no longer will send heartbeat message to secondary nodal point, thus can reduce secondary nodal point and process the heart Jump the quantity of message, such that it is able to alleviate the burden of secondary nodal point so that the extensibility of group system is relatively Good.
For example, Fig. 4 is the schematic diagram two of neighbouring relations between group system interior joint, such as Fig. 4 institute Showing, the neighbor node of node E has X, A, D, C and G, node E will in each heart beat cycle to Its all neighbor node X, A, D, C and G send heartbeat message, it is assumed that using node E as second Node, using node A as primary nodal point, if in some heart beat cycle, primary nodal point A does not receives To secondary nodal point E send the first heartbeat message, then primary nodal point A will to other neighbor nodes X, D, C and G send request message, to inquire whether nodes X, D, C and G receive the first heart beating report Literary composition.
Step 203, primary nodal point receive the response carrying reception state of other neighbor nodes transmission and disappear Breath, this reception state is used for indicating whether to receive the first heartbeat message.
In the present embodiment, after other neighbor nodes receive the request message that primary nodal point sends, will be from Whether body receives the reception state of the first heartbeat message is carried and is sent to primary nodal point in the response message.
Step 204, the response message sent according to other neighbor nodes each received at primary nodal point In the reception state carried, in the case of determining that other neighbor nodes all do not receive the first heartbeat message, Primary nodal point determines that secondary nodal point breaks down.
In the present embodiment, each other neighbor node is receiving the request message that primary nodal point sends Afterwards, all can return the response message carrying reception state to primary nodal point, primary nodal point is according to reception To other neighbor nodes each send the response message carrying reception state, it is judged that other neighbours save Whether point receives the first heartbeat message, is judging that other neighbor nodes are all not received by secondary nodal point In the case of the first heartbeat message sent, i.e. can determine that secondary nodal point there occurs fault.
Needing to illustrate, the neighbouring relations between node are two-way, i.e. form neighborhood Can mutually send heartbeat message between node, therefore, all neighbor nodes of secondary nodal point all can be independent Execution step 201-step 204.
In the fault detection method of the group system interior joint that the embodiment of the present invention provides, primary nodal point judges Whether receiving the first heartbeat message that secondary nodal point sends in Preset Time, wherein, primary nodal point is The neighbor node of secondary nodal point, the first heartbeat message is that secondary nodal point is concurrently to each neighbour of secondary nodal point Occupying the heartbeat message that node sends, the number of all neighbor nodes of secondary nodal point is two or more; This Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point is certainly In the case of body does not receives the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point connect Receive the first heartbeat message, and other neighbor nodes determining this secondary nodal point the most all do not receive this In the case of one heartbeat message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to One heart beat cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide is carried out During fault detect, it is to avoid prior art needs by whether multiple heart beat cycles just can detect node The phenomenon broken down, shortens the cycle of fault detect, thus improves the efficiency of node failure detection.
The flow process of the fault detection method embodiment two of the group system interior joint that Fig. 5 provides for the present invention is shown It is intended to.On the basis of embodiment illustrated in fig. 2, after primary nodal point is determined that secondary nodal point breaks down, Each node redefines the embodiment of neighbor node, elaborates.As it is shown in figure 5, the present embodiment Method may include that
Step 501, primary nodal point generate the first vote information, and receive other neighbor nodes each transmission The second vote information, the first vote information includes the node identification that node that primary nodal point is elected is corresponding; The node mark that the node of the neighbor node election that the second vote information includes sending the second vote information is corresponding Know.
In the present embodiment, after the neighbor node of secondary nodal point determines that secondary nodal point breaks down, All of neighbor node is required to recalculate respective neighbor node.For purposes of illustration only, can be by second Any one neighbor node of node is as primary nodal point, and primary nodal point needs to generate the first vote information, This first vote information comprises node identification corresponding to node and the ballot foundation of primary nodal point election. It addition, primary nodal point also to receive the second vote information that other neighbor nodes each send, the second ballot Node identification that the node of the neighbor node election that information includes sending the second vote information is corresponding and throwing Ticket foundation.In actual applications, ballot is according to relevant with many factors, such as: loading condition, node Size, nodal cache newness degree and the meshed network bandwidth etc. of numbering, such as: primary nodal point can lead to Cross and judge that the load which node is undertaken is minimum, and by node mark corresponding for this node of load minimum Know to carry in the first vote information, be sent to other neighbor nodes.Same, other neighbor nodes also may be used With in the way of similar, the second vote information is sent to primary nodal point.
Step 502, primary nodal point save according to the node identification in the first vote information and other neighbours each Node identification in the second vote information that point sends, each node in all nodes that statistics is elected The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;3rd node is for substituting Secondary nodal point and concurrently to the 3rd node all neighbor nodes send heartbeat message node;3rd All neighbor nodes of node include neighbor node and the neighbor node of secondary nodal point of the 3rd node self.
In the present embodiment, primary nodal point is receiving the second ballot letter that other neighbor nodes each send After breath, according to the node identification in the first vote information that self generates and the second vote information received In node identification, it may be determined that go out the 3rd node.During concrete implementation, can be according to first The node identification carried in vote information and the second vote information, by the way of vote by ballot, adds up quilt The ballot quantity that in all nodes of election, each node obtains, and the node most by obtaining ballot quantity As the 3rd node.The neighbor node of the secondary nodal point that the 3rd node breaks down for adapter, Ye Jijie Incidence relation between pipe secondary nodal point and other nodes, therefore, the 3rd node general's replacement secondary nodal point, And concurrently to the node of all neighbor nodes transmission heartbeat message of the 3rd node, wherein, the 3rd node All neighbor nodes in addition to including the neighbor node of the 3rd node self, also include secondary nodal point Neighbor node.
Step 503, primary nodal point according in the neighbor node of the 3rd node and other neighbor nodes except the 3rd Node outside node, redefines the neighbor node of primary nodal point.
In the present embodiment, all neighbor nodes of secondary nodal point determine by the way of vote by ballot After three nodes, if primary nodal point is the 3rd node, then primary nodal point will take over the adjacent pass of secondary nodal point System, other neighbor nodes can according to the neighbouring relations after the neighbor node of primary nodal point adapter secondary nodal point, Respective neighbor node is determined again through calculating;If primary nodal point is not the 3rd node, then primary nodal point After treating that the 3rd node redefines out neighbouring relations, neighbor node and other according to the 3rd node are adjacent Occupy the node in addition to the 3rd node in node, redefine the neighbor node of self.
For example, Fig. 6 A detects neighbouring relations between the front nodal point of node failure in being group system Schematic diagram, Fig. 6 B be group system detects node failure after redefine adjacent pass between node The schematic diagram of system.As shown in Figure 6A, it is assumed that node E is secondary nodal point, node A is primary nodal point, After primary nodal point A determines that secondary nodal point E breaks down, primary nodal point A will generate the first ballot letter Breath, and the second vote information that respectively receiving node X, D, C and G send, primary nodal point A according to Node identification in first vote information and the node identification in the second vote information determine the 3rd node, So that the 3rd node substitutes secondary nodal point and sends heart beating to all neighbor nodes of the 3rd node concurrently Message.As shown in Figure 6B, if by vote by ballot, determining that primary nodal point A is the 3rd node, then Substituted secondary nodal point by primary nodal point A and send to all neighbor nodes of primary nodal point A concurrently Heartbeat message, now, primary nodal point A needs by other neighbor nodes X, D, C and G the most true Determine the neighbor node of oneself, and nodes X, D, C and G are the neighbour waiting primary nodal point A to determine oneself After occupying node, the neighbor node determined according to primary nodal point A redefines respective neighbor node.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, primary nodal point judges Whether receiving the first heartbeat message that secondary nodal point sends in Preset Time, wherein, primary nodal point is the The neighbor node of two nodes, the first heartbeat message is that secondary nodal point is concurrently to each neighbours of secondary nodal point The heartbeat message that node sends, the number of all neighbor nodes of secondary nodal point is two or more;Should Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point is at self In the case of not receiving the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point receive To the first heartbeat message, and other neighbor nodes determining this secondary nodal point the most all do not receive this first In the case of heartbeat message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to one Individual heart beat cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide carries out event During barrier detection, it is to avoid prior art needs just to detect whether node is sent out by multiple heart beat cycles The phenomenon of raw fault, shortens the cycle of fault detect, thus improves the efficiency of node failure detection. It addition, by after determining that primary nodal point breaks down, redefine respective neighbor node, and then Proceed fault detect, improve the accuracy of fault detect.
Alternatively, in the response message that primary nodal point sends according to other neighbor nodes each received The reception state carried, determines that at least one other neighbor node receives the situation of the first heartbeat message Under, primary nodal point determine described in do not receive the link between node and the secondary nodal point of the first heartbeat message Break down.
Specifically, primary nodal point is not receiving the first heartbeat message that secondary nodal point sends, and to each Other nodes send request message, to inquire whether other neighbor nodes each receive the first heartbeat message Afterwards, if determining at least other neighbor nodes according to the response message of other nodes each transmission Have received the first heartbeat message, then primary nodal point can determine that secondary nodal point is normal, and may It is between secondary nodal point and primary nodal point and node and the primary nodal point not receiving the first heartbeat message Link there occurs fault, wherein, do not receive the node of the first heartbeat message include primary nodal point and its He does not receives the neighbor node of the first heartbeat message in neighbor node.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, owing to primary nodal point exists In the case of determining that at least one other neighbor node receives the first heartbeat message, primary nodal point determines The link not received between node and the secondary nodal point of the first heartbeat message breaks down so that fault is examined Survey more comprehensive.
The flow process of the fault detection method embodiment three of the group system interior joint that Fig. 7 provides for the present invention is shown It is intended to.The method that the present embodiments relate to is applicable to distributed cluster system.In the present embodiment still with Computer is introduced as a example by executive agent.As it is shown in fig. 7, the method for the present embodiment may include that
Step 701, secondary nodal point send the first heart beating report to primary nodal point and other neighbor nodes concurrently Literary composition, primary nodal point is the neighbor node of secondary nodal point;Other neighbor nodes are all neighbours of secondary nodal point Node in addition to primary nodal point in node, the number of other neighbor nodes is more than one.
In the present embodiment, secondary nodal point can be according to the information of the node included in group system, root Determining self all of neighbor node according to the rule preset in group system, wherein, primary nodal point is the Any one neighbor node of two nodes, the neighbor node of secondary nodal point is pass relevant with this secondary nodal point The node of system.Secondary nodal point after determining all of neighbor node, can concurrently to primary nodal point and Other neighbor nodes send the first heartbeat message.
Step 702, primary nodal point judge whether to receive the first heartbeat message in Preset Time;Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles.
In the present embodiment, secondary nodal point can be according to heart beat cycle concurrently to its all neighbor nodes Sending the first heartbeat message, therefore, primary nodal point may determine that more than or equal to a heart beat cycle, And less than whether receiving the first heartbeat message that this secondary nodal point sends in the time of two heart beat cycles. Such as: assume heart beat cycle be 5s, i.e. secondary nodal point every 5s, will send out to its neighbor node concurrently Sending a heartbeat message, the first heartbeat message sent for secondary nodal point at 5s, primary nodal point will be sentenced Break more than or equal to 5s, and within less than the time of 10s, whether receive first heart that secondary nodal point sends Jump message.Wherein, heart beat cycle can rule of thumb or practical situation is configured, for heart beating week The concrete value of phase, this is not restricted for the present embodiment.
It addition, secondary nodal point periodically can send first heart to primary nodal point by a physical network Jump message, but during owing to carrying out fault detect based on single physical network, at network failure, such as: Management plane network failure, and when service plane network is normal, often cannot define is group system Middle secondary nodal point there occurs that the link between fault or secondary nodal point and primary nodal point there occurs fault, or Person's secondary nodal point and primary nodal point there occurs fault simultaneously, and thus, the testing result of causing trouble is inaccurate. In order to solve this problem, it is preferable that the present embodiment can also send first by least two network Heartbeat message, for example, the first heartbeat message can be sent by biplane, such as: management plane And service plane, it is also possible to send the first heartbeat message by three planes, such as: management plane, business Plane and signaling plane.The mode using many physical networks sends the first heartbeat message, detects node and is No break down, the accuracy of detection can be improved.Need to illustrate, if the number of physical network When amount is at least two, mutually isolated between this at least two physical network, so can avoid due to many When there is some equipment shared between network, if shared device breaks down, thus cause nothing between node The phenomenon of method proper communication, is conducive to improving the accuracy of detection.
Step 703, in the case of primary nodal point does not receives the first heartbeat message, primary nodal point is to often One other neighbor nodes send request message respectively, and request message is used for inquiring that other neighbor nodes each are No receive described first heartbeat message.
In the present embodiment, if primary nodal point does not receive that secondary nodal point sends in Preset Time One heartbeat message, can primarily determine that secondary nodal point has been likely to occur fault.Owing to secondary nodal point is also The first heartbeat message that row ground sends to its all neighbor nodes, therefore, primary nodal point will be to second section In the neighbor node of point, other neighbor nodes in addition to self send request message, to inquire that other are adjacent Occupy whether node receives the first heartbeat message that secondary nodal point sends.
Step 704, primary nodal point receive the sound carrying reception state that other neighbor nodes each send Message, reception state is answered to be used for indicating whether to receive the first heartbeat message.
In the present embodiment, after other neighbor nodes each receive the request message that primary nodal point sends, Whether self receives the reception state of the first heartbeat message carry and be sent to first segment in the response message Point.
Step 705, at primary nodal point according to the reception state carried in the response message received, determine Going out in the case of other neighbor nodes all do not receive the first heartbeat message, primary nodal point determines secondary nodal point Break down.
In the present embodiment, each other neighbor node is receiving the request message that primary nodal point sends Afterwards, all can return the response message carrying reception state to primary nodal point, primary nodal point is according to reception To other neighbor nodes each send the response message carrying reception state, it is judged that other neighbours save Whether point receives the first heartbeat message, is judging that other neighbor nodes are all not received by secondary nodal point During the first heartbeat message sent, i.e. can determine that secondary nodal point there occurs fault.
In the fault detection method of the group system interior joint that the embodiment of the present invention provides, secondary nodal point passes through Sending the first heartbeat message to primary nodal point and other neighbor nodes concurrently, primary nodal point judges presetting Whether receiving the first heartbeat message that secondary nodal point sends in time, wherein, primary nodal point is second section The neighbor node of point, the first heartbeat message is that secondary nodal point is concurrently to each neighbor node of secondary nodal point The heartbeat message sent, the number of all neighbor nodes of secondary nodal point is two or more;This is preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point does not connects at self In the case of receiving the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point receive One heartbeat message, and the most all do not receive this first heart beating at other neighbor nodes determining this secondary nodal point In the case of message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to a heart Hop cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide carries out fault inspection During survey, it is to avoid prior art needs just can be detected by multiple heart beat cycles node whether occur therefore The phenomenon of barrier, shortens the cycle of fault detect, thus improves the efficiency of node failure detection.
The flow process of the fault detection method embodiment four of the group system interior joint that Fig. 8 provides for the present invention is shown It is intended to.On the basis of embodiment illustrated in fig. 7, after primary nodal point is determined that secondary nodal point breaks down, Each node redefines the embodiment of neighbor node, elaborates.As shown in Figure 8, the present embodiment Method may include that
Step 801, primary nodal point generate the first vote information, and receive other neighbor nodes each transmission The second vote information, the first vote information includes the node identification that node that primary nodal point is elected is corresponding; The node mark that the node of the neighbor node election that the second vote information includes sending the second vote information is corresponding Know.
In the present embodiment, after the neighbor node of secondary nodal point determines that secondary nodal point breaks down, All of neighbor node is required to recalculate respective neighbor node.For purposes of illustration only, can be by second Any one neighbor node of node is as primary nodal point, and primary nodal point needs to generate the first vote information, This first vote information comprises node identification corresponding to node and the ballot foundation of primary nodal point election. It addition, primary nodal point also to receive the second vote information that other neighbor nodes each send, this second throwing Node identification that the node of the neighbor node election that ticket information includes sending the second vote information is corresponding and Ballot foundation.In actual applications, ballot is according to relevant with many factors, such as: loading condition, joint Size, nodal cache newness degree and the meshed network bandwidth etc. of some numbering, such as: primary nodal point is permissible Minimum by judging the load which node is undertaken, and by node corresponding for this node of load minimum Mark is carried and is sent to other neighbor nodes in the first vote information.Same, other neighbor nodes are also By similar mode, the second vote information can be sent to primary nodal point.
Step 802, primary nodal point save according to the node identification in the first vote information and other neighbours each Node identification in the second vote information that point sends, in all nodes that statistics is elected, each node obtains The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;3rd node is replacement Two nodes and concurrently to the 3rd node all neighbor nodes send heartbeat message node;Section three, All neighbor nodes of point include neighbor node and the neighbor node of secondary nodal point of the 3rd node self.
In the present embodiment, primary nodal point is receiving the second ballot letter that other neighbor nodes each send After breath, according to the node identification in the first vote information that self generates and the second vote information received In node identification, it may be determined that go out the 3rd node.During concrete implementation, can be according to first The node identification carried in vote information and the second vote information, by the way of vote by ballot, adds up quilt The ballot quantity that in all nodes of election, each node obtains, and the node most by obtaining ballot quantity As the 3rd node.The neighbor node of the secondary nodal point that the 3rd node breaks down for adapter, Ye Jijie Incidence relation between pipe secondary nodal point and other nodes, therefore, the 3rd node general's replacement secondary nodal point, And send heartbeat message, wherein, owning of the 3rd node to all neighbor nodes of the 3rd node concurrently Neighbor node, in addition to including the neighbor node of the 3rd node self, also includes neighbours' joint of secondary nodal point Point.
Step 803, primary nodal point according in the neighbor node of the 3rd node and other neighbor nodes except the 3rd Node outside node, redefines the neighbor node of primary nodal point.
In the present embodiment, all neighbor nodes of secondary nodal point determine by the way of vote by ballot After three nodes, if primary nodal point is the 3rd node, then primary nodal point will take over the adjacent pass of secondary nodal point System, other neighbor nodes can according to the neighbouring relations after the neighbor node of primary nodal point adapter secondary nodal point, Respective neighbor node is determined again through calculating;If primary nodal point is not the 3rd node, then primary nodal point After treating that the 3rd node redefines out neighbouring relations, neighbor node and other according to the 3rd node are adjacent Occupy the node in addition to the 3rd node in node, redefine the neighbor node of self.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, secondary nodal point is by also Row ground sends the first heartbeat message to primary nodal point and other neighbor nodes, and primary nodal point judges when default In whether receive secondary nodal point send the first heartbeat message, wherein, primary nodal point is secondary nodal point Neighbor node, the first heartbeat message is that secondary nodal point is sent out to each neighbor node of secondary nodal point concurrently The heartbeat message sent, the number of all neighbor nodes of secondary nodal point is two or more;When this is default Between more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point does not receives at self In the case of the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point receive first Heartbeat message, and the most all do not receive this first heart beating report at other neighbor nodes determining this secondary nodal point In the case of literary composition, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to a heart beating Cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide carries out fault detect Time, it is to avoid prior art needs just to detect whether node breaks down by multiple heart beat cycles Phenomenon, shorten the cycle of fault detect, thus improve the efficiency of node failure detection.It addition, By after determining that primary nodal point breaks down, redefine respective neighbor node, so continue into Row fault detect, improves the accuracy of fault detect.
Alternatively, in the response message that primary nodal point sends according to other neighbor nodes each received The reception state carried, determines that at least one other neighbor node receives the situation of the first heartbeat message Under, primary nodal point determine described in do not receive the link between node and the secondary nodal point of the first heartbeat message Break down.
Specifically, primary nodal point is not receiving the first heartbeat message that secondary nodal point sends, and to each Other nodes send request message, to inquire whether other neighbor nodes each receive the first heartbeat message Afterwards, if determining at least other neighbours according to the response message of other neighbor nodes each transmission Node have received the first heartbeat message, then primary nodal point can determine that secondary nodal point is normal, and It is probably secondary nodal point and primary nodal point and does not receives node and the primary nodal point of the first heartbeat message Between link there occurs fault, wherein, the node not receiving the first heartbeat message includes primary nodal point With the neighbor node not receiving the first heartbeat message in other neighbor nodes.
Alternatively, described primary nodal point saves according to neighbor node and other neighbours described of described 3rd node Node in addition to described 3rd node in point, redefines the neighbor node of described primary nodal point.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, owing to primary nodal point exists In the case of determining that at least one other neighbor node receives the first heartbeat message, primary nodal point determines The link not received between node and the secondary nodal point of the first heartbeat message breaks down so that fault is examined Survey more comprehensive.
Fig. 9 is the structural representation of the failure detector embodiment one of group system interior joint of the present invention Figure, as it is shown in figure 9, the failure detector 10 of the group system interior joint of embodiment of the present invention offer Including judge module 11, sending module 12, receiver module 13, determine module 14 and generation module 15。
Wherein, it is judged that module 11 is for judging in Preset Time whether receiver module 13 receives the The first heartbeat message that two nodes send;Described primary nodal point is the neighbor node of described secondary nodal point, Described first heartbeat message is that described secondary nodal point saves to each neighbours of described secondary nodal point concurrently Point send a heartbeat message, the number of all neighbor nodes of described secondary nodal point be two with On;Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Institute State judge module 11 and judge that described receiver module 13 does not receives first that described secondary nodal point sends In the case of heartbeat message, sending module 12 is in all neighbor nodes of described secondary nodal point Other neighbor nodes in addition to described primary nodal point send request message;Described request message is used for asking Ask whether other neighbor nodes described receive described first heartbeat message;Described receiver module 13 is also For receiving the response message carrying reception state that other neighbor nodes described send, described reception State is used for indicating whether to receive described first heartbeat message;Determine that module 14 is for according to described The described response message that other neighbor nodes each described that receiver module 13 receives send carries Reception state, it is determined whether other neighbor nodes described all do not receive described first heartbeat message; Determine that module 14 determines that other neighbor nodes described all do not receive described first heart beating report described In the case of literary composition, described determine that module 14 is additionally operable to determine that described secondary nodal point breaks down.Institute State primary nodal point according to the reception state carried in the described response message received, determine described its In the case of his neighbor node does not all receive described first heartbeat message, determine that module 14 is for really Fixed described secondary nodal point breaks down.
The failure detector of the group system interior joint that the embodiment of the present invention provides, it is judged that module judges In Preset Time, whether receiver module receives the first heartbeat message that secondary nodal point sends, first heart Jumping message is the heart beating report that secondary nodal point sends to each neighbor node of secondary nodal point concurrently Literary composition, the number of all neighbor nodes of secondary nodal point is two or more;This Preset Time is more than or equal to One heart beat cycle, and less than two heart beat cycles;Receiver module is not receiving the first heartbeat message In the case of, sending module sends request message to other neighbor nodes of this secondary nodal point, with inquiry Whether other neighbor nodes receive the first heartbeat message, and are determining that module determines this secondary nodal point Other neighbor nodes the most all do not receive this first heartbeat message in the case of, determine that secondary nodal point is sent out Give birth to fault.Owing to Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles, So use the technical scheme that the present invention provides when carrying out fault detect, it is to avoid prior art needs Just can be detected the phenomenon whether node breaks down by multiple heart beat cycles, shorten fault detect Cycle, thus improve node failure detection efficiency.
Alternatively, generation module 15 is additionally operable to generate the first vote information, and described first vote information includes The node identification corresponding to node of described primary nodal point election;
Described receiver module 13 is additionally operable to receive the second ballot letter that each other neighbor nodes described send Breath, described second vote information includes the node pair sending the neighbor node election of described second vote information The node identification answered;
Described determine that module 14 is additionally operable to according to the node identification in described first vote information and each institute State the node identification in the second vote information that other neighbor nodes send, all nodes that statistics is elected In the ballot quantity that obtains of each node, and using nodes most for quantity of voting as the 3rd node;Described 3rd node be substitute described secondary nodal point and concurrently all neighbor nodes to described 3rd node send out Send the node of heartbeat message;All neighbor nodes of described 3rd node include described 3rd node self Neighbor node and the neighbor node of described secondary nodal point.
Alternatively, described determine module 14 according to described receiver module 13 receive each described The reception state carried in the described response message that other neighbor nodes send, determines at least one institute State in the case of other neighbor nodes receive described first heartbeat message,
Described determine that module 14 is additionally operable to determine node and the institute not receiving described first heartbeat message The link stated between secondary nodal point breaks down;The described node not receiving described first heartbeat message Described first heartbeat message is not received including in described primary nodal point and other neighbor nodes described Node.
Alternatively, described determine that module 14 is additionally operable to the neighbor node according to described 3rd node and institute State the node in addition to described 3rd node in other neighbor nodes, redefine described primary nodal point Neighbor node.
The failure detector of the group system interior joint of the present embodiment, may be used for performing the present invention any The technical scheme of the fault detection method of the group system interior joint that embodiment is provided, its realize principle and Technique effect is similar to, and here is omitted.
Figure 10 is the structural representation of the fault detection system embodiment one of group system interior joint of the present invention Figure, as shown in Figure 10, the fault detection system of the group system interior joint that the embodiment of the present invention provides 20 include primary nodal point 21, secondary nodal point 22 and other neighbor nodes 23, described primary nodal point 21 For the neighbor node of described secondary nodal point 22, other neighbor nodes 23 described are described secondary nodal point 22 All neighbor nodes in node in addition to described primary nodal point 21, other neighbor nodes 23 described Number be more than one.
Wherein, described secondary nodal point 22 is for concurrently to described primary nodal point and other neighbor nodes described Send the first heartbeat message;Whether described primary nodal point 21 is described in judge to receive in Preset Time First heartbeat message;Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beating weeks Phase;In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point 21 Being additionally operable to send request message respectively to each other neighbor nodes described, described request message is used for inquiring Whether each other neighbor nodes described receive described first heartbeat message;Described primary nodal point 21 is also used In receiving the response message carrying reception state that each other neighbor nodes described send, described reception State is used for indicating whether to receive described first heartbeat message;At described primary nodal point according to receiving The reception state carried in the described response message that each other neighbor nodes described send, determines described In the case of other neighbor nodes all do not receive described first heartbeat message, described primary nodal point 21 is also used In determining that described secondary nodal point breaks down.
In the fault detection system of the group system interior joint that the embodiment of the present invention provides, it is judged that module judges In Preset Time, whether receiver module receives the first heartbeat message that secondary nodal point sends, the first heart beating Message is the heartbeat message that secondary nodal point sends to each neighbor node of secondary nodal point concurrently, the The number of all neighbor nodes of two nodes is two or more;This Preset Time is more than or equal to a heart beating Cycle, and less than two heart beat cycles;Receiver module in the case of not receiving the first heartbeat message, Sending module sends request message to other neighbor nodes of this secondary nodal point, to inquire other neighbor nodes Whether receive the first heartbeat message, and determining that module determines other neighbor nodes of this secondary nodal point In the case of the most all not receiving this first heartbeat message, determine that secondary nodal point there occurs fault.Due in advance If the time is more than or equal to a heart beat cycle, and less than two heart beat cycles, so using the present invention to carry When the technical scheme of confession carries out fault detect, it is to avoid prior art needs by multiple heart beat cycles The phenomenon whether node breaks down can be detected, shorten the cycle of fault detect, thus improve joint The efficiency of point failure detection.
In the above-described embodiments, after described primary nodal point 21 determines that described secondary nodal point breaks down, also Including: described primary nodal point 21 is additionally operable to:
Generate the first vote information, and receive the second vote information that each other neighbor nodes described send, Described first vote information includes the node identification that node that described primary nodal point is elected is corresponding, described second The node mark that the node of the neighbor node election that vote information includes sending described second vote information is corresponding Know;
And, send out according to the node identification in described first vote information and each other neighbor nodes described Node identification in the second vote information sent, in all nodes that statistics is elected, each node obtains Ballot quantity, and using nodes most for quantity of voting as the 3rd node, described 3rd node is for substituting institute State secondary nodal point and send the node of heartbeat message concurrently to all neighbor nodes of described 3rd node; All neighbor nodes of described 3rd node include the neighbor node and described second of described 3rd node self The neighbor node of node.
In the above-described embodiments, at described primary nodal point according to other neighbor nodes each described received The reception state carried in the described response message sent, determines at least one other neighbor node described In the case of receiving described first heartbeat message,
Described primary nodal point 21 is additionally operable to determine that the node not receiving described first heartbeat message is with described Link between secondary nodal point breaks down;The described node not receiving described first heartbeat message includes Described primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
In the above-described embodiments, described primary nodal point 21 is additionally operable to the neighbor node according to described 3rd node With node in addition to described 3rd node in other neighbor nodes described, redefine described primary nodal point Neighbor node.
Said system embodiment can be used for performing the technical scheme of embodiment of the method accordingly, and it realizes principle Similar with technique effect, here is omitted.
Figure 11 is the structural representation of node embodiment one of the present invention, as shown in figure 11, the present embodiment Node 600 includes processor 601, user interface 603, network interface 604 and memorizer 605, sends Device 606 and receptor 607, memorizer 605 can include operating system 6051, application program 6052 etc.. Processor 601 can be central processing unit (Central Processing Unit, CPU).Memorizer 605 For storing executable instruction.Processor 601 can perform the executable instruction of storage in memorizer 605. Wherein, receptor 607 is for receiving the first heartbeat message that secondary nodal point sends;Described processor 601 For judging whether described receptor 607 receives the first heart beating that secondary nodal point sends in Preset Time Message;Described first heartbeat message is that described secondary nodal point is concurrently to each neighbours of described secondary nodal point The heartbeat message that node sends, the number of all neighbor nodes of described secondary nodal point is two or more; Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;In described process Device 601 judges that described receptor 607 does not receives the first heartbeat message of described secondary nodal point transmission In the case of, transmitter 606 is for removing described primary nodal point in all neighbor nodes of described secondary nodal point Outside other neighbor nodes send request message, described request message is used for inquiring described other neighbours joint Whether point receives described first heartbeat message, and described primary nodal point is the neighbor node of described secondary nodal point; Described receptor 607 is additionally operable to receive the response carrying reception state that other neighbor nodes described send Message, described reception state is used for indicating whether to receive described first heartbeat message;Described processor 601 The described response sent for other neighbor nodes each described received according to described receptor 607 disappears The reception state carried in breath, it is determined whether other neighbor nodes described all do not receive described first heart beating Message;Determine that other neighbor nodes described all do not receive described first heart beating at described processor 601 In the case of message, described processor 601 is additionally operable to determine that described secondary nodal point breaks down.
The node that the present embodiment provides, may be used for performing the cluster system that any embodiment of the present invention is provided The technical scheme of the fault detection method of system interior joint, it is similar with technique effect that it realizes principle, the most not Repeat again.
Alternatively, described processor 601 is additionally operable to generate the first vote information, described first vote information Including the node identification that the node of described primary nodal point election is corresponding;
Described receptor 607 is additionally operable to receive the second ballot letter that each other neighbor nodes described send Breath, described second vote information includes the node pair sending the neighbor node election of described second vote information The node identification answered;
Described processor 601 is additionally operable to according to the node identification in described first vote information and each described Node identification in the second vote information that other neighbor nodes send, in all nodes that statistics is elected The ballot quantity that each node obtains, and using nodes most for quantity of voting as the 3rd node;Described Three nodes are to substitute described secondary nodal point and send to all neighbor nodes of described 3rd node concurrently The node of heartbeat message;All neighbor nodes of described 3rd node include the neighbour of described 3rd node self Occupy node and the neighbor node of described secondary nodal point.
Alternatively, described processor 601 according to described receptor 607 receive each described other The reception state carried in the described response message that neighbor node sends, determine at least one described other In the case of neighbor node receives described first heartbeat message, described processor 601 is additionally operable to determine not The link received between node and the described secondary nodal point of described first heartbeat message breaks down;Described The node not receiving described first heartbeat message includes in described primary nodal point and other neighbor nodes described Do not receive the node of described first heartbeat message.
Alternatively, described processor 601 be additionally operable to the neighbor node according to described 3rd node and described its Node in addition to described 3rd node in his neighbor node, redefines neighbours' joint of described primary nodal point Point.
The node that the present embodiment provides, may be used for performing the cluster system that any embodiment of the present invention is provided The technical scheme of the fault detection method of system interior joint, it is similar with technique effect that it realizes principle, the most not Repeat again.
One of ordinary skill in the art will appreciate that: realize all or part of step of above-mentioned each method embodiment Suddenly can be completed by the hardware that programmed instruction is relevant.Aforesaid program can be stored in a computer can Read in storage medium.This program upon execution, performs to include the step of above-mentioned each method embodiment;And Aforesaid storage medium includes: ROM, RAM, magnetic disc or CD etc. are various can store program code Medium.
Last it is noted that various embodiments above is only in order to illustrate technical scheme, rather than right It limits;Although the present invention being described in detail with reference to foregoing embodiments, this area common Skilled artisans appreciate that the technical scheme described in foregoing embodiments still can be modified by it, Or the most some or all of technical characteristic is carried out equivalent;And these amendments or replacement, and The essence not making appropriate technical solution departs from the scope of various embodiments of the present invention technical scheme.

Claims (12)

1. the fault detection method of a group system interior joint, it is characterised in that including:
Primary nodal point judges whether receive the first heartbeat message that secondary nodal point sends in Preset Time; Described primary nodal point is the neighbor node of described secondary nodal point, and described first heartbeat message is described second section The heartbeat message that point sends to each neighbor node of described secondary nodal point concurrently, described second section The number of all neighbor nodes of point is two or more;Described Preset Time is more than or equal to a heart beating week Phase, and less than two heart beat cycles;
In the case of described primary nodal point does not receives the first heartbeat message that described secondary nodal point sends, Described primary nodal point other in addition to described primary nodal point in all neighbor nodes of described secondary nodal point Neighbor node sends request message, and described request message is used for inquiring whether other neighbor nodes described receive To described first heartbeat message;
Described primary nodal point receives the response carrying reception state of described other neighbor nodes transmission and disappears Breath, described reception state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report In the case of literary composition, described primary nodal point determines that described secondary nodal point breaks down.
Method the most according to claim 1, it is characterised in that described primary nodal point determines described After two nodes break down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding Mark;Described second vote information includes the node sending the neighbor node election of described second vote information Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described Node identification in the second vote information that node sends, each node in all nodes that statistics is elected The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and The neighbor node of described secondary nodal point.
Method the most according to claim 1 and 2, it is characterised in that also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that at least one other neighbor node described receives described first heart In the case of jumping message, described primary nodal point determines node and the institute not receiving described first heartbeat message The link stated between secondary nodal point breaks down;The described node bag not receiving described first heartbeat message Include the node not receiving described first heartbeat message in described primary nodal point and other neighbor nodes described.
4. the fault detection method of a group system interior joint, it is characterised in that described method includes:
Secondary nodal point sends the first heartbeat message to primary nodal point and other neighbor nodes concurrently;Described One node is the neighbor node of described secondary nodal point, and other neighbor nodes described are the institute of described secondary nodal point Having the node in addition to described primary nodal point in neighbor node, the number of other neighbor nodes described is one Above;
Described primary nodal point judges whether receive described first heartbeat message in Preset Time;Described pre- If the time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point to Each other neighbor nodes described send request message respectively, and described request message is used for inquiring each described Whether other neighbor nodes receive described first heartbeat message;
Described primary nodal point receives the response carrying reception state that each other neighbor nodes described send Message, described reception state is used for indicating whether to receive described first heartbeat message;
At described primary nodal point according to the reception state carried in the described response message received, determine In the case of other neighbor nodes described all do not receive described first heartbeat message, described primary nodal point is true Fixed described secondary nodal point breaks down.
Method the most according to claim 4, it is characterised in that described primary nodal point determines described After two nodes break down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding Mark;Described second vote information includes the node sending the neighbor node election of described second vote information Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described Node identification in the second vote information that node sends, each node in all nodes that statistics is elected The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and The neighbor node of described secondary nodal point.
6. according to the method described in claim 4 or 5, it is characterised in that also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that at least one other neighbor node described receives described first heart Jump in the case of message, the most described primary nodal point determine the node not receiving described first heartbeat message with Link between described secondary nodal point breaks down;The described node not receiving described first heartbeat message Including the node not receiving the first heartbeat message in described primary nodal point and other neighbor nodes described.
7. the failure detector of a group system interior joint, it is characterised in that including:
Judge module, for judging whether receiver module receives what secondary nodal point sent in Preset Time First heartbeat message;Described primary nodal point is the neighbor node of described secondary nodal point, described first heart beating report Literary composition is the heart beating report that described secondary nodal point sends to each neighbor node of described secondary nodal point concurrently Literary composition, the number of all neighbor nodes of described secondary nodal point is two or more;Described Preset Time more than or Equal to a heart beat cycle, and less than two heart beat cycles;
Judge that described receiver module does not receives first that described secondary nodal point sends at described judge module In the case of heartbeat message,
Sending module, in all neighbor nodes of described secondary nodal point in addition to described primary nodal point Other neighbor nodes send request message, described request message is used for inquiring that other neighbor nodes described are No receive described first heartbeat message;
Described receiver module, is additionally operable to the reception state that carries of other neighbor nodes transmission described in receiving Response message, described reception state is used for indicating whether to receive described first heartbeat message;
Determine module, send for other neighbor nodes each described received according to described receiver module Described response message in the reception state carried, it is determined whether other neighbor nodes described all do not receive Described first heartbeat message;
Determine that module determines that other neighbor nodes described all do not receive described first heartbeat message described In the case of, described determine module, be additionally operable to determine that described secondary nodal point breaks down.
Device the most according to claim 7, it is characterised in that to determine that module determines described described After secondary nodal point breaks down, also include:
Generation module, is additionally operable to generate the first vote information, and described first vote information includes described first The node identification corresponding to node of node election;
Described receiver module, is additionally operable to receive the second vote information that each other neighbor nodes described send, Described second vote information includes that the node sending the neighbor node election of described second vote information is corresponding Node identification;
Described determine module, be additionally operable to according to the node identification in described first vote information and each described Node identification in the second vote information that other neighbor nodes send, in all nodes that statistics is elected The ballot quantity that each node obtains, and using nodes most for quantity of voting as the 3rd node;Described Three nodes are to substitute described secondary nodal point and send to all neighbor nodes of described 3rd node concurrently The node of heartbeat message;All neighbor nodes of described 3rd node include the neighbour of described 3rd node self Occupy node and the neighbor node of described secondary nodal point.
9. according to the device described in claim 7 or 8, it is characterised in that:
Send at described other neighbor nodes each described determining that module receives according to described receiver module Described response message in the reception state carried, determine that at least one other neighbor node described receives In the case of described first heartbeat message,
Described determine that module is additionally operable to determine the node and described second not receiving described first heartbeat message Link between node breaks down;The described node not receiving described first heartbeat message includes described Primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
10. the fault detection system of a group system interior joint, it is characterised in that include primary nodal point, Secondary nodal point and other neighbor nodes, described primary nodal point is the neighbor node of described secondary nodal point, described Other neighbor nodes be described secondary nodal point all neighbor nodes in joint in addition to described primary nodal point Point, the number of other neighbor nodes described is more than one, including:
Described secondary nodal point, for sending the to described primary nodal point and other neighbor nodes described concurrently One heartbeat message;
Described primary nodal point, for judging whether receive described first heartbeat message in Preset Time; Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point is also For sending request message respectively to each other neighbor nodes described, described request message is used for inquiring often Whether other neighbor nodes described in receive described first heartbeat message;And, described primary nodal point is also For receiving the response message carrying reception state that each other neighbor nodes described send, described in connect Receipts state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report In the case of literary composition, described primary nodal point is additionally operable to determine that described secondary nodal point breaks down.
11. systems according to claim 10, it is characterised in that described primary nodal point determines described After secondary nodal point breaks down, also include:
Described primary nodal point is additionally operable to:
Generate the first vote information, and receive the second vote information that each other neighbor nodes described send, Described first vote information includes the node identification that node that described primary nodal point is elected is corresponding, described second The node mark that the node of the neighbor node election that vote information includes sending described second vote information is corresponding Know;
And, send out according to the node identification in described first vote information and each other neighbor nodes described Node identification in the second vote information sent, in all nodes that statistics is elected, each node obtains Ballot quantity, and using nodes most for quantity of voting as the 3rd node, described 3rd node is for substituting institute State secondary nodal point and send the node of heartbeat message concurrently to all neighbor nodes of described 3rd node; All neighbor nodes of described 3rd node include the neighbor node and described second of described 3rd node self The neighbor node of node.
12. according to the system described in claim 10 or 11, it is characterised in that:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears The reception state carried in breath, determines that at least one other neighbor node described receives described first heart In the case of jumping message,
Described primary nodal point is additionally operable to determine the node and described second not receiving described first heartbeat message Link between node breaks down;The described node not receiving described first heartbeat message includes described Primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
CN201510306800.0A 2015-06-05 2015-06-05 The fault detection method and device of group system interior joint Active CN106301853B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201510306800.0A CN106301853B (en) 2015-06-05 2015-06-05 The fault detection method and device of group system interior joint
PCT/CN2016/073606 WO2016192408A1 (en) 2015-06-05 2016-02-05 Fault detection method and apparatus for node in cluster system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510306800.0A CN106301853B (en) 2015-06-05 2015-06-05 The fault detection method and device of group system interior joint

Publications (2)

Publication Number Publication Date
CN106301853A true CN106301853A (en) 2017-01-04
CN106301853B CN106301853B (en) 2019-06-18

Family

ID=57440098

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510306800.0A Active CN106301853B (en) 2015-06-05 2015-06-05 The fault detection method and device of group system interior joint

Country Status (2)

Country Link
CN (1) CN106301853B (en)
WO (1) WO2016192408A1 (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107566219A (en) * 2017-09-27 2018-01-09 华为技术有限公司 Method for diagnosing faults, node device and computer equipment applied to group system
CN107864486A (en) * 2017-12-26 2018-03-30 杭州迪普科技股份有限公司 A kind of offline AP detection methods and device
CN108092857A (en) * 2018-01-15 2018-05-29 郑州云海信息技术有限公司 A kind of distributed system heartbeat detecting method and relevant apparatus
CN108337274A (en) * 2017-01-19 2018-07-27 贵州白山云科技有限公司 A kind of message distributing method and system
CN108683561A (en) * 2018-05-16 2018-10-19 杭州迪普科技股份有限公司 A kind of station state detection method and device
CN109218141A (en) * 2018-11-20 2019-01-15 郑州云海信息技术有限公司 A kind of malfunctioning node detection method and relevant apparatus
CN109428740A (en) * 2017-08-21 2019-03-05 华为技术有限公司 The method and apparatus that equipment fault restores
CN109525408A (en) * 2017-09-18 2019-03-26 杭州海康威视系统技术有限公司 A kind of unit exception processing method, device and cloud storage system
CN109714183A (en) * 2017-10-26 2019-05-03 阿里巴巴集团控股有限公司 Data processing method and device in a kind of cluster
CN109873719A (en) * 2019-02-03 2019-06-11 华为技术有限公司 A kind of fault detection method and device
CN110324166A (en) * 2018-03-31 2019-10-11 华为技术有限公司 A kind of method, apparatus and system of target information synchronous in multiple nodes
CN110377570A (en) * 2017-10-12 2019-10-25 腾讯科技(深圳)有限公司 Node switching method, device, computer equipment and storage medium
CN110380934A (en) * 2019-07-23 2019-10-25 南京航空航天大学 A kind of distribution redundant system heartbeat detecting method
CN111181763A (en) * 2019-11-28 2020-05-19 泰康保险集团股份有限公司 Network fault reporting method and device
CN111586110A (en) * 2020-04-22 2020-08-25 广州锦行网络科技有限公司 Optimization processing method for raft in point-to-point fault
WO2020220231A1 (en) * 2019-04-29 2020-11-05 华为海洋网络有限公司 Submarine cable failure determination method and apparatus
CN112398905A (en) * 2020-09-28 2021-02-23 联想(北京)有限公司 Node and information synchronization method
CN112468372A (en) * 2017-04-10 2021-03-09 华为技术有限公司 Equipment state detection method and device in power line communication network
CN112911520A (en) * 2019-12-04 2021-06-04 哈尔滨海能达科技有限公司 Method, device and storage medium for determining master node in ad hoc network
CN112988463A (en) * 2021-02-23 2021-06-18 新华三大数据技术有限公司 Fault node isolation method and device
CN113542052A (en) * 2021-06-07 2021-10-22 新华三信息技术有限公司 Node fault determination method and device and server
CN113783735A (en) * 2021-09-24 2021-12-10 小红书科技有限公司 Method, device, equipment and medium for identifying fault node in Redis cluster
CN115102886A (en) * 2022-06-21 2022-09-23 上海驻云信息科技有限公司 Task scheduling method and device for multiple acquisition clients
CN116260705A (en) * 2022-12-21 2023-06-13 广西壮族自治区自然资源信息中心 Geographic information distributed cluster fault processing method, device, medium and equipment

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018214106A1 (en) * 2017-05-25 2018-11-29 深圳市伊特利网络科技有限公司 Update method and system for network connection list
WO2019000954A1 (en) * 2017-06-30 2019-01-03 中兴通讯股份有限公司 Method, device and system for monitoring node survival state
US10547499B2 (en) 2017-09-04 2020-01-28 International Business Machines Corporation Software defined failure detection of many nodes
CN109302445B (en) * 2018-08-14 2021-10-12 新华三云计算技术有限公司 Host node state determination method and device, host node and storage medium
CN113923105B (en) * 2021-12-13 2022-04-22 中机联科技(广东)有限公司 Internet of things equipment fault monitoring method and system based on block chain

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070294596A1 (en) * 2006-05-22 2007-12-20 Gissel Thomas R Inter-tier failure detection using central aggregation point
CN101159536A (en) * 2007-10-30 2008-04-09 中兴通讯股份有限公司 Media gateway node condition synchronizing method in dual-home network
CN102204169A (en) * 2011-05-12 2011-09-28 华为技术有限公司 Fault detection method, route node and system
CN102612110A (en) * 2012-03-02 2012-07-25 浙江大学 Distributive self-organized routing method in electric carrier wave illumination control system
CN102821011A (en) * 2012-08-28 2012-12-12 北京星网锐捷网络技术有限公司 Opposite terminal state detection method, device and equipment
CN103297396A (en) * 2012-02-28 2013-09-11 国际商业机器公司 Management failure transferring device and method in cluster system
CN103916275A (en) * 2014-03-31 2014-07-09 杭州华三通信技术有限公司 BFD detection device and method
US20140301401A1 (en) * 2013-04-07 2014-10-09 Hangzhou H3C Technologies Co., Ltd. Providing aggregation link groups in logical network device
CN104283711A (en) * 2014-09-29 2015-01-14 中国联合网络通信集团有限公司 Fault detection method based on BFD, nodes and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102752143B (en) * 2012-07-05 2015-08-19 杭州华三通信技术有限公司 The BFD detection method of MPLS TE bidirectional tunnel and routing device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070294596A1 (en) * 2006-05-22 2007-12-20 Gissel Thomas R Inter-tier failure detection using central aggregation point
CN101159536A (en) * 2007-10-30 2008-04-09 中兴通讯股份有限公司 Media gateway node condition synchronizing method in dual-home network
CN102204169A (en) * 2011-05-12 2011-09-28 华为技术有限公司 Fault detection method, route node and system
CN103297396A (en) * 2012-02-28 2013-09-11 国际商业机器公司 Management failure transferring device and method in cluster system
CN102612110A (en) * 2012-03-02 2012-07-25 浙江大学 Distributive self-organized routing method in electric carrier wave illumination control system
CN102821011A (en) * 2012-08-28 2012-12-12 北京星网锐捷网络技术有限公司 Opposite terminal state detection method, device and equipment
US20140301401A1 (en) * 2013-04-07 2014-10-09 Hangzhou H3C Technologies Co., Ltd. Providing aggregation link groups in logical network device
CN103916275A (en) * 2014-03-31 2014-07-09 杭州华三通信技术有限公司 BFD detection device and method
CN104283711A (en) * 2014-09-29 2015-01-14 中国联合网络通信集团有限公司 Fault detection method based on BFD, nodes and system

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108337274A (en) * 2017-01-19 2018-07-27 贵州白山云科技有限公司 A kind of message distributing method and system
CN112468372B (en) * 2017-04-10 2023-10-13 华为技术有限公司 Method and device for detecting equipment state in power line communication network
CN112468372A (en) * 2017-04-10 2021-03-09 华为技术有限公司 Equipment state detection method and device in power line communication network
CN109428740A (en) * 2017-08-21 2019-03-05 华为技术有限公司 The method and apparatus that equipment fault restores
CN109428740B (en) * 2017-08-21 2020-09-08 华为技术有限公司 Method and device for recovering equipment failure
CN109525408A (en) * 2017-09-18 2019-03-26 杭州海康威视系统技术有限公司 A kind of unit exception processing method, device and cloud storage system
CN109525408B (en) * 2017-09-18 2021-12-21 杭州海康威视系统技术有限公司 Equipment exception handling method and device and cloud storage system
CN107566219A (en) * 2017-09-27 2018-01-09 华为技术有限公司 Method for diagnosing faults, node device and computer equipment applied to group system
CN107566219B (en) * 2017-09-27 2020-09-18 华为技术有限公司 Fault diagnosis method applied to cluster system, node equipment and computer equipment
CN110377570B (en) * 2017-10-12 2021-06-11 腾讯科技(深圳)有限公司 Node switching method and device, computer equipment and storage medium
CN110377570A (en) * 2017-10-12 2019-10-25 腾讯科技(深圳)有限公司 Node switching method, device, computer equipment and storage medium
CN109714183A (en) * 2017-10-26 2019-05-03 阿里巴巴集团控股有限公司 Data processing method and device in a kind of cluster
CN107864486A (en) * 2017-12-26 2018-03-30 杭州迪普科技股份有限公司 A kind of offline AP detection methods and device
CN108092857A (en) * 2018-01-15 2018-05-29 郑州云海信息技术有限公司 A kind of distributed system heartbeat detecting method and relevant apparatus
CN110324166A (en) * 2018-03-31 2019-10-11 华为技术有限公司 A kind of method, apparatus and system of target information synchronous in multiple nodes
CN110324166B (en) * 2018-03-31 2020-12-15 华为技术有限公司 Method, device and system for synchronizing target information in multiple nodes
CN108683561B (en) * 2018-05-16 2020-10-02 杭州迪普科技股份有限公司 Site state detection method and device
CN108683561A (en) * 2018-05-16 2018-10-19 杭州迪普科技股份有限公司 A kind of station state detection method and device
CN109218141A (en) * 2018-11-20 2019-01-15 郑州云海信息技术有限公司 A kind of malfunctioning node detection method and relevant apparatus
CN109873719A (en) * 2019-02-03 2019-06-11 华为技术有限公司 A kind of fault detection method and device
WO2020220231A1 (en) * 2019-04-29 2020-11-05 华为海洋网络有限公司 Submarine cable failure determination method and apparatus
US11265080B2 (en) 2019-04-29 2022-03-01 Hmn Technologies Co., Limited Submarine cable fault determining method and apparatus
CN110380934A (en) * 2019-07-23 2019-10-25 南京航空航天大学 A kind of distribution redundant system heartbeat detecting method
CN111181763A (en) * 2019-11-28 2020-05-19 泰康保险集团股份有限公司 Network fault reporting method and device
CN112911520B (en) * 2019-12-04 2022-05-31 哈尔滨海能达科技有限公司 Method, device and storage medium for determining master node in ad hoc network
CN112911520A (en) * 2019-12-04 2021-06-04 哈尔滨海能达科技有限公司 Method, device and storage medium for determining master node in ad hoc network
CN111586110B (en) * 2020-04-22 2021-03-19 广州锦行网络科技有限公司 Optimization processing method for raft in point-to-point fault
CN111586110A (en) * 2020-04-22 2020-08-25 广州锦行网络科技有限公司 Optimization processing method for raft in point-to-point fault
CN112398905A (en) * 2020-09-28 2021-02-23 联想(北京)有限公司 Node and information synchronization method
CN112398905B (en) * 2020-09-28 2022-05-31 联想(北京)有限公司 Node and information synchronization method
CN112988463A (en) * 2021-02-23 2021-06-18 新华三大数据技术有限公司 Fault node isolation method and device
CN112988463B (en) * 2021-02-23 2022-08-30 新华三大数据技术有限公司 Fault node isolation method and device
CN113542052A (en) * 2021-06-07 2021-10-22 新华三信息技术有限公司 Node fault determination method and device and server
CN113783735A (en) * 2021-09-24 2021-12-10 小红书科技有限公司 Method, device, equipment and medium for identifying fault node in Redis cluster
CN115102886A (en) * 2022-06-21 2022-09-23 上海驻云信息科技有限公司 Task scheduling method and device for multiple acquisition clients
CN116260705A (en) * 2022-12-21 2023-06-13 广西壮族自治区自然资源信息中心 Geographic information distributed cluster fault processing method, device, medium and equipment
CN116260705B (en) * 2022-12-21 2023-09-15 广西壮族自治区自然资源信息中心 Geographic information distributed cluster fault processing method, device, medium and equipment

Also Published As

Publication number Publication date
WO2016192408A1 (en) 2016-12-08
CN106301853B (en) 2019-06-18

Similar Documents

Publication Publication Date Title
CN106301853A (en) The fault detection method of group system interior joint and device
CA2577200C (en) Method and systems for copying data components between nodes of a network
CN103298012B (en) A kind of AP fault detection method and equipment
CN102929696B (en) A kind of distributed system Centroid structure, submission, method for supervising and device
KR20180133606A (en) Apparatus and operating method for monitoring micro controller unit having multi-core
CN105379201B (en) Method, controller and the failover interchanger of path switching
CN104301140B (en) Service request response method, device and system
CN106034045A (en) Ethernet link failure positioning method, device and system
CN105897507A (en) Node equipment state detection method and node equipment state detection device
CN108683528B (en) Data transmission method, central server, server and data transmission system
CN109039795A (en) A kind of Cloud Server resource monitoring method and system
CN110943877A (en) Network state measuring method, equipment and system
CN107426051A (en) The monitoring method of the working condition of distributed cluster system interior joint, apparatus and system
US8681645B2 (en) System and method for coordinated discovery of the status of network routes by hosts in a network
US20060031445A1 (en) Method and apparatus for determination of network topology
US10164856B2 (en) Reconciliation of asymmetric topology in a clustered environment
CN105187264B (en) A kind of method, communication equipment and the system of direct connected link quality-monitoring
Xiaolin et al. Probe selection algorithm for faulty links localization in all-optical networks
WO2023011163A1 (en) Beam quality monitoring method and apparatus, and device and medium
JP2014103516A (en) Packet transfer delay measuring apparatus, method and program
CN109120547A (en) A kind of message length determines method and device
CN110752967A (en) Interface fault detection method, device, equipment and storage medium
JPH0260338A (en) Bus type lan
KR20170122403A (en) System and method for diagnosing based on data distribution service network
CN117176626A (en) Node detection method, device, equipment and readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant