CN106301853A - The fault detection method of group system interior joint and device - Google Patents
The fault detection method of group system interior joint and device Download PDFInfo
- Publication number
- CN106301853A CN106301853A CN201510306800.0A CN201510306800A CN106301853A CN 106301853 A CN106301853 A CN 106301853A CN 201510306800 A CN201510306800 A CN 201510306800A CN 106301853 A CN106301853 A CN 106301853A
- Authority
- CN
- China
- Prior art keywords
- node
- nodal point
- neighbor
- neighbor nodes
- heartbeat message
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/10—Active monitoring, e.g. heartbeat, ping or trace-route
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Cardiology (AREA)
- General Health & Medical Sciences (AREA)
- Hardware Redundancy (AREA)
- Measuring And Recording Apparatus For Diagnosis (AREA)
Abstract
The embodiment of the present invention provides fault detection method and the device of a kind of group system interior joint, the method includes: primary nodal point judges whether receive the first heartbeat message that secondary nodal point sends in Preset Time, primary nodal point is the neighbor node of secondary nodal point, and the first heartbeat message is the heartbeat message that secondary nodal point sends to each neighbor node of secondary nodal point concurrently;In the case of primary nodal point does not receives the heartbeat message that secondary nodal point sends, in all neighbor nodes of secondary nodal point, other neighbor nodes in addition to primary nodal point send request message;Primary nodal point receives the response message carrying reception state that other neighbor nodes send;In the case of according to reception state, primary nodal point determines that other neighbor nodes all do not receive heartbeat message, primary nodal point determines that secondary nodal point breaks down.The fault detection method of the group system interior joint that the embodiment of the present invention provides and device can improve the efficiency of node failure detection.
Description
Technical field
The present embodiments relate to communication technology, particularly relate to the fault detect of a kind of group system interior joint
Method and apparatus.
Background technology
In distributed cluster system, generally include a Centroid and multiple ordinary node, work as center
After node or ordinary node break down, the reliability of distributed cluster system will be caused the biggest shadow
Ring, therefore, the most effectively carry out the fault detect of node, be very important.
Fig. 1 is the schematic diagram of the fault detection method of prior art interior joint, as it is shown in figure 1, commonly save
Point (B, C, D, E) sends heartbeat message, Centroid (M) according to heart beat cycle to Centroid (M)
According to the situation of the successive heartbeat message received in the detection cycle, detect ordinary node whether fault, its
In, a detection cycle can comprise multiple heart beat cycle.Meanwhile, Centroid (M) can also be periodically
To ordinary node (B, C, D, E) send heartbeat message, to notify that ordinary node Centroid is served as
Role and whether be in normal condition, once ordinary node (B, C, D, E) within the detection cycle not
Receive the heartbeat message that Centroid (M) sends, then can judge that Centroid (M) breaks down, this
Time, ordinary node can initiate to re-elect the operation of Centroid, if electing successfully, ordinary node will sense
Knowing new Centroid, and heartbeat message is sent to new Centroid, cluster carries out fault detect again.
But, in the prior art, by judging whether to receive the side of heartbeat message within the detection cycle
Formula detects node when whether breaking down, owing to, in the case of cluster scale is fixing, sending heart beating report
The heart beat cycle of literary composition cannot change, and therefore detects the time in cycle and also cannot change so that node failure is examined
Survey needs just can be detected by multiple heart beat cycles, and the cycle causing node failure to detect is longer, leads
Cause the inefficient of node failure detection.
Summary of the invention
The embodiment of the present invention provides fault detection method and the device of a kind of group system interior joint, is used for solving
The node failure detection that certainly prior art also exists needs just can be detected by multiple heart beat cycles, makes
Become the problem that the cycle of node failure detection is longer, thus improve the efficiency of node failure detection.
First aspect, the embodiment of the present invention provides the fault detection method of a kind of group system interior joint, bag
Include:
Primary nodal point judges whether receive the first heartbeat message that secondary nodal point sends in Preset Time;
Described primary nodal point is the neighbor node of described secondary nodal point, and described first heartbeat message is described second section
The heartbeat message that point sends to each neighbor node of described secondary nodal point concurrently, described second section
The number of all neighbor nodes of point is two or more;Described Preset Time is more than or equal to a heart beating week
Phase, and less than two heart beat cycles;
In the case of described primary nodal point does not receives the first heartbeat message that described secondary nodal point sends,
Described primary nodal point other in addition to described primary nodal point in all neighbor nodes of described secondary nodal point
Neighbor node sends request message, and described request message is used for inquiring whether other neighbor nodes described receive
To described first heartbeat message;
Described primary nodal point receives the response carrying reception state of described other neighbor nodes transmission and disappears
Breath, described reception state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report
In the case of literary composition, described primary nodal point determines that described secondary nodal point breaks down.
In conjunction with first aspect, in the first possible implementation of first aspect, described primary nodal point
After determining that described secondary nodal point breaks down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent
Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding
Mark;Described second vote information includes the node sending the neighbor node election of described second vote information
Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described
Node identification in the second vote information that node sends, each node in all nodes that statistics is elected
The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is
Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently
Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and
The neighbor node of described secondary nodal point.
In conjunction with the first possible implementation of first aspect or first aspect, in the second of first aspect
Plant in possible implementation, also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that at least one other neighbor node described receives described first heart
In the case of jumping message, described primary nodal point determines node and the institute not receiving described first heartbeat message
The link stated between secondary nodal point breaks down;The described node bag not receiving described first heartbeat message
Include the node not receiving described first heartbeat message in described primary nodal point and other neighbor nodes described.
In conjunction with first aspect, first aspect the first to any one possible reality of the second of first aspect
Existing mode, in the third possible implementation of first aspect, also includes:
Described primary nodal point according in the neighbor node of described 3rd node and other neighbor nodes described except institute
State the node outside the 3rd node, redefine the neighbor node of described primary nodal point.
Second aspect, the embodiment of the present invention provides the fault detection method of a kind of group system interior joint, institute
The method of stating includes:
Secondary nodal point sends the first heartbeat message to primary nodal point and other neighbor nodes concurrently;Described
One node is the neighbor node of described secondary nodal point, and other neighbor nodes described are the institute of described secondary nodal point
Having the node in addition to described primary nodal point in neighbor node, the number of other neighbor nodes described is one
Above;
Described primary nodal point judges whether receive described first heartbeat message in Preset Time;Described pre-
If the time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point to
Each other neighbor nodes described send request message respectively, and described request message is used for inquiring each described
Whether other neighbor nodes receive described first heartbeat message;
Described primary nodal point receives the response carrying reception state that each other neighbor nodes described send
Message, described reception state is used for indicating whether to receive described first heartbeat message;
At described primary nodal point according to the reception state carried in the described response message received, determine
In the case of other neighbor nodes described all do not receive described first heartbeat message, described primary nodal point is true
Fixed described secondary nodal point breaks down.
In conjunction with second aspect, in the first possible implementation of second aspect, described primary nodal point
After determining that described secondary nodal point breaks down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent
Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding
Mark;Described second vote information includes the node sending the neighbor node election of described second vote information
Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described
Node identification in the second vote information that node sends, each node in all nodes that statistics is elected
The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is
Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently
Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and
The neighbor node of described secondary nodal point.
In conjunction with the first possible implementation of second aspect or second aspect, in the second of second aspect
Plant in possible implementation, also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that at least one other neighbor node described receives described first heart
Jump in the case of message, the most described primary nodal point determine the node not receiving described first heartbeat message with
Link between described secondary nodal point breaks down;The described node not receiving described first heartbeat message
Including the node not receiving the first heartbeat message in described primary nodal point and other neighbor nodes described.
In conjunction with second aspect, second aspect the first to any one possible reality of the second of second aspect
Existing mode, in the third possible implementation of second aspect, also includes:
Described primary nodal point according in the neighbor node of described 3rd node and other neighbor nodes described except institute
State the node outside the 3rd node, redefine the neighbor node of described primary nodal point.
The third aspect, the embodiment of the present invention provides the failure detector of a kind of group system interior joint, bag
Include:
Judge module, for judging whether receive the first heart beating that secondary nodal point sends in Preset Time
Message;Described primary nodal point is the neighbor node of described secondary nodal point, and described first heartbeat message is described
The heartbeat message that secondary nodal point sends to each neighbor node of described secondary nodal point concurrently, described
The number of all neighbor nodes of secondary nodal point is two or more;Described Preset Time is more than or equal to one
Heart beat cycle, and less than two heart beat cycles;
Judge that receiver module does not receives the first heart beating that described secondary nodal point sends at described judge module
In the case of message,
Sending module, in all neighbor nodes of described secondary nodal point in addition to described primary nodal point
Other neighbor nodes send request message, described request message is used for inquiring that other neighbor nodes described are
No receive described first heartbeat message;
Described receiver module, is additionally operable to the reception state that carries of other neighbor nodes transmission described in receiving
Response message, described reception state is used for indicating whether to receive described first heartbeat message;
Determine module, send for other neighbor nodes each described received according to described receiver module
Described response message in the reception state carried, it is determined whether other neighbor nodes described all do not receive
Described first heartbeat message;
Determine that module determines that other neighbor nodes described all do not receive described first heartbeat message described
In the case of, described determine module, be additionally operable to determine that described secondary nodal point breaks down.
In conjunction with the third aspect, in the first possible implementation of the third aspect, determine mould described
After block determines that described secondary nodal point breaks down, also include:
Generation module, is additionally operable to generate the first vote information, and described first vote information includes described first
The node identification corresponding to node of node election;
Described receiver module, is additionally operable to receive the second vote information that each other neighbor nodes described send,
Described second vote information includes that the node sending the neighbor node election of described second vote information is corresponding
Node identification;
Described determine module, be additionally operable to according to the node identification in described first vote information and each described
Node identification in the second vote information that other neighbor nodes send, in all nodes that statistics is elected
The ballot quantity that each node obtains, and using nodes most for quantity of voting as the 3rd node;Described
Three nodes are to substitute described secondary nodal point and send to all neighbor nodes of described 3rd node concurrently
The node of heartbeat message;All neighbor nodes of described 3rd node include the neighbour of described 3rd node self
Occupy node and the neighbor node of described secondary nodal point.
In conjunction with the first possible implementation of the third aspect or the third aspect, in the second of the third aspect
Plant in possible implementation,
Send at described other neighbor nodes each described determining that module receives according to described receiver module
Described response message in the reception state carried, determine that at least one other neighbor node described receives
In the case of described first heartbeat message,
Described determine that module is additionally operable to determine the node and described second not receiving described first heartbeat message
Link between node breaks down;The described node not receiving described first heartbeat message includes described
Primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
In conjunction with the third aspect, the third aspect the first to any one possible reality of the second of the third aspect
Existing mode, in the third possible implementation of the third aspect,
Described determine that module is additionally operable to the neighbor node according to described 3rd node and other neighbor nodes described
In node in addition to described 3rd node, redefine the neighbor node of described primary nodal point.
Fourth aspect, the embodiment of the present invention provides the fault detection system of a kind of group system interior joint, bag
Including primary nodal point, secondary nodal point and other neighbor nodes, described primary nodal point is the neighbour of described secondary nodal point
Occupy node, other neighbor nodes described be described secondary nodal point all neighbor nodes in except described first segment
Node outside Dian, the number of other neighbor nodes described is more than one, including:
Described secondary nodal point, for sending the to described primary nodal point and other neighbor nodes described concurrently
One heartbeat message;
Described primary nodal point, for judging whether receive described first heartbeat message in Preset Time;
Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point is also
For sending request message respectively to each other neighbor nodes described, described request message is used for inquiring often
Whether other neighbor nodes described in receive described first heartbeat message;And, described primary nodal point is also
For receiving the response message carrying reception state that each other neighbor nodes described send, described in connect
Receipts state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report
In the case of literary composition, described primary nodal point is additionally operable to determine that described secondary nodal point breaks down.
In conjunction with fourth aspect, in the first possible implementation of fourth aspect, described primary nodal point
After determining that described secondary nodal point breaks down, also include:
Described primary nodal point is additionally operable to:
Generate the first vote information, and receive the second vote information that each other neighbor nodes described send,
Described first vote information includes the node identification that node that described primary nodal point is elected is corresponding, described second
The node mark that the node of the neighbor node election that vote information includes sending described second vote information is corresponding
Know;
And, send out according to the node identification in described first vote information and each other neighbor nodes described
Node identification in the second vote information sent, in all nodes that statistics is elected, each node obtains
Ballot quantity, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is for substituting institute
State secondary nodal point and send the node of heartbeat message concurrently to all neighbor nodes of described 3rd node;
All neighbor nodes of described 3rd node include the neighbor node and described second of described 3rd node self
The neighbor node of node.
In conjunction with the first possible implementation of fourth aspect or fourth aspect, in the second of fourth aspect
Plant in possible implementation,
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that at least one other neighbor node described receives described first heart
In the case of jumping message,
Described primary nodal point is additionally operable to determine the node and described second not receiving described first heartbeat message
Link between node breaks down;The described node not receiving described first heartbeat message includes described
Primary nodal point and other neighbor nodes described do not receive the neighbor node of described first heartbeat message.
In conjunction with fourth aspect, fourth aspect the first to any one possible reality of the second of fourth aspect
Existing mode, in the third possible implementation of fourth aspect,
Described primary nodal point is additionally operable to the neighbor node according to described 3rd node and other neighbor nodes described
In node in addition to described 3rd node, redefine the neighbor node of described primary nodal point.
In the fault detection method of the group system interior joint that the embodiment of the present invention provides and device, first segment
Point judges whether receive the first heartbeat message that secondary nodal point sends, wherein, first in Preset Time
Node is the neighbor node of secondary nodal point, and the first heartbeat message is that secondary nodal point is concurrently to secondary nodal point
The heartbeat message that each neighbor node sends, the number of all neighbor nodes of secondary nodal point is two
Above;This Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;First segment
Point, in the case of self not receiving the first heartbeat message, inquires other neighbor nodes of this secondary nodal point
Whether receive the first heartbeat message, and the most all do not receive at other neighbor nodes determining this secondary nodal point
In the case of this first heartbeat message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than
Or equal to a heart beat cycle, and less than two heart beat cycles, so using the technical side that the present invention provides
When case carries out fault detect, it is to avoid prior art needs just can be detected joint by multiple heart beat cycles
The phenomenon whether point breaks down, can shorten the cycle of fault detect, thus improves node failure inspection
The efficiency surveyed.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality
Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that under,
Accompanying drawing during face describes is some embodiments of the present invention, for those of ordinary skill in the art,
On the premise of not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the structural representation of the fault detection method of group system interior joint in prior art;
The flow process of the fault detection method embodiment one of the group system interior joint that Fig. 2 provides for the present invention
Schematic diagram;
Fig. 3 is the schematic diagram one of neighbouring relations between group system interior joint;
Fig. 4 is the schematic diagram two of neighbouring relations between group system interior joint;
The flow process of the fault detection method embodiment two of the group system interior joint that Fig. 5 provides for the present invention
Schematic diagram;
Fig. 6 A is the signal of neighbouring relations between front nodal point node failure being detected in group system
Figure;
Fig. 6 B be group system detects node failure after redefine neighbouring relations between node
Schematic diagram;
The flow process of the fault detection method embodiment three of the group system interior joint that Fig. 7 provides for the present invention
Schematic diagram;
The flow process of the fault detection method embodiment four of the group system interior joint that Fig. 8 provides for the present invention
Schematic diagram;
Fig. 9 is the structural representation of the failure detector embodiment one of group system interior joint of the present invention
Figure;
Figure 10 is the structural representation of the fault detection system embodiment one of group system interior joint of the present invention
Figure Figure 10 is the structural representation of node embodiment one of the present invention;
Figure 11 is the structural representation of node embodiment one of the present invention.
Detailed description of the invention
For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with this
Accompanying drawing in bright embodiment, is clearly and completely described the technical scheme in the embodiment of the present invention,
Obviously, described embodiment is a part of embodiment of the present invention rather than whole embodiments.Based on
Embodiment in the present invention, those of ordinary skill in the art are obtained under not making creative work premise
The every other embodiment obtained, broadly falls into the scope of protection of the invention.
The embodiment of the present invention is be applicable to group system, and it it is particularly applicable to distributed cluster system interior joint
The scene of fault detect.This distributed cluster system includes at least two node, and this node is the most permissible
It it is computer.Optionally, the node in the group system in the present embodiment is with existing group system not
It is with part: in the group system of the present embodiment, all of node is all given identical function, i.e.
All of node all has identical receiving heartbeat message and send the ability of heartbeat message, therefore, at this
In the group system of embodiment, there is not the differentiation of Centroid and ordinary node, it is not required that center
Node administration ordinary node.Optionally, the technical scheme of following embodiment is all using computer as performing master
Body is introduced.
The flow process of the fault detection method embodiment one of the group system interior joint that Fig. 2 provides for the present invention is shown
It is intended to.The method that the present embodiments relate to is applicable to distributed cluster system.The present embodiment is with computer
Introduce as a example by executive agent.As in figure 2 it is shown, the method for the present embodiment may include that
Step 201, primary nodal point judge whether receive first that secondary nodal point sends in Preset Time
Heartbeat message;Primary nodal point is the neighbor node of secondary nodal point, and the first heartbeat message is that secondary nodal point is parallel
The heartbeat message that ground sends to each neighbor node of secondary nodal point, all neighbours joint of secondary nodal point
The number of point is two or more;Preset Time is more than or equal to a heart beat cycle, and less than two heart beatings
Cycle.
In the present embodiment, secondary nodal point is according to the information of nodes all in group system, according to cluster system
The rule preset in system determines primary nodal point, and wherein, primary nodal point is any one neighbour of secondary nodal point
Occupying node, the neighbor node of secondary nodal point is the node relevant with secondary nodal point.Fig. 3 is cluster
The schematic diagram one of neighbouring relations between system interior joint, as it is shown on figure 3, in group system, node E
According to the information of all nodes, can determine that it has four neighbours according to the rule preset in group system
Node, is node A, B, C and D respectively.Wherein, primary nodal point can be node A, B, C and
Any one in D.Primary nodal point is by judging that whether receiving secondary nodal point in Preset Time sends
The first heartbeat message, detect whether secondary nodal point breaks down.Need to illustrate, second
Node is by sending heartbeat message, therefore, the first heart beating report to its all neighbor nodes concurrently
Literary composition is the heart that secondary nodal point sends to each neighbor node of secondary nodal point in the same moment concurrently
Jump message.It addition, secondary nodal point can send to its all neighbor nodes concurrently according to heart beat cycle
First heartbeat message, therefore, primary nodal point may determine that more than or equal to a heart beat cycle and little
The first heartbeat message that this secondary nodal point sends whether is received within the time of two heart beat cycles.Such as:
Assume that heart beat cycle is 5s, i.e. secondary nodal point every 5s, will send to its all neighbor nodes concurrently
Heartbeat message, the first heartbeat message sent at 5s for secondary nodal point, primary nodal point will determine that
More than or equal to 5s, and within less than the time of 10s, whether receive first heart that secondary nodal point sends
Jump message.Wherein, heart beat cycle can rule of thumb or practical situation is configured, for heart beating week
The concrete value of phase, this is not restricted for the present embodiment.
It addition, secondary nodal point periodically can send first heart to primary nodal point by a physical network
Jump message, but during owing to carrying out fault detect based on single physical network, at network failure, such as:
Management plane network failure, and when service plane network is normal, often cannot define is group system
Middle secondary nodal point there occurs that the link between fault or secondary nodal point and primary nodal point there occurs fault, or
Person's secondary nodal point and primary nodal point there occurs fault simultaneously, and thus, the testing result of causing trouble is inaccurate.
In order to solve this problem, it is preferable that the present embodiment can also send first by least two network
Heartbeat message, for example, the first heartbeat message can be sent by biplane, such as: management plane
And service plane, it is also possible to send the first heartbeat message by three planes, such as: management plane, business
Plane and signaling plane.The mode using many physical networks sends the first heartbeat message, detects node and is
No break down, the accuracy of detection can be improved.Need to illustrate, if the number of physical network
When amount is at least two, mutually isolated between this at least two physical network, so can avoid due to many
When there is some equipment shared between network, if shared device breaks down, thus cause nothing between node
The phenomenon of method proper communication, is conducive to improving the accuracy of detection.
Step 202, primary nodal point do not receive secondary nodal point send the first heartbeat message in the case of,
Primary nodal point other neighbor nodes in addition to primary nodal point in all neighbor nodes of secondary nodal point send
Request message, request message is used for inquiring whether other neighbor nodes receive the first heartbeat message.
In the prior art, ordinary node be sent to the heart beat cycle of Centroid fixing in the case of,
Because the restriction of the performance of Centroid, group system cannot infinitely increase ordinary node so that cluster system
The autgmentability of system is affected.For this problem, in the embodiment of the present invention, if primary nodal point does not exist
Receive the first heartbeat message that secondary nodal point sends in Preset Time, can primarily determine that secondary nodal point has
May have occurred fault.Owing to secondary nodal point is first heart sent to its all neighbor nodes concurrently
Jumping message, therefore, primary nodal point is by the neighbor node of secondary nodal point, and other in addition to self are adjacent
Occupy node and send request message, to inquire whether other neighbor nodes receive first that secondary nodal point sends
Heartbeat message.As can be seen here, when primary nodal point does not receives the first heartbeat message that secondary nodal point sends,
Primary nodal point can send request message to other neighbor nodes of secondary nodal point, and secondary nodal point is non-
Neighbor node also no longer will send heartbeat message to secondary nodal point, thus can reduce secondary nodal point and process the heart
Jump the quantity of message, such that it is able to alleviate the burden of secondary nodal point so that the extensibility of group system is relatively
Good.
For example, Fig. 4 is the schematic diagram two of neighbouring relations between group system interior joint, such as Fig. 4 institute
Showing, the neighbor node of node E has X, A, D, C and G, node E will in each heart beat cycle to
Its all neighbor node X, A, D, C and G send heartbeat message, it is assumed that using node E as second
Node, using node A as primary nodal point, if in some heart beat cycle, primary nodal point A does not receives
To secondary nodal point E send the first heartbeat message, then primary nodal point A will to other neighbor nodes X,
D, C and G send request message, to inquire whether nodes X, D, C and G receive the first heart beating report
Literary composition.
Step 203, primary nodal point receive the response carrying reception state of other neighbor nodes transmission and disappear
Breath, this reception state is used for indicating whether to receive the first heartbeat message.
In the present embodiment, after other neighbor nodes receive the request message that primary nodal point sends, will be from
Whether body receives the reception state of the first heartbeat message is carried and is sent to primary nodal point in the response message.
Step 204, the response message sent according to other neighbor nodes each received at primary nodal point
In the reception state carried, in the case of determining that other neighbor nodes all do not receive the first heartbeat message,
Primary nodal point determines that secondary nodal point breaks down.
In the present embodiment, each other neighbor node is receiving the request message that primary nodal point sends
Afterwards, all can return the response message carrying reception state to primary nodal point, primary nodal point is according to reception
To other neighbor nodes each send the response message carrying reception state, it is judged that other neighbours save
Whether point receives the first heartbeat message, is judging that other neighbor nodes are all not received by secondary nodal point
In the case of the first heartbeat message sent, i.e. can determine that secondary nodal point there occurs fault.
Needing to illustrate, the neighbouring relations between node are two-way, i.e. form neighborhood
Can mutually send heartbeat message between node, therefore, all neighbor nodes of secondary nodal point all can be independent
Execution step 201-step 204.
In the fault detection method of the group system interior joint that the embodiment of the present invention provides, primary nodal point judges
Whether receiving the first heartbeat message that secondary nodal point sends in Preset Time, wherein, primary nodal point is
The neighbor node of secondary nodal point, the first heartbeat message is that secondary nodal point is concurrently to each neighbour of secondary nodal point
Occupying the heartbeat message that node sends, the number of all neighbor nodes of secondary nodal point is two or more;
This Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point is certainly
In the case of body does not receives the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point connect
Receive the first heartbeat message, and other neighbor nodes determining this secondary nodal point the most all do not receive this
In the case of one heartbeat message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to
One heart beat cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide is carried out
During fault detect, it is to avoid prior art needs by whether multiple heart beat cycles just can detect node
The phenomenon broken down, shortens the cycle of fault detect, thus improves the efficiency of node failure detection.
The flow process of the fault detection method embodiment two of the group system interior joint that Fig. 5 provides for the present invention is shown
It is intended to.On the basis of embodiment illustrated in fig. 2, after primary nodal point is determined that secondary nodal point breaks down,
Each node redefines the embodiment of neighbor node, elaborates.As it is shown in figure 5, the present embodiment
Method may include that
Step 501, primary nodal point generate the first vote information, and receive other neighbor nodes each transmission
The second vote information, the first vote information includes the node identification that node that primary nodal point is elected is corresponding;
The node mark that the node of the neighbor node election that the second vote information includes sending the second vote information is corresponding
Know.
In the present embodiment, after the neighbor node of secondary nodal point determines that secondary nodal point breaks down,
All of neighbor node is required to recalculate respective neighbor node.For purposes of illustration only, can be by second
Any one neighbor node of node is as primary nodal point, and primary nodal point needs to generate the first vote information,
This first vote information comprises node identification corresponding to node and the ballot foundation of primary nodal point election.
It addition, primary nodal point also to receive the second vote information that other neighbor nodes each send, the second ballot
Node identification that the node of the neighbor node election that information includes sending the second vote information is corresponding and throwing
Ticket foundation.In actual applications, ballot is according to relevant with many factors, such as: loading condition, node
Size, nodal cache newness degree and the meshed network bandwidth etc. of numbering, such as: primary nodal point can lead to
Cross and judge that the load which node is undertaken is minimum, and by node mark corresponding for this node of load minimum
Know to carry in the first vote information, be sent to other neighbor nodes.Same, other neighbor nodes also may be used
With in the way of similar, the second vote information is sent to primary nodal point.
Step 502, primary nodal point save according to the node identification in the first vote information and other neighbours each
Node identification in the second vote information that point sends, each node in all nodes that statistics is elected
The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;3rd node is for substituting
Secondary nodal point and concurrently to the 3rd node all neighbor nodes send heartbeat message node;3rd
All neighbor nodes of node include neighbor node and the neighbor node of secondary nodal point of the 3rd node self.
In the present embodiment, primary nodal point is receiving the second ballot letter that other neighbor nodes each send
After breath, according to the node identification in the first vote information that self generates and the second vote information received
In node identification, it may be determined that go out the 3rd node.During concrete implementation, can be according to first
The node identification carried in vote information and the second vote information, by the way of vote by ballot, adds up quilt
The ballot quantity that in all nodes of election, each node obtains, and the node most by obtaining ballot quantity
As the 3rd node.The neighbor node of the secondary nodal point that the 3rd node breaks down for adapter, Ye Jijie
Incidence relation between pipe secondary nodal point and other nodes, therefore, the 3rd node general's replacement secondary nodal point,
And concurrently to the node of all neighbor nodes transmission heartbeat message of the 3rd node, wherein, the 3rd node
All neighbor nodes in addition to including the neighbor node of the 3rd node self, also include secondary nodal point
Neighbor node.
Step 503, primary nodal point according in the neighbor node of the 3rd node and other neighbor nodes except the 3rd
Node outside node, redefines the neighbor node of primary nodal point.
In the present embodiment, all neighbor nodes of secondary nodal point determine by the way of vote by ballot
After three nodes, if primary nodal point is the 3rd node, then primary nodal point will take over the adjacent pass of secondary nodal point
System, other neighbor nodes can according to the neighbouring relations after the neighbor node of primary nodal point adapter secondary nodal point,
Respective neighbor node is determined again through calculating;If primary nodal point is not the 3rd node, then primary nodal point
After treating that the 3rd node redefines out neighbouring relations, neighbor node and other according to the 3rd node are adjacent
Occupy the node in addition to the 3rd node in node, redefine the neighbor node of self.
For example, Fig. 6 A detects neighbouring relations between the front nodal point of node failure in being group system
Schematic diagram, Fig. 6 B be group system detects node failure after redefine adjacent pass between node
The schematic diagram of system.As shown in Figure 6A, it is assumed that node E is secondary nodal point, node A is primary nodal point,
After primary nodal point A determines that secondary nodal point E breaks down, primary nodal point A will generate the first ballot letter
Breath, and the second vote information that respectively receiving node X, D, C and G send, primary nodal point A according to
Node identification in first vote information and the node identification in the second vote information determine the 3rd node,
So that the 3rd node substitutes secondary nodal point and sends heart beating to all neighbor nodes of the 3rd node concurrently
Message.As shown in Figure 6B, if by vote by ballot, determining that primary nodal point A is the 3rd node, then
Substituted secondary nodal point by primary nodal point A and send to all neighbor nodes of primary nodal point A concurrently
Heartbeat message, now, primary nodal point A needs by other neighbor nodes X, D, C and G the most true
Determine the neighbor node of oneself, and nodes X, D, C and G are the neighbour waiting primary nodal point A to determine oneself
After occupying node, the neighbor node determined according to primary nodal point A redefines respective neighbor node.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, primary nodal point judges
Whether receiving the first heartbeat message that secondary nodal point sends in Preset Time, wherein, primary nodal point is the
The neighbor node of two nodes, the first heartbeat message is that secondary nodal point is concurrently to each neighbours of secondary nodal point
The heartbeat message that node sends, the number of all neighbor nodes of secondary nodal point is two or more;Should
Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point is at self
In the case of not receiving the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point receive
To the first heartbeat message, and other neighbor nodes determining this secondary nodal point the most all do not receive this first
In the case of heartbeat message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to one
Individual heart beat cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide carries out event
During barrier detection, it is to avoid prior art needs just to detect whether node is sent out by multiple heart beat cycles
The phenomenon of raw fault, shortens the cycle of fault detect, thus improves the efficiency of node failure detection.
It addition, by after determining that primary nodal point breaks down, redefine respective neighbor node, and then
Proceed fault detect, improve the accuracy of fault detect.
Alternatively, in the response message that primary nodal point sends according to other neighbor nodes each received
The reception state carried, determines that at least one other neighbor node receives the situation of the first heartbeat message
Under, primary nodal point determine described in do not receive the link between node and the secondary nodal point of the first heartbeat message
Break down.
Specifically, primary nodal point is not receiving the first heartbeat message that secondary nodal point sends, and to each
Other nodes send request message, to inquire whether other neighbor nodes each receive the first heartbeat message
Afterwards, if determining at least other neighbor nodes according to the response message of other nodes each transmission
Have received the first heartbeat message, then primary nodal point can determine that secondary nodal point is normal, and may
It is between secondary nodal point and primary nodal point and node and the primary nodal point not receiving the first heartbeat message
Link there occurs fault, wherein, do not receive the node of the first heartbeat message include primary nodal point and its
He does not receives the neighbor node of the first heartbeat message in neighbor node.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, owing to primary nodal point exists
In the case of determining that at least one other neighbor node receives the first heartbeat message, primary nodal point determines
The link not received between node and the secondary nodal point of the first heartbeat message breaks down so that fault is examined
Survey more comprehensive.
The flow process of the fault detection method embodiment three of the group system interior joint that Fig. 7 provides for the present invention is shown
It is intended to.The method that the present embodiments relate to is applicable to distributed cluster system.In the present embodiment still with
Computer is introduced as a example by executive agent.As it is shown in fig. 7, the method for the present embodiment may include that
Step 701, secondary nodal point send the first heart beating report to primary nodal point and other neighbor nodes concurrently
Literary composition, primary nodal point is the neighbor node of secondary nodal point;Other neighbor nodes are all neighbours of secondary nodal point
Node in addition to primary nodal point in node, the number of other neighbor nodes is more than one.
In the present embodiment, secondary nodal point can be according to the information of the node included in group system, root
Determining self all of neighbor node according to the rule preset in group system, wherein, primary nodal point is the
Any one neighbor node of two nodes, the neighbor node of secondary nodal point is pass relevant with this secondary nodal point
The node of system.Secondary nodal point after determining all of neighbor node, can concurrently to primary nodal point and
Other neighbor nodes send the first heartbeat message.
Step 702, primary nodal point judge whether to receive the first heartbeat message in Preset Time;Preset
Time is more than or equal to a heart beat cycle, and less than two heart beat cycles.
In the present embodiment, secondary nodal point can be according to heart beat cycle concurrently to its all neighbor nodes
Sending the first heartbeat message, therefore, primary nodal point may determine that more than or equal to a heart beat cycle,
And less than whether receiving the first heartbeat message that this secondary nodal point sends in the time of two heart beat cycles.
Such as: assume heart beat cycle be 5s, i.e. secondary nodal point every 5s, will send out to its neighbor node concurrently
Sending a heartbeat message, the first heartbeat message sent for secondary nodal point at 5s, primary nodal point will be sentenced
Break more than or equal to 5s, and within less than the time of 10s, whether receive first heart that secondary nodal point sends
Jump message.Wherein, heart beat cycle can rule of thumb or practical situation is configured, for heart beating week
The concrete value of phase, this is not restricted for the present embodiment.
It addition, secondary nodal point periodically can send first heart to primary nodal point by a physical network
Jump message, but during owing to carrying out fault detect based on single physical network, at network failure, such as:
Management plane network failure, and when service plane network is normal, often cannot define is group system
Middle secondary nodal point there occurs that the link between fault or secondary nodal point and primary nodal point there occurs fault, or
Person's secondary nodal point and primary nodal point there occurs fault simultaneously, and thus, the testing result of causing trouble is inaccurate.
In order to solve this problem, it is preferable that the present embodiment can also send first by least two network
Heartbeat message, for example, the first heartbeat message can be sent by biplane, such as: management plane
And service plane, it is also possible to send the first heartbeat message by three planes, such as: management plane, business
Plane and signaling plane.The mode using many physical networks sends the first heartbeat message, detects node and is
No break down, the accuracy of detection can be improved.Need to illustrate, if the number of physical network
When amount is at least two, mutually isolated between this at least two physical network, so can avoid due to many
When there is some equipment shared between network, if shared device breaks down, thus cause nothing between node
The phenomenon of method proper communication, is conducive to improving the accuracy of detection.
Step 703, in the case of primary nodal point does not receives the first heartbeat message, primary nodal point is to often
One other neighbor nodes send request message respectively, and request message is used for inquiring that other neighbor nodes each are
No receive described first heartbeat message.
In the present embodiment, if primary nodal point does not receive that secondary nodal point sends in Preset Time
One heartbeat message, can primarily determine that secondary nodal point has been likely to occur fault.Owing to secondary nodal point is also
The first heartbeat message that row ground sends to its all neighbor nodes, therefore, primary nodal point will be to second section
In the neighbor node of point, other neighbor nodes in addition to self send request message, to inquire that other are adjacent
Occupy whether node receives the first heartbeat message that secondary nodal point sends.
Step 704, primary nodal point receive the sound carrying reception state that other neighbor nodes each send
Message, reception state is answered to be used for indicating whether to receive the first heartbeat message.
In the present embodiment, after other neighbor nodes each receive the request message that primary nodal point sends,
Whether self receives the reception state of the first heartbeat message carry and be sent to first segment in the response message
Point.
Step 705, at primary nodal point according to the reception state carried in the response message received, determine
Going out in the case of other neighbor nodes all do not receive the first heartbeat message, primary nodal point determines secondary nodal point
Break down.
In the present embodiment, each other neighbor node is receiving the request message that primary nodal point sends
Afterwards, all can return the response message carrying reception state to primary nodal point, primary nodal point is according to reception
To other neighbor nodes each send the response message carrying reception state, it is judged that other neighbours save
Whether point receives the first heartbeat message, is judging that other neighbor nodes are all not received by secondary nodal point
During the first heartbeat message sent, i.e. can determine that secondary nodal point there occurs fault.
In the fault detection method of the group system interior joint that the embodiment of the present invention provides, secondary nodal point passes through
Sending the first heartbeat message to primary nodal point and other neighbor nodes concurrently, primary nodal point judges presetting
Whether receiving the first heartbeat message that secondary nodal point sends in time, wherein, primary nodal point is second section
The neighbor node of point, the first heartbeat message is that secondary nodal point is concurrently to each neighbor node of secondary nodal point
The heartbeat message sent, the number of all neighbor nodes of secondary nodal point is two or more;This is preset
Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point does not connects at self
In the case of receiving the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point receive
One heartbeat message, and the most all do not receive this first heart beating at other neighbor nodes determining this secondary nodal point
In the case of message, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to a heart
Hop cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide carries out fault inspection
During survey, it is to avoid prior art needs just can be detected by multiple heart beat cycles node whether occur therefore
The phenomenon of barrier, shortens the cycle of fault detect, thus improves the efficiency of node failure detection.
The flow process of the fault detection method embodiment four of the group system interior joint that Fig. 8 provides for the present invention is shown
It is intended to.On the basis of embodiment illustrated in fig. 7, after primary nodal point is determined that secondary nodal point breaks down,
Each node redefines the embodiment of neighbor node, elaborates.As shown in Figure 8, the present embodiment
Method may include that
Step 801, primary nodal point generate the first vote information, and receive other neighbor nodes each transmission
The second vote information, the first vote information includes the node identification that node that primary nodal point is elected is corresponding;
The node mark that the node of the neighbor node election that the second vote information includes sending the second vote information is corresponding
Know.
In the present embodiment, after the neighbor node of secondary nodal point determines that secondary nodal point breaks down,
All of neighbor node is required to recalculate respective neighbor node.For purposes of illustration only, can be by second
Any one neighbor node of node is as primary nodal point, and primary nodal point needs to generate the first vote information,
This first vote information comprises node identification corresponding to node and the ballot foundation of primary nodal point election.
It addition, primary nodal point also to receive the second vote information that other neighbor nodes each send, this second throwing
Node identification that the node of the neighbor node election that ticket information includes sending the second vote information is corresponding and
Ballot foundation.In actual applications, ballot is according to relevant with many factors, such as: loading condition, joint
Size, nodal cache newness degree and the meshed network bandwidth etc. of some numbering, such as: primary nodal point is permissible
Minimum by judging the load which node is undertaken, and by node corresponding for this node of load minimum
Mark is carried and is sent to other neighbor nodes in the first vote information.Same, other neighbor nodes are also
By similar mode, the second vote information can be sent to primary nodal point.
Step 802, primary nodal point save according to the node identification in the first vote information and other neighbours each
Node identification in the second vote information that point sends, in all nodes that statistics is elected, each node obtains
The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;3rd node is replacement
Two nodes and concurrently to the 3rd node all neighbor nodes send heartbeat message node;Section three,
All neighbor nodes of point include neighbor node and the neighbor node of secondary nodal point of the 3rd node self.
In the present embodiment, primary nodal point is receiving the second ballot letter that other neighbor nodes each send
After breath, according to the node identification in the first vote information that self generates and the second vote information received
In node identification, it may be determined that go out the 3rd node.During concrete implementation, can be according to first
The node identification carried in vote information and the second vote information, by the way of vote by ballot, adds up quilt
The ballot quantity that in all nodes of election, each node obtains, and the node most by obtaining ballot quantity
As the 3rd node.The neighbor node of the secondary nodal point that the 3rd node breaks down for adapter, Ye Jijie
Incidence relation between pipe secondary nodal point and other nodes, therefore, the 3rd node general's replacement secondary nodal point,
And send heartbeat message, wherein, owning of the 3rd node to all neighbor nodes of the 3rd node concurrently
Neighbor node, in addition to including the neighbor node of the 3rd node self, also includes neighbours' joint of secondary nodal point
Point.
Step 803, primary nodal point according in the neighbor node of the 3rd node and other neighbor nodes except the 3rd
Node outside node, redefines the neighbor node of primary nodal point.
In the present embodiment, all neighbor nodes of secondary nodal point determine by the way of vote by ballot
After three nodes, if primary nodal point is the 3rd node, then primary nodal point will take over the adjacent pass of secondary nodal point
System, other neighbor nodes can according to the neighbouring relations after the neighbor node of primary nodal point adapter secondary nodal point,
Respective neighbor node is determined again through calculating;If primary nodal point is not the 3rd node, then primary nodal point
After treating that the 3rd node redefines out neighbouring relations, neighbor node and other according to the 3rd node are adjacent
Occupy the node in addition to the 3rd node in node, redefine the neighbor node of self.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, secondary nodal point is by also
Row ground sends the first heartbeat message to primary nodal point and other neighbor nodes, and primary nodal point judges when default
In whether receive secondary nodal point send the first heartbeat message, wherein, primary nodal point is secondary nodal point
Neighbor node, the first heartbeat message is that secondary nodal point is sent out to each neighbor node of secondary nodal point concurrently
The heartbeat message sent, the number of all neighbor nodes of secondary nodal point is two or more;When this is default
Between more than or equal to a heart beat cycle, and less than two heart beat cycles;Primary nodal point does not receives at self
In the case of the first heartbeat message, inquire whether other neighbor nodes of this secondary nodal point receive first
Heartbeat message, and the most all do not receive this first heart beating report at other neighbor nodes determining this secondary nodal point
In the case of literary composition, determine that secondary nodal point there occurs fault.Owing to Preset Time is more than or equal to a heart beating
Cycle, and less than two heart beat cycles, so the technical scheme using the present invention to provide carries out fault detect
Time, it is to avoid prior art needs just to detect whether node breaks down by multiple heart beat cycles
Phenomenon, shorten the cycle of fault detect, thus improve the efficiency of node failure detection.It addition,
By after determining that primary nodal point breaks down, redefine respective neighbor node, so continue into
Row fault detect, improves the accuracy of fault detect.
Alternatively, in the response message that primary nodal point sends according to other neighbor nodes each received
The reception state carried, determines that at least one other neighbor node receives the situation of the first heartbeat message
Under, primary nodal point determine described in do not receive the link between node and the secondary nodal point of the first heartbeat message
Break down.
Specifically, primary nodal point is not receiving the first heartbeat message that secondary nodal point sends, and to each
Other nodes send request message, to inquire whether other neighbor nodes each receive the first heartbeat message
Afterwards, if determining at least other neighbours according to the response message of other neighbor nodes each transmission
Node have received the first heartbeat message, then primary nodal point can determine that secondary nodal point is normal, and
It is probably secondary nodal point and primary nodal point and does not receives node and the primary nodal point of the first heartbeat message
Between link there occurs fault, wherein, the node not receiving the first heartbeat message includes primary nodal point
With the neighbor node not receiving the first heartbeat message in other neighbor nodes.
Alternatively, described primary nodal point saves according to neighbor node and other neighbours described of described 3rd node
Node in addition to described 3rd node in point, redefines the neighbor node of described primary nodal point.
The fault detection method of the group system interior joint that the embodiment of the present invention provides, owing to primary nodal point exists
In the case of determining that at least one other neighbor node receives the first heartbeat message, primary nodal point determines
The link not received between node and the secondary nodal point of the first heartbeat message breaks down so that fault is examined
Survey more comprehensive.
Fig. 9 is the structural representation of the failure detector embodiment one of group system interior joint of the present invention
Figure, as it is shown in figure 9, the failure detector 10 of the group system interior joint of embodiment of the present invention offer
Including judge module 11, sending module 12, receiver module 13, determine module 14 and generation module
15。
Wherein, it is judged that module 11 is for judging in Preset Time whether receiver module 13 receives the
The first heartbeat message that two nodes send;Described primary nodal point is the neighbor node of described secondary nodal point,
Described first heartbeat message is that described secondary nodal point saves to each neighbours of described secondary nodal point concurrently
Point send a heartbeat message, the number of all neighbor nodes of described secondary nodal point be two with
On;Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;Institute
State judge module 11 and judge that described receiver module 13 does not receives first that described secondary nodal point sends
In the case of heartbeat message, sending module 12 is in all neighbor nodes of described secondary nodal point
Other neighbor nodes in addition to described primary nodal point send request message;Described request message is used for asking
Ask whether other neighbor nodes described receive described first heartbeat message;Described receiver module 13 is also
For receiving the response message carrying reception state that other neighbor nodes described send, described reception
State is used for indicating whether to receive described first heartbeat message;Determine that module 14 is for according to described
The described response message that other neighbor nodes each described that receiver module 13 receives send carries
Reception state, it is determined whether other neighbor nodes described all do not receive described first heartbeat message;
Determine that module 14 determines that other neighbor nodes described all do not receive described first heart beating report described
In the case of literary composition, described determine that module 14 is additionally operable to determine that described secondary nodal point breaks down.Institute
State primary nodal point according to the reception state carried in the described response message received, determine described its
In the case of his neighbor node does not all receive described first heartbeat message, determine that module 14 is for really
Fixed described secondary nodal point breaks down.
The failure detector of the group system interior joint that the embodiment of the present invention provides, it is judged that module judges
In Preset Time, whether receiver module receives the first heartbeat message that secondary nodal point sends, first heart
Jumping message is the heart beating report that secondary nodal point sends to each neighbor node of secondary nodal point concurrently
Literary composition, the number of all neighbor nodes of secondary nodal point is two or more;This Preset Time is more than or equal to
One heart beat cycle, and less than two heart beat cycles;Receiver module is not receiving the first heartbeat message
In the case of, sending module sends request message to other neighbor nodes of this secondary nodal point, with inquiry
Whether other neighbor nodes receive the first heartbeat message, and are determining that module determines this secondary nodal point
Other neighbor nodes the most all do not receive this first heartbeat message in the case of, determine that secondary nodal point is sent out
Give birth to fault.Owing to Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles,
So use the technical scheme that the present invention provides when carrying out fault detect, it is to avoid prior art needs
Just can be detected the phenomenon whether node breaks down by multiple heart beat cycles, shorten fault detect
Cycle, thus improve node failure detection efficiency.
Alternatively, generation module 15 is additionally operable to generate the first vote information, and described first vote information includes
The node identification corresponding to node of described primary nodal point election;
Described receiver module 13 is additionally operable to receive the second ballot letter that each other neighbor nodes described send
Breath, described second vote information includes the node pair sending the neighbor node election of described second vote information
The node identification answered;
Described determine that module 14 is additionally operable to according to the node identification in described first vote information and each institute
State the node identification in the second vote information that other neighbor nodes send, all nodes that statistics is elected
In the ballot quantity that obtains of each node, and using nodes most for quantity of voting as the 3rd node;Described
3rd node be substitute described secondary nodal point and concurrently all neighbor nodes to described 3rd node send out
Send the node of heartbeat message;All neighbor nodes of described 3rd node include described 3rd node self
Neighbor node and the neighbor node of described secondary nodal point.
Alternatively, described determine module 14 according to described receiver module 13 receive each described
The reception state carried in the described response message that other neighbor nodes send, determines at least one institute
State in the case of other neighbor nodes receive described first heartbeat message,
Described determine that module 14 is additionally operable to determine node and the institute not receiving described first heartbeat message
The link stated between secondary nodal point breaks down;The described node not receiving described first heartbeat message
Described first heartbeat message is not received including in described primary nodal point and other neighbor nodes described
Node.
Alternatively, described determine that module 14 is additionally operable to the neighbor node according to described 3rd node and institute
State the node in addition to described 3rd node in other neighbor nodes, redefine described primary nodal point
Neighbor node.
The failure detector of the group system interior joint of the present embodiment, may be used for performing the present invention any
The technical scheme of the fault detection method of the group system interior joint that embodiment is provided, its realize principle and
Technique effect is similar to, and here is omitted.
Figure 10 is the structural representation of the fault detection system embodiment one of group system interior joint of the present invention
Figure, as shown in Figure 10, the fault detection system of the group system interior joint that the embodiment of the present invention provides
20 include primary nodal point 21, secondary nodal point 22 and other neighbor nodes 23, described primary nodal point 21
For the neighbor node of described secondary nodal point 22, other neighbor nodes 23 described are described secondary nodal point 22
All neighbor nodes in node in addition to described primary nodal point 21, other neighbor nodes 23 described
Number be more than one.
Wherein, described secondary nodal point 22 is for concurrently to described primary nodal point and other neighbor nodes described
Send the first heartbeat message;Whether described primary nodal point 21 is described in judge to receive in Preset Time
First heartbeat message;Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beating weeks
Phase;In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point 21
Being additionally operable to send request message respectively to each other neighbor nodes described, described request message is used for inquiring
Whether each other neighbor nodes described receive described first heartbeat message;Described primary nodal point 21 is also used
In receiving the response message carrying reception state that each other neighbor nodes described send, described reception
State is used for indicating whether to receive described first heartbeat message;At described primary nodal point according to receiving
The reception state carried in the described response message that each other neighbor nodes described send, determines described
In the case of other neighbor nodes all do not receive described first heartbeat message, described primary nodal point 21 is also used
In determining that described secondary nodal point breaks down.
In the fault detection system of the group system interior joint that the embodiment of the present invention provides, it is judged that module judges
In Preset Time, whether receiver module receives the first heartbeat message that secondary nodal point sends, the first heart beating
Message is the heartbeat message that secondary nodal point sends to each neighbor node of secondary nodal point concurrently, the
The number of all neighbor nodes of two nodes is two or more;This Preset Time is more than or equal to a heart beating
Cycle, and less than two heart beat cycles;Receiver module in the case of not receiving the first heartbeat message,
Sending module sends request message to other neighbor nodes of this secondary nodal point, to inquire other neighbor nodes
Whether receive the first heartbeat message, and determining that module determines other neighbor nodes of this secondary nodal point
In the case of the most all not receiving this first heartbeat message, determine that secondary nodal point there occurs fault.Due in advance
If the time is more than or equal to a heart beat cycle, and less than two heart beat cycles, so using the present invention to carry
When the technical scheme of confession carries out fault detect, it is to avoid prior art needs by multiple heart beat cycles
The phenomenon whether node breaks down can be detected, shorten the cycle of fault detect, thus improve joint
The efficiency of point failure detection.
In the above-described embodiments, after described primary nodal point 21 determines that described secondary nodal point breaks down, also
Including: described primary nodal point 21 is additionally operable to:
Generate the first vote information, and receive the second vote information that each other neighbor nodes described send,
Described first vote information includes the node identification that node that described primary nodal point is elected is corresponding, described second
The node mark that the node of the neighbor node election that vote information includes sending described second vote information is corresponding
Know;
And, send out according to the node identification in described first vote information and each other neighbor nodes described
Node identification in the second vote information sent, in all nodes that statistics is elected, each node obtains
Ballot quantity, and using nodes most for quantity of voting as the 3rd node, described 3rd node is for substituting institute
State secondary nodal point and send the node of heartbeat message concurrently to all neighbor nodes of described 3rd node;
All neighbor nodes of described 3rd node include the neighbor node and described second of described 3rd node self
The neighbor node of node.
In the above-described embodiments, at described primary nodal point according to other neighbor nodes each described received
The reception state carried in the described response message sent, determines at least one other neighbor node described
In the case of receiving described first heartbeat message,
Described primary nodal point 21 is additionally operable to determine that the node not receiving described first heartbeat message is with described
Link between secondary nodal point breaks down;The described node not receiving described first heartbeat message includes
Described primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
In the above-described embodiments, described primary nodal point 21 is additionally operable to the neighbor node according to described 3rd node
With node in addition to described 3rd node in other neighbor nodes described, redefine described primary nodal point
Neighbor node.
Said system embodiment can be used for performing the technical scheme of embodiment of the method accordingly, and it realizes principle
Similar with technique effect, here is omitted.
Figure 11 is the structural representation of node embodiment one of the present invention, as shown in figure 11, the present embodiment
Node 600 includes processor 601, user interface 603, network interface 604 and memorizer 605, sends
Device 606 and receptor 607, memorizer 605 can include operating system 6051, application program 6052 etc..
Processor 601 can be central processing unit (Central Processing Unit, CPU).Memorizer 605
For storing executable instruction.Processor 601 can perform the executable instruction of storage in memorizer 605.
Wherein, receptor 607 is for receiving the first heartbeat message that secondary nodal point sends;Described processor 601
For judging whether described receptor 607 receives the first heart beating that secondary nodal point sends in Preset Time
Message;Described first heartbeat message is that described secondary nodal point is concurrently to each neighbours of described secondary nodal point
The heartbeat message that node sends, the number of all neighbor nodes of described secondary nodal point is two or more;
Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;In described process
Device 601 judges that described receptor 607 does not receives the first heartbeat message of described secondary nodal point transmission
In the case of, transmitter 606 is for removing described primary nodal point in all neighbor nodes of described secondary nodal point
Outside other neighbor nodes send request message, described request message is used for inquiring described other neighbours joint
Whether point receives described first heartbeat message, and described primary nodal point is the neighbor node of described secondary nodal point;
Described receptor 607 is additionally operable to receive the response carrying reception state that other neighbor nodes described send
Message, described reception state is used for indicating whether to receive described first heartbeat message;Described processor 601
The described response sent for other neighbor nodes each described received according to described receptor 607 disappears
The reception state carried in breath, it is determined whether other neighbor nodes described all do not receive described first heart beating
Message;Determine that other neighbor nodes described all do not receive described first heart beating at described processor 601
In the case of message, described processor 601 is additionally operable to determine that described secondary nodal point breaks down.
The node that the present embodiment provides, may be used for performing the cluster system that any embodiment of the present invention is provided
The technical scheme of the fault detection method of system interior joint, it is similar with technique effect that it realizes principle, the most not
Repeat again.
Alternatively, described processor 601 is additionally operable to generate the first vote information, described first vote information
Including the node identification that the node of described primary nodal point election is corresponding;
Described receptor 607 is additionally operable to receive the second ballot letter that each other neighbor nodes described send
Breath, described second vote information includes the node pair sending the neighbor node election of described second vote information
The node identification answered;
Described processor 601 is additionally operable to according to the node identification in described first vote information and each described
Node identification in the second vote information that other neighbor nodes send, in all nodes that statistics is elected
The ballot quantity that each node obtains, and using nodes most for quantity of voting as the 3rd node;Described
Three nodes are to substitute described secondary nodal point and send to all neighbor nodes of described 3rd node concurrently
The node of heartbeat message;All neighbor nodes of described 3rd node include the neighbour of described 3rd node self
Occupy node and the neighbor node of described secondary nodal point.
Alternatively, described processor 601 according to described receptor 607 receive each described other
The reception state carried in the described response message that neighbor node sends, determine at least one described other
In the case of neighbor node receives described first heartbeat message, described processor 601 is additionally operable to determine not
The link received between node and the described secondary nodal point of described first heartbeat message breaks down;Described
The node not receiving described first heartbeat message includes in described primary nodal point and other neighbor nodes described
Do not receive the node of described first heartbeat message.
Alternatively, described processor 601 be additionally operable to the neighbor node according to described 3rd node and described its
Node in addition to described 3rd node in his neighbor node, redefines neighbours' joint of described primary nodal point
Point.
The node that the present embodiment provides, may be used for performing the cluster system that any embodiment of the present invention is provided
The technical scheme of the fault detection method of system interior joint, it is similar with technique effect that it realizes principle, the most not
Repeat again.
One of ordinary skill in the art will appreciate that: realize all or part of step of above-mentioned each method embodiment
Suddenly can be completed by the hardware that programmed instruction is relevant.Aforesaid program can be stored in a computer can
Read in storage medium.This program upon execution, performs to include the step of above-mentioned each method embodiment;And
Aforesaid storage medium includes: ROM, RAM, magnetic disc or CD etc. are various can store program code
Medium.
Last it is noted that various embodiments above is only in order to illustrate technical scheme, rather than right
It limits;Although the present invention being described in detail with reference to foregoing embodiments, this area common
Skilled artisans appreciate that the technical scheme described in foregoing embodiments still can be modified by it,
Or the most some or all of technical characteristic is carried out equivalent;And these amendments or replacement, and
The essence not making appropriate technical solution departs from the scope of various embodiments of the present invention technical scheme.
Claims (12)
1. the fault detection method of a group system interior joint, it is characterised in that including:
Primary nodal point judges whether receive the first heartbeat message that secondary nodal point sends in Preset Time;
Described primary nodal point is the neighbor node of described secondary nodal point, and described first heartbeat message is described second section
The heartbeat message that point sends to each neighbor node of described secondary nodal point concurrently, described second section
The number of all neighbor nodes of point is two or more;Described Preset Time is more than or equal to a heart beating week
Phase, and less than two heart beat cycles;
In the case of described primary nodal point does not receives the first heartbeat message that described secondary nodal point sends,
Described primary nodal point other in addition to described primary nodal point in all neighbor nodes of described secondary nodal point
Neighbor node sends request message, and described request message is used for inquiring whether other neighbor nodes described receive
To described first heartbeat message;
Described primary nodal point receives the response carrying reception state of described other neighbor nodes transmission and disappears
Breath, described reception state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report
In the case of literary composition, described primary nodal point determines that described secondary nodal point breaks down.
Method the most according to claim 1, it is characterised in that described primary nodal point determines described
After two nodes break down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent
Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding
Mark;Described second vote information includes the node sending the neighbor node election of described second vote information
Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described
Node identification in the second vote information that node sends, each node in all nodes that statistics is elected
The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is
Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently
Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and
The neighbor node of described secondary nodal point.
Method the most according to claim 1 and 2, it is characterised in that also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that at least one other neighbor node described receives described first heart
In the case of jumping message, described primary nodal point determines node and the institute not receiving described first heartbeat message
The link stated between secondary nodal point breaks down;The described node bag not receiving described first heartbeat message
Include the node not receiving described first heartbeat message in described primary nodal point and other neighbor nodes described.
4. the fault detection method of a group system interior joint, it is characterised in that described method includes:
Secondary nodal point sends the first heartbeat message to primary nodal point and other neighbor nodes concurrently;Described
One node is the neighbor node of described secondary nodal point, and other neighbor nodes described are the institute of described secondary nodal point
Having the node in addition to described primary nodal point in neighbor node, the number of other neighbor nodes described is one
Above;
Described primary nodal point judges whether receive described first heartbeat message in Preset Time;Described pre-
If the time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point to
Each other neighbor nodes described send request message respectively, and described request message is used for inquiring each described
Whether other neighbor nodes receive described first heartbeat message;
Described primary nodal point receives the response carrying reception state that each other neighbor nodes described send
Message, described reception state is used for indicating whether to receive described first heartbeat message;
At described primary nodal point according to the reception state carried in the described response message received, determine
In the case of other neighbor nodes described all do not receive described first heartbeat message, described primary nodal point is true
Fixed described secondary nodal point breaks down.
Method the most according to claim 4, it is characterised in that described primary nodal point determines described
After two nodes break down, also include:
Described primary nodal point generates the first vote information, and receives what each other neighbor nodes described sent
Second vote information, described first vote information includes the node that node that described primary nodal point is elected is corresponding
Mark;Described second vote information includes the node sending the neighbor node election of described second vote information
Corresponding node identification;
Described primary nodal point is according to the node identification in described first vote information and each other neighbours described
Node identification in the second vote information that node sends, each node in all nodes that statistics is elected
The ballot quantity obtained, and using nodes most for quantity of voting as the 3rd node;Described 3rd node is
Substitute described secondary nodal point and send heartbeat message to all neighbor nodes of described 3rd node concurrently
Node;All neighbor nodes of described 3rd node include described 3rd node self neighbor node and
The neighbor node of described secondary nodal point.
6. according to the method described in claim 4 or 5, it is characterised in that also include:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that at least one other neighbor node described receives described first heart
Jump in the case of message, the most described primary nodal point determine the node not receiving described first heartbeat message with
Link between described secondary nodal point breaks down;The described node not receiving described first heartbeat message
Including the node not receiving the first heartbeat message in described primary nodal point and other neighbor nodes described.
7. the failure detector of a group system interior joint, it is characterised in that including:
Judge module, for judging whether receiver module receives what secondary nodal point sent in Preset Time
First heartbeat message;Described primary nodal point is the neighbor node of described secondary nodal point, described first heart beating report
Literary composition is the heart beating report that described secondary nodal point sends to each neighbor node of described secondary nodal point concurrently
Literary composition, the number of all neighbor nodes of described secondary nodal point is two or more;Described Preset Time more than or
Equal to a heart beat cycle, and less than two heart beat cycles;
Judge that described receiver module does not receives first that described secondary nodal point sends at described judge module
In the case of heartbeat message,
Sending module, in all neighbor nodes of described secondary nodal point in addition to described primary nodal point
Other neighbor nodes send request message, described request message is used for inquiring that other neighbor nodes described are
No receive described first heartbeat message;
Described receiver module, is additionally operable to the reception state that carries of other neighbor nodes transmission described in receiving
Response message, described reception state is used for indicating whether to receive described first heartbeat message;
Determine module, send for other neighbor nodes each described received according to described receiver module
Described response message in the reception state carried, it is determined whether other neighbor nodes described all do not receive
Described first heartbeat message;
Determine that module determines that other neighbor nodes described all do not receive described first heartbeat message described
In the case of, described determine module, be additionally operable to determine that described secondary nodal point breaks down.
Device the most according to claim 7, it is characterised in that to determine that module determines described described
After secondary nodal point breaks down, also include:
Generation module, is additionally operable to generate the first vote information, and described first vote information includes described first
The node identification corresponding to node of node election;
Described receiver module, is additionally operable to receive the second vote information that each other neighbor nodes described send,
Described second vote information includes that the node sending the neighbor node election of described second vote information is corresponding
Node identification;
Described determine module, be additionally operable to according to the node identification in described first vote information and each described
Node identification in the second vote information that other neighbor nodes send, in all nodes that statistics is elected
The ballot quantity that each node obtains, and using nodes most for quantity of voting as the 3rd node;Described
Three nodes are to substitute described secondary nodal point and send to all neighbor nodes of described 3rd node concurrently
The node of heartbeat message;All neighbor nodes of described 3rd node include the neighbour of described 3rd node self
Occupy node and the neighbor node of described secondary nodal point.
9. according to the device described in claim 7 or 8, it is characterised in that:
Send at described other neighbor nodes each described determining that module receives according to described receiver module
Described response message in the reception state carried, determine that at least one other neighbor node described receives
In the case of described first heartbeat message,
Described determine that module is additionally operable to determine the node and described second not receiving described first heartbeat message
Link between node breaks down;The described node not receiving described first heartbeat message includes described
Primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
10. the fault detection system of a group system interior joint, it is characterised in that include primary nodal point,
Secondary nodal point and other neighbor nodes, described primary nodal point is the neighbor node of described secondary nodal point, described
Other neighbor nodes be described secondary nodal point all neighbor nodes in joint in addition to described primary nodal point
Point, the number of other neighbor nodes described is more than one, including:
Described secondary nodal point, for sending the to described primary nodal point and other neighbor nodes described concurrently
One heartbeat message;
Described primary nodal point, for judging whether receive described first heartbeat message in Preset Time;
Described Preset Time is more than or equal to a heart beat cycle, and less than two heart beat cycles;
In the case of described primary nodal point does not receives described first heartbeat message, described primary nodal point is also
For sending request message respectively to each other neighbor nodes described, described request message is used for inquiring often
Whether other neighbor nodes described in receive described first heartbeat message;And, described primary nodal point is also
For receiving the response message carrying reception state that each other neighbor nodes described send, described in connect
Receipts state is used for indicating whether to receive described first heartbeat message;
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that other neighbor nodes described all do not receive described first heart beating report
In the case of literary composition, described primary nodal point is additionally operable to determine that described secondary nodal point breaks down.
11. systems according to claim 10, it is characterised in that described primary nodal point determines described
After secondary nodal point breaks down, also include:
Described primary nodal point is additionally operable to:
Generate the first vote information, and receive the second vote information that each other neighbor nodes described send,
Described first vote information includes the node identification that node that described primary nodal point is elected is corresponding, described second
The node mark that the node of the neighbor node election that vote information includes sending described second vote information is corresponding
Know;
And, send out according to the node identification in described first vote information and each other neighbor nodes described
Node identification in the second vote information sent, in all nodes that statistics is elected, each node obtains
Ballot quantity, and using nodes most for quantity of voting as the 3rd node, described 3rd node is for substituting institute
State secondary nodal point and send the node of heartbeat message concurrently to all neighbor nodes of described 3rd node;
All neighbor nodes of described 3rd node include the neighbor node and described second of described 3rd node self
The neighbor node of node.
12. according to the system described in claim 10 or 11, it is characterised in that:
The described response sent according to other neighbor nodes each described received at described primary nodal point disappears
The reception state carried in breath, determines that at least one other neighbor node described receives described first heart
In the case of jumping message,
Described primary nodal point is additionally operable to determine the node and described second not receiving described first heartbeat message
Link between node breaks down;The described node not receiving described first heartbeat message includes described
Primary nodal point and other neighbor nodes described do not receive the node of described first heartbeat message.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510306800.0A CN106301853B (en) | 2015-06-05 | 2015-06-05 | The fault detection method and device of group system interior joint |
PCT/CN2016/073606 WO2016192408A1 (en) | 2015-06-05 | 2016-02-05 | Fault detection method and apparatus for node in cluster system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510306800.0A CN106301853B (en) | 2015-06-05 | 2015-06-05 | The fault detection method and device of group system interior joint |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106301853A true CN106301853A (en) | 2017-01-04 |
CN106301853B CN106301853B (en) | 2019-06-18 |
Family
ID=57440098
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510306800.0A Active CN106301853B (en) | 2015-06-05 | 2015-06-05 | The fault detection method and device of group system interior joint |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN106301853B (en) |
WO (1) | WO2016192408A1 (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107566219A (en) * | 2017-09-27 | 2018-01-09 | 华为技术有限公司 | Method for diagnosing faults, node device and computer equipment applied to group system |
CN107864486A (en) * | 2017-12-26 | 2018-03-30 | 杭州迪普科技股份有限公司 | A kind of offline AP detection methods and device |
CN108092857A (en) * | 2018-01-15 | 2018-05-29 | 郑州云海信息技术有限公司 | A kind of distributed system heartbeat detecting method and relevant apparatus |
CN108337274A (en) * | 2017-01-19 | 2018-07-27 | 贵州白山云科技有限公司 | A kind of message distributing method and system |
CN108683561A (en) * | 2018-05-16 | 2018-10-19 | 杭州迪普科技股份有限公司 | A kind of station state detection method and device |
CN109218141A (en) * | 2018-11-20 | 2019-01-15 | 郑州云海信息技术有限公司 | A kind of malfunctioning node detection method and relevant apparatus |
CN109428740A (en) * | 2017-08-21 | 2019-03-05 | 华为技术有限公司 | The method and apparatus that equipment fault restores |
CN109525408A (en) * | 2017-09-18 | 2019-03-26 | 杭州海康威视系统技术有限公司 | A kind of unit exception processing method, device and cloud storage system |
CN109714183A (en) * | 2017-10-26 | 2019-05-03 | 阿里巴巴集团控股有限公司 | Data processing method and device in a kind of cluster |
CN109873719A (en) * | 2019-02-03 | 2019-06-11 | 华为技术有限公司 | A kind of fault detection method and device |
CN110324166A (en) * | 2018-03-31 | 2019-10-11 | 华为技术有限公司 | A kind of method, apparatus and system of target information synchronous in multiple nodes |
CN110377570A (en) * | 2017-10-12 | 2019-10-25 | 腾讯科技(深圳)有限公司 | Node switching method, device, computer equipment and storage medium |
CN110380934A (en) * | 2019-07-23 | 2019-10-25 | 南京航空航天大学 | A kind of distribution redundant system heartbeat detecting method |
CN111181763A (en) * | 2019-11-28 | 2020-05-19 | 泰康保险集团股份有限公司 | Network fault reporting method and device |
CN111586110A (en) * | 2020-04-22 | 2020-08-25 | 广州锦行网络科技有限公司 | Optimization processing method for raft in point-to-point fault |
WO2020220231A1 (en) * | 2019-04-29 | 2020-11-05 | 华为海洋网络有限公司 | Submarine cable failure determination method and apparatus |
CN112398905A (en) * | 2020-09-28 | 2021-02-23 | 联想(北京)有限公司 | Node and information synchronization method |
CN112468372A (en) * | 2017-04-10 | 2021-03-09 | 华为技术有限公司 | Equipment state detection method and device in power line communication network |
CN112911520A (en) * | 2019-12-04 | 2021-06-04 | 哈尔滨海能达科技有限公司 | Method, device and storage medium for determining master node in ad hoc network |
CN112988463A (en) * | 2021-02-23 | 2021-06-18 | 新华三大数据技术有限公司 | Fault node isolation method and device |
CN113542052A (en) * | 2021-06-07 | 2021-10-22 | 新华三信息技术有限公司 | Node fault determination method and device and server |
CN113783735A (en) * | 2021-09-24 | 2021-12-10 | 小红书科技有限公司 | Method, device, equipment and medium for identifying fault node in Redis cluster |
CN115102886A (en) * | 2022-06-21 | 2022-09-23 | 上海驻云信息科技有限公司 | Task scheduling method and device for multiple acquisition clients |
CN116260705A (en) * | 2022-12-21 | 2023-06-13 | 广西壮族自治区自然资源信息中心 | Geographic information distributed cluster fault processing method, device, medium and equipment |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018214106A1 (en) * | 2017-05-25 | 2018-11-29 | 深圳市伊特利网络科技有限公司 | Update method and system for network connection list |
WO2019000954A1 (en) * | 2017-06-30 | 2019-01-03 | 中兴通讯股份有限公司 | Method, device and system for monitoring node survival state |
US10547499B2 (en) | 2017-09-04 | 2020-01-28 | International Business Machines Corporation | Software defined failure detection of many nodes |
CN109302445B (en) * | 2018-08-14 | 2021-10-12 | 新华三云计算技术有限公司 | Host node state determination method and device, host node and storage medium |
CN113923105B (en) * | 2021-12-13 | 2022-04-22 | 中机联科技(广东)有限公司 | Internet of things equipment fault monitoring method and system based on block chain |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070294596A1 (en) * | 2006-05-22 | 2007-12-20 | Gissel Thomas R | Inter-tier failure detection using central aggregation point |
CN101159536A (en) * | 2007-10-30 | 2008-04-09 | 中兴通讯股份有限公司 | Media gateway node condition synchronizing method in dual-home network |
CN102204169A (en) * | 2011-05-12 | 2011-09-28 | 华为技术有限公司 | Fault detection method, route node and system |
CN102612110A (en) * | 2012-03-02 | 2012-07-25 | 浙江大学 | Distributive self-organized routing method in electric carrier wave illumination control system |
CN102821011A (en) * | 2012-08-28 | 2012-12-12 | 北京星网锐捷网络技术有限公司 | Opposite terminal state detection method, device and equipment |
CN103297396A (en) * | 2012-02-28 | 2013-09-11 | 国际商业机器公司 | Management failure transferring device and method in cluster system |
CN103916275A (en) * | 2014-03-31 | 2014-07-09 | 杭州华三通信技术有限公司 | BFD detection device and method |
US20140301401A1 (en) * | 2013-04-07 | 2014-10-09 | Hangzhou H3C Technologies Co., Ltd. | Providing aggregation link groups in logical network device |
CN104283711A (en) * | 2014-09-29 | 2015-01-14 | 中国联合网络通信集团有限公司 | Fault detection method based on BFD, nodes and system |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102752143B (en) * | 2012-07-05 | 2015-08-19 | 杭州华三通信技术有限公司 | The BFD detection method of MPLS TE bidirectional tunnel and routing device |
-
2015
- 2015-06-05 CN CN201510306800.0A patent/CN106301853B/en active Active
-
2016
- 2016-02-05 WO PCT/CN2016/073606 patent/WO2016192408A1/en active Application Filing
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070294596A1 (en) * | 2006-05-22 | 2007-12-20 | Gissel Thomas R | Inter-tier failure detection using central aggregation point |
CN101159536A (en) * | 2007-10-30 | 2008-04-09 | 中兴通讯股份有限公司 | Media gateway node condition synchronizing method in dual-home network |
CN102204169A (en) * | 2011-05-12 | 2011-09-28 | 华为技术有限公司 | Fault detection method, route node and system |
CN103297396A (en) * | 2012-02-28 | 2013-09-11 | 国际商业机器公司 | Management failure transferring device and method in cluster system |
CN102612110A (en) * | 2012-03-02 | 2012-07-25 | 浙江大学 | Distributive self-organized routing method in electric carrier wave illumination control system |
CN102821011A (en) * | 2012-08-28 | 2012-12-12 | 北京星网锐捷网络技术有限公司 | Opposite terminal state detection method, device and equipment |
US20140301401A1 (en) * | 2013-04-07 | 2014-10-09 | Hangzhou H3C Technologies Co., Ltd. | Providing aggregation link groups in logical network device |
CN103916275A (en) * | 2014-03-31 | 2014-07-09 | 杭州华三通信技术有限公司 | BFD detection device and method |
CN104283711A (en) * | 2014-09-29 | 2015-01-14 | 中国联合网络通信集团有限公司 | Fault detection method based on BFD, nodes and system |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108337274A (en) * | 2017-01-19 | 2018-07-27 | 贵州白山云科技有限公司 | A kind of message distributing method and system |
CN112468372B (en) * | 2017-04-10 | 2023-10-13 | 华为技术有限公司 | Method and device for detecting equipment state in power line communication network |
CN112468372A (en) * | 2017-04-10 | 2021-03-09 | 华为技术有限公司 | Equipment state detection method and device in power line communication network |
CN109428740A (en) * | 2017-08-21 | 2019-03-05 | 华为技术有限公司 | The method and apparatus that equipment fault restores |
CN109428740B (en) * | 2017-08-21 | 2020-09-08 | 华为技术有限公司 | Method and device for recovering equipment failure |
CN109525408A (en) * | 2017-09-18 | 2019-03-26 | 杭州海康威视系统技术有限公司 | A kind of unit exception processing method, device and cloud storage system |
CN109525408B (en) * | 2017-09-18 | 2021-12-21 | 杭州海康威视系统技术有限公司 | Equipment exception handling method and device and cloud storage system |
CN107566219A (en) * | 2017-09-27 | 2018-01-09 | 华为技术有限公司 | Method for diagnosing faults, node device and computer equipment applied to group system |
CN107566219B (en) * | 2017-09-27 | 2020-09-18 | 华为技术有限公司 | Fault diagnosis method applied to cluster system, node equipment and computer equipment |
CN110377570B (en) * | 2017-10-12 | 2021-06-11 | 腾讯科技(深圳)有限公司 | Node switching method and device, computer equipment and storage medium |
CN110377570A (en) * | 2017-10-12 | 2019-10-25 | 腾讯科技(深圳)有限公司 | Node switching method, device, computer equipment and storage medium |
CN109714183A (en) * | 2017-10-26 | 2019-05-03 | 阿里巴巴集团控股有限公司 | Data processing method and device in a kind of cluster |
CN107864486A (en) * | 2017-12-26 | 2018-03-30 | 杭州迪普科技股份有限公司 | A kind of offline AP detection methods and device |
CN108092857A (en) * | 2018-01-15 | 2018-05-29 | 郑州云海信息技术有限公司 | A kind of distributed system heartbeat detecting method and relevant apparatus |
CN110324166A (en) * | 2018-03-31 | 2019-10-11 | 华为技术有限公司 | A kind of method, apparatus and system of target information synchronous in multiple nodes |
CN110324166B (en) * | 2018-03-31 | 2020-12-15 | 华为技术有限公司 | Method, device and system for synchronizing target information in multiple nodes |
CN108683561B (en) * | 2018-05-16 | 2020-10-02 | 杭州迪普科技股份有限公司 | Site state detection method and device |
CN108683561A (en) * | 2018-05-16 | 2018-10-19 | 杭州迪普科技股份有限公司 | A kind of station state detection method and device |
CN109218141A (en) * | 2018-11-20 | 2019-01-15 | 郑州云海信息技术有限公司 | A kind of malfunctioning node detection method and relevant apparatus |
CN109873719A (en) * | 2019-02-03 | 2019-06-11 | 华为技术有限公司 | A kind of fault detection method and device |
WO2020220231A1 (en) * | 2019-04-29 | 2020-11-05 | 华为海洋网络有限公司 | Submarine cable failure determination method and apparatus |
US11265080B2 (en) | 2019-04-29 | 2022-03-01 | Hmn Technologies Co., Limited | Submarine cable fault determining method and apparatus |
CN110380934A (en) * | 2019-07-23 | 2019-10-25 | 南京航空航天大学 | A kind of distribution redundant system heartbeat detecting method |
CN111181763A (en) * | 2019-11-28 | 2020-05-19 | 泰康保险集团股份有限公司 | Network fault reporting method and device |
CN112911520B (en) * | 2019-12-04 | 2022-05-31 | 哈尔滨海能达科技有限公司 | Method, device and storage medium for determining master node in ad hoc network |
CN112911520A (en) * | 2019-12-04 | 2021-06-04 | 哈尔滨海能达科技有限公司 | Method, device and storage medium for determining master node in ad hoc network |
CN111586110B (en) * | 2020-04-22 | 2021-03-19 | 广州锦行网络科技有限公司 | Optimization processing method for raft in point-to-point fault |
CN111586110A (en) * | 2020-04-22 | 2020-08-25 | 广州锦行网络科技有限公司 | Optimization processing method for raft in point-to-point fault |
CN112398905A (en) * | 2020-09-28 | 2021-02-23 | 联想(北京)有限公司 | Node and information synchronization method |
CN112398905B (en) * | 2020-09-28 | 2022-05-31 | 联想(北京)有限公司 | Node and information synchronization method |
CN112988463A (en) * | 2021-02-23 | 2021-06-18 | 新华三大数据技术有限公司 | Fault node isolation method and device |
CN112988463B (en) * | 2021-02-23 | 2022-08-30 | 新华三大数据技术有限公司 | Fault node isolation method and device |
CN113542052A (en) * | 2021-06-07 | 2021-10-22 | 新华三信息技术有限公司 | Node fault determination method and device and server |
CN113783735A (en) * | 2021-09-24 | 2021-12-10 | 小红书科技有限公司 | Method, device, equipment and medium for identifying fault node in Redis cluster |
CN115102886A (en) * | 2022-06-21 | 2022-09-23 | 上海驻云信息科技有限公司 | Task scheduling method and device for multiple acquisition clients |
CN116260705A (en) * | 2022-12-21 | 2023-06-13 | 广西壮族自治区自然资源信息中心 | Geographic information distributed cluster fault processing method, device, medium and equipment |
CN116260705B (en) * | 2022-12-21 | 2023-09-15 | 广西壮族自治区自然资源信息中心 | Geographic information distributed cluster fault processing method, device, medium and equipment |
Also Published As
Publication number | Publication date |
---|---|
WO2016192408A1 (en) | 2016-12-08 |
CN106301853B (en) | 2019-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106301853A (en) | The fault detection method of group system interior joint and device | |
CA2577200C (en) | Method and systems for copying data components between nodes of a network | |
CN103298012B (en) | A kind of AP fault detection method and equipment | |
CN102929696B (en) | A kind of distributed system Centroid structure, submission, method for supervising and device | |
KR20180133606A (en) | Apparatus and operating method for monitoring micro controller unit having multi-core | |
CN105379201B (en) | Method, controller and the failover interchanger of path switching | |
CN104301140B (en) | Service request response method, device and system | |
CN106034045A (en) | Ethernet link failure positioning method, device and system | |
CN105897507A (en) | Node equipment state detection method and node equipment state detection device | |
CN108683528B (en) | Data transmission method, central server, server and data transmission system | |
CN109039795A (en) | A kind of Cloud Server resource monitoring method and system | |
CN110943877A (en) | Network state measuring method, equipment and system | |
CN107426051A (en) | The monitoring method of the working condition of distributed cluster system interior joint, apparatus and system | |
US8681645B2 (en) | System and method for coordinated discovery of the status of network routes by hosts in a network | |
US20060031445A1 (en) | Method and apparatus for determination of network topology | |
US10164856B2 (en) | Reconciliation of asymmetric topology in a clustered environment | |
CN105187264B (en) | A kind of method, communication equipment and the system of direct connected link quality-monitoring | |
Xiaolin et al. | Probe selection algorithm for faulty links localization in all-optical networks | |
WO2023011163A1 (en) | Beam quality monitoring method and apparatus, and device and medium | |
JP2014103516A (en) | Packet transfer delay measuring apparatus, method and program | |
CN109120547A (en) | A kind of message length determines method and device | |
CN110752967A (en) | Interface fault detection method, device, equipment and storage medium | |
JPH0260338A (en) | Bus type lan | |
KR20170122403A (en) | System and method for diagnosing based on data distribution service network | |
CN117176626A (en) | Node detection method, device, equipment and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |