[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN113965523A - Loop-based PFC deadlock processing method and device - Google Patents

Loop-based PFC deadlock processing method and device Download PDF

Info

Publication number
CN113965523A
CN113965523A CN202111264674.9A CN202111264674A CN113965523A CN 113965523 A CN113965523 A CN 113965523A CN 202111264674 A CN202111264674 A CN 202111264674A CN 113965523 A CN113965523 A CN 113965523A
Authority
CN
China
Prior art keywords
pfc
deadlock
port
pfc deadlock
timer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202111264674.9A
Other languages
Chinese (zh)
Inventor
黄伟杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ruijie Networks Co Ltd
Original Assignee
Ruijie Networks Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ruijie Networks Co Ltd filed Critical Ruijie Networks Co Ltd
Priority to CN202111264674.9A priority Critical patent/CN113965523A/en
Publication of CN113965523A publication Critical patent/CN113965523A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2425Traffic characterised by specific attributes, e.g. priority or QoS for supporting services specification, e.g. SLA
    • H04L47/2433Allocation of priorities to traffic types
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/12Avoiding congestion; Recovering from congestion

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a method and a device for processing PFC deadlock based on a loop, wherein the method is applied to network equipment in a data center network and comprises the following steps: detecting whether a PFC deadlock occurs at a self port according to a preset period; when the PFC deadlock of the first port is detected, starting a first timer, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock; when the first timer is up, acquiring a PFC deadlock state of the first port; and when the PFC deadlock state of the first port is not released, resetting the first timer and starting, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock. The embodiment of the invention can solve the problem that in the prior art, when a data center network loop has PFC deadlock, the recovery is slow or even cannot be realized.

Description

Loop-based PFC deadlock processing method and device
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a method and an apparatus for processing a Priority-based Flow Control (PFC) deadlock.
Background
A data center network usually has a plurality of different types of message flows, the reason for network congestion may be caused by only one type of message flow, but the common flow control function is to suspend all traffic on a line for a period of time by sending a Pause frame to relieve the congestion phenomenon; the common flow control function cannot separate different types of message flows, which can stop all types of message flows on the whole link, has a wide influence range, and particularly has a large influence on delay-sensitive high-performance computation flows in a network, so that the common flow control function cannot meet the requirements of a data center network.
The PFC can differentiate the types of packet flows by using the 802.1p priority field in the Vlan Tag of the packet, create 8 virtual channels on the link, assign a priority to each virtual channel, send the packet to the virtual channel with the corresponding priority when receiving the packet, and any one of the channels can individually suspend and resume data transmission. When the virtual channel is congested, the priority assigned to the virtual channel can detect which priority message flows cause congestion, and then the sending of the message flows causing congestion is suspended by sending the PFC flow control frame.
However, when there is a loop in the network for the message stream sent by the server, as shown in fig. 1: the message flow returns to the device 1 after sequentially passing through the device 1, the device 4, the device 3 and the device 5; in this case, a PFC Deadlock (PFC dead lock) may be caused, specifically referring to fig. 2, when the device 3 detects that the used buffer of the logical channel corresponding to a certain priority packet flow reaches a preset congestion waterline, the device 3 is triggered to send out a PFC pause frame to notify an upstream device (device 4) of limiting the speed (stopping sending packets); the device 4 receives the PFC pause frame and stops sending packets; since the device 4 stops sending packets but continues receiving packets, the buffer corresponding to the packet flow in the device 4 reaches the preset congestion waterline again, and the PFC pause frame is also triggered to the device 1; the device 1 triggers the PFC pause frame to the device 5 similarly, and the device 5 triggers the PFC pause frame to the device 3; all devices in the loop trigger the PFC pause frame, and all the devices stop sending the flow; resulting in a PFC Deadlock.
When the PFC Deadlock occurs in the loop, the device may process the packet accumulated in the queue corresponding to the buffer according to a preset PFC Deadlock policy, for example, may select to forward or discard, however, the PFC Deadlock may only release the queue buffer corresponding to the current priority, so as to break the PFC Deadlock. The deadlock principle of breaking the loop is as follows: and releasing a certain space for the whole loop, and reducing the buffer occupation of all the switches to be below a congestion line so as to remove PFC (power factor correction) inhibition on the loop. However, in an environment with many loop devices, each device does not release the buffer space enough to lower the buffers on the entire loop below the congestion level according to the preset PFC delay policy, so that the Deadlock cannot be released necessarily, and the PFC delay may be triggered again in a short time to execute the corresponding logic again, so that the message cannot be recovered to normal for a long time.
Disclosure of Invention
The embodiment of the invention provides a loop-based PFC deadlock processing method and device, which are used for solving the problems that in the prior art, when a data center network loop is in PFC deadlock, recovery is slow or even cannot be achieved.
According to an embodiment of the present invention, a method for processing a loop-based PFC deadlock is provided, where the method is applied to a network device in a data center network, and includes:
detecting whether a PFC deadlock occurs to a port of the network equipment according to a preset period;
when the PFC deadlock of the first port is detected, starting a first timer, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock;
when the first timer is up, acquiring a PFC deadlock state of the first port;
and when the PFC deadlock state of the first port is not released, resetting the first timer and starting, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock.
Further, after acquiring the PFC deadlock status of the first port, the method further includes:
and when the PFC deadlock state of the first port is released, ending the recovery action of the PFC deadlock, and continuously detecting whether the port of the network equipment is in PFC deadlock according to a preset period.
The recovering action of the PFC deadlock according to the preset PFC deadlock strategy comprises the following steps:
emptying the messages in the message queue with the congestion according to a preset PFC deadlock strategy;
and stopping sending the PFC pause frame to the upstream equipment, wherein the PFC pause frame is used for informing the upstream equipment of stopping sending the message to the network equipment within a preset time length.
Wherein the acquiring the PFC deadlock state of the first port includes:
judging whether the first port receives a PFC pause frame sent by downstream equipment;
when a PFC pause frame sent by downstream equipment is received, a second timer is reset and started, PFC suppression is carried out on a message queue of the first port corresponding to the PFC frame, and the PFC deadlock state of the first port is not released;
when the second timer is up and the PFC pause frame sent by the downstream equipment is still not received, the PFC suppression of the message queue of the first port corresponding to the PFC frame is released, and the PFC deadlock state of the first port is released.
According to an embodiment of the present invention, there is also provided a device for processing PFC deadlock based on a loop, where the device is applied to a network device in a data center network, and the device includes: the device comprises a detection unit, a processing unit and an acquisition unit;
the detection unit is used for detecting whether PFC deadlock occurs to a port of the network equipment according to a preset period;
the processing unit is used for starting a first timer when the detection unit detects that PFC deadlock occurs at the first port, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock;
the obtaining unit is configured to obtain a PFC deadlock state of the first port when the first timer expires;
the processing unit is further configured to, when the PFC deadlock state of the first port is not released, reset the first timer and start up, and continue to perform a PFC deadlock recovery action according to a preset PFC deadlock policy to release the PFC deadlock.
Further, the processing unit is further configured to end the PFC deadlock recovery action when the PFC deadlock state of the first port is released;
the detection unit is further configured to, when the PFC deadlock state of the first port is released, continue to detect whether PFC deadlock occurs to the port of the network device according to a preset period.
The processing unit is configured to, when performing a PFC deadlock recovery action according to a preset PFC deadlock strategy, specifically: emptying the messages in the message queue with the congestion according to a preset PFC deadlock strategy; and stopping sending the PFC pause frame to the upstream equipment, wherein the PFC pause frame is used for informing the upstream equipment of stopping sending the message to the network equipment within a preset time length.
The obtaining unit is configured to, when obtaining the PFC deadlock state of the first port, specifically:
judging whether the first port receives a PFC pause frame sent by downstream equipment;
when a PFC pause frame sent by downstream equipment is received, a second timer is reset and started, PFC suppression is carried out on a message queue of the first port corresponding to the PFC frame, and the PFC deadlock state of the first port is not released;
when the second timer is up and the PFC pause frame sent by the downstream equipment is still not received, the PFC suppression of the message queue of the first port corresponding to the PFC frame is released, and the PFC deadlock state of the first port is released.
According to the embodiment of the invention, the electronic equipment comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory complete mutual communication through the communication bus;
a memory for storing a computer program;
a processor for implementing the above method steps when executing the program stored in the memory.
According to an embodiment of the present invention, there is also provided a computer-readable storage medium having stored therein a computer program, which when executed by a processor, performs the above-mentioned method steps.
The invention has the following beneficial effects:
according to the loop-based PFC deadlock processing method and device provided by the embodiment of the invention, the network equipment detects whether PFC deadlock occurs at a port of the network equipment according to a preset period; when the PFC deadlock of the first port is detected, starting a first timer, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock; when the first timer is up, acquiring a PFC deadlock state of the first port; and when the PFC deadlock state of the first port is not released, resetting the first timer and starting, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock. In the embodiment of the invention, after the network equipment starts to execute the recovery action, the PFC deadlock state of the first port is periodically acquired according to the first timer, and as long as the acquired PFC deadlock state of the first port is not released, the recovery action is continuously executed and the PFC deadlock device of the first port is continuously acquired until the PFC deadlock is released, so that whether the recovery action is executed again or not is determined without detecting whether the port of the network equipment is in PFC deadlock or not again, the time consumption of PFC deadlock detection is avoided, the recovery of PFC deadlock can be quickly realized, and the continuity of network service is ensured.
Drawings
FIG. 1 is a schematic diagram of a network topology of a loop in a data center network;
FIG. 2 is a schematic diagram of a network topology in which loops in a data center network are congested;
fig. 3 is a flowchart of a method for processing a loop-based PFC deadlock in an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a processing apparatus for loop-based PFC deadlock in an embodiment of the present invention;
fig. 5 is a schematic structural diagram of an electronic device shown in the present application.
Detailed Description
Aiming at the problem that when a data center network loop is in PFC deadlock, recovery is slow or even cannot be recovered in the prior art, the PFC deadlock processing method based on the loop provided by the embodiment of the invention comprises the steps of starting a first timer when PFC deadlock of a first port is detected, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock; when the first timer is up, acquiring a PFC deadlock state of the first port; and when the PFC deadlock state of the first port is not released, resetting the first timer and starting, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock. The flow of the method of the present invention is shown in fig. 3, and is applied to a network device of a data center network, and the method comprises the following steps:
step 301, detecting whether a port of a network device is subjected to PFC deadlock according to a preset period;
step 302, when detecting that the PFC deadlock occurs at the first port, starting a first timer, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock;
step 303, when the first timer is up, acquiring a PFC deadlock state of the first port;
and 304, when the PFC deadlock state of the first port is not released, resetting and starting the first timer, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock.
After acquiring the PFC deadlock state of the first port, the method further includes:
and when the PFC deadlock state of the first port is released, ending the recovery action of the PFC deadlock, and continuously detecting whether the port of the network equipment is in PFC deadlock according to a preset period.
Optionally, the performing, according to a preset PFC deadlock policy, a recovery action of the PFC deadlock includes:
emptying the messages in the message queue with the congestion according to a preset PFC deadlock strategy; here, the message may be forwarded according to a configured rule or directly discarded;
and stopping sending the PFC pause frame to the upstream equipment, wherein the PFC pause frame is used for informing the upstream equipment of stopping sending the message to the network equipment within the preset time length. Here, when the packet queue of the network device is congested, a PFC pause frame is sent to the upstream device to notify the upstream device not to send packets to the upstream device any more within a certain time period, so that when the packet queue is empty, that is, the packet queue is not congested, at this time, it is necessary to stop sending the PFC pause frame to the upstream device, and the upstream device may continue sending packets to the network device. The value of the preset duration is carried in the PFC pause frame, and the network device analyzes the value of the preset duration after receiving the PFC pause frame.
Optionally, the acquiring the PFC deadlock state of the first port includes:
judging whether the first port receives a PFC pause frame sent by downstream equipment;
when a PFC pause frame sent by downstream equipment is received, a second timer is reset and started, PFC suppression is carried out on a message queue of the first port corresponding to the PFC frame, and the PFC deadlock state of the first port is not released; here, performing PFC suppression on the packet queue may be understood that the state of the port is an xoff state, and the function of suppressing traffic forwarding of the PFC of the port is activated in this state. The duration of the second timer may be set according to a value of a preset duration carried in the PFC pause frame.
When the second timer is up and the PFC pause frame sent by the downstream equipment is still not received, the PFC suppression of the message queue of the first port corresponding to the PFC frame is released, and the PFC deadlock state of the first port is released. Here, each time a PFC pause frame sent by the downstream device is received, the second timer which is counting time is reset, and the second timer is returned to the state where the second timer is just started and then started, that is, the second timer is restarted each time a PFC pause frame sent by the downstream device is received; therefore, when the second timer expires, it means that the PFC pause frame sent by the downstream device has not been received within the timing duration of the second timer for restarting.
Here, when the PFC pause frame sent by the downstream device is not received, it indicates that the congestion of the downstream device is released, and the network device may continue to send the packet to the downstream device, and then the congestion of the network device itself is released, and the network device also stops sending the PFC pause frame to the upstream device, and so on, and then the congestion of the entire loop is released.
Based on the same inventive concept, an embodiment of the present invention provides a loop-based apparatus for processing PFC deadlock, which can be applied to a network device in a data center network, and has a structure shown in fig. 4, where the apparatus includes: a detection unit 41, a processing unit 42, and an acquisition unit 43;
the detecting unit 41 is configured to detect whether a PFC deadlock occurs at a port of the network device according to a preset period;
the processing unit 42 is configured to start a first timer when the detection unit 41 detects that the PFC deadlock occurs at the first port, and perform a PFC deadlock recovery action according to a preset PFC deadlock policy to release the PFC deadlock;
the obtaining unit 43 is configured to obtain a PFC deadlock state of the first port when the first timer expires;
the processing unit 42 is further configured to, when the PFC deadlock state of the first port is not resolved, reset the first timer and start up, and continue to perform a PFC deadlock recovery action according to a preset PFC deadlock policy to resolve the PFC deadlock.
The processing unit 42 is further configured to end the recovery action of the PFC deadlock when the PFC deadlock state of the first port is released;
correspondingly, the detecting unit 41 is further configured to, when the PFC deadlock state of the first port is released, continue to detect whether PFC deadlock occurs on the port of the network device according to a preset period.
The processing unit 42 is configured to, when performing a PFC deadlock recovery action according to a preset PFC deadlock policy, specifically: emptying the messages in the message queue with the congestion according to a preset PFC deadlock strategy; and stopping sending the PFC pause frame to the upstream equipment, wherein the PFC pause frame is used for informing the upstream equipment of stopping sending the message to the network equipment within a preset time length. Here, when congestion occurs in a message queue of the upstream device, a PFC pause frame is sent to the upstream device to notify the upstream device not to send a message to the upstream device any longer within a certain time period, so that when the message queue is empty, that is, the message queue is not congested, at this time, sending of the PFC pause frame to the upstream device needs to be stopped, and the upstream device may continue sending a message to the network device. The value of the preset duration is carried in the PFC pause frame, and the network device analyzes the value of the preset duration after receiving the PFC pause frame.
Optionally, the obtaining unit 43 is configured to, when obtaining the PFC deadlock state of the first port, specifically:
judging whether the first port receives a PFC pause frame sent by downstream equipment;
when a PFC pause frame sent by downstream equipment is received, a second timer is reset and started, PFC suppression is carried out on a message queue of the first port corresponding to the PFC frame, and the PFC deadlock state of the first port is not released; here, performing PFC suppression on the packet queue may be understood that the state of the port is an xoff state, and the function of suppressing traffic forwarding of the PFC of the port is activated in this state. The duration of the second timer may be set according to a value of a preset duration carried in the PFC pause frame.
When the second timer is up and the PFC pause frame sent by the downstream equipment is still not received, the PFC suppression of the message queue of the first port corresponding to the PFC frame is released, and the PFC deadlock state of the first port is released. Here, each time a PFC pause frame sent by the downstream device is received, the second timer which is counting time is reset, and the second timer is returned to the state where the second timer is just started and then started, that is, the second timer is restarted each time a PFC pause frame sent by the downstream device is received; therefore, when the second timer expires, it means that the PFC pause frame sent by the downstream device has not been received within the timing duration of the second timer for restarting.
Here, when the PFC pause frame sent by the downstream device is not received, it indicates that the congestion of the downstream device is released, and the network device may continue to send the packet to the downstream device, and then the congestion of the network device itself is released, and the network device also stops sending the PFC pause frame to the upstream device, and so on, and then the congestion of the entire loop is released.
It should be understood that the implementation principle and the process of the loop-based PFC deadlock processing apparatus provided in the embodiment of the present invention are similar to those in the embodiment shown in fig. 3 and described above, and are not described in detail herein.
According to the loop-based PFC deadlock processing method and device provided by the embodiment of the invention, the network equipment detects whether PFC deadlock occurs at a port of the network equipment according to a preset period; when the PFC deadlock of the first port is detected, starting a first timer, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock; when the first timer is up, acquiring a PFC deadlock state of the first port; and when the PFC deadlock state of the first port is not released, resetting the first timer and starting, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock. In the embodiment of the invention, after the network equipment starts to execute the recovery action, the PFC deadlock state of the first port is periodically acquired according to the first timer, and as long as the acquired PFC deadlock state of the first port is not released, the recovery action is continuously executed and the PFC deadlock device of the first port is continuously acquired until the PFC deadlock is released, so that whether the recovery action is executed again is determined without detecting whether the self port is in PFC deadlock again, the time consumption of PFC deadlock detection is avoided, the recovery of PFC deadlock can be quickly realized, and the continuity of network service is ensured.
It should be understood that, in a non-loop scenario, for example, a scenario in which a single device failure causes a message to be abnormally sent, thereby causing a PFC deadlock, may also be solved by using the above technical solution, and whether link congestion has been relieved is determined by deadlock relief determination, thereby quickly resolving an abnormal congestion environment at one time.
An electronic device is further provided in the embodiment of the present application, please refer to fig. 5, which includes a processor 510, a communication interface 520, a memory 530 and a communication bus 540, wherein the processor 510, the communication interface 520 and the memory 530 complete communication with each other through the communication bus 540.
A memory 530 for storing a computer program;
the processor 510 is configured to implement the loop-based PFC deadlock processing method according to any of the embodiments described above when executing the program stored in the memory 530.
The communication interface 520 is used for communication between the electronic apparatus and other apparatuses.
The Memory may include a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.
The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but also Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components.
According to the scheme, after the network equipment starts to execute the recovery action, the PFC deadlock state of the first port is periodically acquired according to the first timer, and as long as the acquired PFC deadlock state of the first port is not released, the recovery action is continuously executed and the PFC deadlock device of the first port is continuously acquired until the PFC deadlock is released, so that whether the recovery action is executed again is determined without detecting whether the port of the network equipment is in PFC deadlock or not again, time consumption of PFC deadlock detection is avoided, recovery of PFC deadlock can be quickly achieved, and continuity of network service is guaranteed.
Accordingly, an embodiment of the present application further provides a computer-readable storage medium, in which instructions are stored, and when the instructions are executed on a computer, the computer is caused to execute the loop-based PFC deadlock processing method described in any one of the above embodiments.
According to the scheme, after the network equipment starts to execute the recovery action, the PFC deadlock state of the first port is periodically acquired according to the first timer, and as long as the acquired PFC deadlock state of the first port is not released, the recovery action is continuously executed and the PFC deadlock device of the first port is continuously acquired until the PFC deadlock is released, so that whether the recovery action is executed again is determined without detecting whether the port of the network equipment is in PFC deadlock or not again, time consumption of PFC deadlock detection is avoided, recovery of PFC deadlock can be quickly achieved, and continuity of network service is guaranteed.
Those of ordinary skill in the art will understand that: the figures are merely schematic representations of one embodiment, and the blocks or flow diagrams in the figures are not necessarily required to practice the present invention.
From the above description of the embodiments, it is clear to those skilled in the art that the present invention can be implemented by software plus necessary general hardware platform. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which may be stored in a storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method according to the embodiments or some parts of the embodiments.
The embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for apparatus or system embodiments, since they are substantially similar to method embodiments, they are described in relative terms, as long as they are described in partial descriptions of method embodiments. The above-described embodiments of the apparatus and system are merely illustrative, and the units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
In addition, in some of the flows described in the above embodiments and the drawings, a plurality of operations are included in a specific order, but it should be clearly understood that the operations may be executed out of the order presented herein or in parallel, and the sequence numbers of the operations, such as 201, 202, 203, etc., are merely used for distinguishing different operations, and the sequence numbers themselves do not represent any execution order. Additionally, the flows may include more or fewer operations, and the operations may be performed sequentially or in parallel. It should be noted that, the descriptions of "first", "second", etc. in this document are used for distinguishing different messages, devices, modules, etc., and do not represent a sequential order, nor limit the types of "first" and "second" to be different.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While alternative embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. It is therefore intended that the following appended claims be interpreted as including alternative embodiments and all such alterations and modifications as fall within the scope of the invention.
It will be apparent to those skilled in the art that various modifications and variations can be made in the embodiments of the present invention without departing from the spirit or scope of the embodiments of the invention. Thus, if such modifications and variations of the embodiments of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to encompass such modifications and variations.

Claims (10)

1. A method for processing PFC deadlock based on a loop is applied to network equipment in a data center network and comprises the following steps:
detecting whether a PFC deadlock occurs to a port of the network equipment according to a preset period;
when the PFC deadlock of the first port is detected, starting a first timer, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock;
when the first timer is up, acquiring a PFC deadlock state of the first port;
and when the PFC deadlock state of the first port is not released, resetting the first timer and starting, and continuing executing the action of recovering the PFC deadlock according to a preset PFC deadlock strategy to release the PFC deadlock.
2. The method of claim 1, wherein after obtaining the PFC deadlock status of the first port, the method further comprises:
and when the PFC deadlock state of the first port is released, ending the recovery action of the PFC deadlock, and continuously detecting whether the port of the network equipment is in PFC deadlock according to a preset period.
3. The method of claim 1, wherein the performing the PFC deadlock recovery action according to the preset PFC deadlock strategy comprises:
emptying the messages in the message queue with the congestion according to a preset PFC deadlock strategy;
and stopping sending the PFC pause frame to the upstream equipment, wherein the PFC pause frame is used for informing the upstream equipment of stopping sending the message to the network equipment within a preset time length.
4. The method according to any one of claims 1 to 3, wherein the obtaining the PFC deadlock status of the first port comprises:
judging whether the first port receives a PFC pause frame sent by downstream equipment;
when a PFC pause frame sent by downstream equipment is received, a second timer is reset and started, PFC suppression is carried out on a message queue of the first port corresponding to the PFC frame, and the PFC deadlock state of the first port is not released;
when the second timer is up and the PFC pause frame sent by the downstream equipment is still not received, the PFC suppression of the message queue of the first port corresponding to the PFC frame is released, and the PFC deadlock state of the first port is released.
5. A loop-based PFC deadlock processing device is applied to a network device in a data center network and comprises the following components: the device comprises a detection unit, a processing unit and an acquisition unit;
the detection unit is used for detecting whether PFC deadlock occurs to a port of the network equipment according to a preset period;
the processing unit is used for starting a first timer when the detection unit detects that PFC deadlock occurs at the first port, and performing PFC deadlock recovery action according to a preset PFC deadlock strategy to remove the PFC deadlock;
the obtaining unit is configured to obtain a PFC deadlock state of the first port when the first timer expires;
the processing unit is further configured to, when the PFC deadlock state of the first port is not released, reset the first timer and start up, and continue to perform a PFC deadlock recovery action according to a preset PFC deadlock policy to release the PFC deadlock.
6. The apparatus of claim 5, wherein the processing unit is further configured to end the PFC deadlock recovery action when the PFC deadlock state of the first port is released;
the detection unit is further configured to, when the PFC deadlock state of the first port is released, continue to detect whether PFC deadlock occurs to the port of the network device according to a preset period.
7. The apparatus according to claim 5, wherein the processing unit, when performing the PFC deadlock recovery action according to a preset PFC deadlock policy, is specifically configured to: emptying the messages in the message queue with the congestion according to a preset PFC deadlock strategy; and stopping sending the PFC pause frame to the upstream equipment, wherein the PFC pause frame is used for informing the upstream equipment of stopping sending the message to the network equipment within a preset time length.
8. The apparatus according to any one of claims 5 to 7, wherein the obtaining unit, when obtaining the PFC deadlock state of the first port, is specifically:
judging whether the first port receives a PFC pause frame sent by downstream equipment;
when a PFC pause frame sent by downstream equipment is received, a second timer is reset and started, PFC suppression is carried out on a message queue of the first port corresponding to the PFC frame, and the PFC deadlock state of the first port is not released;
when the second timer is up and the PFC pause frame sent by the downstream equipment is still not received, the PFC suppression of the message queue of the first port corresponding to the PFC frame is released, and the PFC deadlock state of the first port is released.
9. An electronic device, characterized in that the electronic device comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory are communicated with each other through the communication bus;
a memory for storing a computer program;
a processor for implementing the method steps of any of claims 1-4 when executing a program stored on a memory.
10. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of claims 1 to 4.
CN202111264674.9A 2021-10-28 2021-10-28 Loop-based PFC deadlock processing method and device Withdrawn CN113965523A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111264674.9A CN113965523A (en) 2021-10-28 2021-10-28 Loop-based PFC deadlock processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111264674.9A CN113965523A (en) 2021-10-28 2021-10-28 Loop-based PFC deadlock processing method and device

Publications (1)

Publication Number Publication Date
CN113965523A true CN113965523A (en) 2022-01-21

Family

ID=79467958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111264674.9A Withdrawn CN113965523A (en) 2021-10-28 2021-10-28 Loop-based PFC deadlock processing method and device

Country Status (1)

Country Link
CN (1) CN113965523A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114640635A (en) * 2022-03-17 2022-06-17 新华三技术有限公司合肥分公司 Method and device for processing PFC deadlock

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140146666A1 (en) * 2012-11-28 2014-05-29 Broadcom Corporation Deadlock recovery for distributed devices
CN106506279A (en) * 2016-11-11 2017-03-15 盛科网络(苏州)有限公司 Network lock-up condition detection method and device
CN110995534A (en) * 2019-11-26 2020-04-10 锐捷网络股份有限公司 Detection method and device for PFC deadlock
US20210006502A1 (en) * 2019-02-02 2021-01-07 Huawei Technologies Co., Ltd. Flow control method and apparatus
CN112242914A (en) * 2019-07-18 2021-01-19 华为技术有限公司 Network abnormal root cause positioning method, device and system and computer storage medium
CN112448900A (en) * 2019-09-02 2021-03-05 华为技术有限公司 Data transmission method and device
CN113300975A (en) * 2020-07-20 2021-08-24 阿里巴巴集团控股有限公司 Control method of network equipment, network transmission method, device and equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140146666A1 (en) * 2012-11-28 2014-05-29 Broadcom Corporation Deadlock recovery for distributed devices
CN106506279A (en) * 2016-11-11 2017-03-15 盛科网络(苏州)有限公司 Network lock-up condition detection method and device
US20210006502A1 (en) * 2019-02-02 2021-01-07 Huawei Technologies Co., Ltd. Flow control method and apparatus
CN112242914A (en) * 2019-07-18 2021-01-19 华为技术有限公司 Network abnormal root cause positioning method, device and system and computer storage medium
US20210021540A1 (en) * 2019-07-18 2021-01-21 Huawei Technologies Co., Ltd. Method, Apparatus, and System for Locating Root Cause of Network Anomaly, and Computer Storage Medium
CN112448900A (en) * 2019-09-02 2021-03-05 华为技术有限公司 Data transmission method and device
CN110995534A (en) * 2019-11-26 2020-04-10 锐捷网络股份有限公司 Detection method and device for PFC deadlock
CN113300975A (en) * 2020-07-20 2021-08-24 阿里巴巴集团控股有限公司 Control method of network equipment, network transmission method, device and equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114640635A (en) * 2022-03-17 2022-06-17 新华三技术有限公司合肥分公司 Method and device for processing PFC deadlock
CN114640635B (en) * 2022-03-17 2024-02-09 新华三技术有限公司合肥分公司 PFC deadlock processing method and device

Similar Documents

Publication Publication Date Title
CN108989235B (en) Message forwarding control method and device
US6618357B1 (en) Queue management for networks employing pause time based flow control
CN113190487B (en) PCIe (peripheral component interconnect express) equipment hot-drawing method and device
CN101582852A (en) Method and system for network congestion management
CN107977302B (en) CPU utilization rate output method and device
CN102611630B (en) A kind of message acceptance control method and system
WO2013169512A1 (en) Grace state and pacing in link aggregation
CN109379246B (en) Memory detection method and device
CN113965523A (en) Loop-based PFC deadlock processing method and device
CN109828945B (en) Service message processing method and system
CN114640635A (en) Method and device for processing PFC deadlock
US9929899B2 (en) Snapshot message
CN108563530A (en) House dog and its implementation
CN118101568A (en) CPU soft forwarding congestion processing method, system, equipment and readable storage medium
US11550694B2 (en) Packet backpressure detection method, apparatus, and device
CN114070799A (en) Processing method and processing device of priority pause frame and target network equipment
CN103746856B (en) Method and network device for preventing service interruption
US10951526B2 (en) Technologies for efficiently determining a root of congestion with a multi-stage network switch
CN105224411B (en) It is a kind of to avoid CPU from being depleted the method for causing house dog to be restarted
EP4304150A1 (en) Frame preemption method and apparatus, and device and storage medium
CN108781170B (en) Configuration device and method
CN108616461B (en) Policy switching method and device
CN103873386B (en) Method for managing network transmission sequence
CN112769667B (en) Loop protection control method and device for multi-device cascade communication network
US20200336433A1 (en) Limiting Backpressure With Bad Actors

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20220121