CN114338366B - Data center fault alarm signal positioning method and system - Google Patents
Data center fault alarm signal positioning method and system Download PDFInfo
- Publication number
- CN114338366B CN114338366B CN202111586005.3A CN202111586005A CN114338366B CN 114338366 B CN114338366 B CN 114338366B CN 202111586005 A CN202111586005 A CN 202111586005A CN 114338366 B CN114338366 B CN 114338366B
- Authority
- CN
- China
- Prior art keywords
- analysis module
- network
- fault
- data center
- alarm signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 13
- 238000004458 analytical method Methods 0.000 claims abstract description 54
- 238000005206 flow analysis Methods 0.000 claims abstract description 26
- 238000007726 management method Methods 0.000 claims description 23
- 238000001514 detection method Methods 0.000 claims description 3
- 238000013468 resource allocation Methods 0.000 claims description 3
- 238000003032 molecular docking Methods 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- 238000013024 troubleshooting Methods 0.000 claims 1
- 238000012423 maintenance Methods 0.000 abstract description 4
- 230000006855 networking Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 238000011835 investigation Methods 0.000 description 3
- 239000013307 optical fiber Substances 0.000 description 3
- 238000002955 isolation Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention relates to the technical field of fault positioning, in particular to a data center fault alarm signal positioning method and system; the data center fault alarm signal positioning system comprises a protocol analysis module, a flow analysis module, a device analysis module, a management interface and an SDN network architecture, wherein the management interface comprises an SNMP (simple network management protocol), a Netconf (network conf) and an FTP (file transfer protocol) interface, the protocol analysis module, the flow analysis module and the device analysis module are all connected with the SDN network architecture through the management interface, and the system is deployed on a physical server or a virtual machine to perform multidimensional network fault positioning analysis through protocols, flow and devices, and replaces the splitting analysis of network equipment endpoints by an end-to-end concept, so that the network fault positioning speed is accelerated, and the operation and maintenance fault positioning pressure can be well relieved.
Description
Technical Field
The invention relates to the technical field of fault positioning, in particular to a data center fault alarm signal positioning method and system.
Background
With the rapid increase of the number of cloud service users and service contents, the scale of a data center is also larger and larger, the number of contained servers is exponentially increasing, and the data exchange volume is also increased in a super-linear mode. Within a data center, fiber optic communication technology is used to connect a large number of switches and servers, providing an efficient solution for high capacity, high performance, scalable, survivable services of the data center. However, since the optical fiber is easily damaged, once the optical fiber link in the network is damaged, all traffic on the link is interrupted. The data center optical network adopts a parallel mode to transmit data, and the transmission rate is quite high, so that even if only one optical fiber link is destroyed, a large amount of service interruption and data loss can be caused.
The traditional data center is not separated from the control and forwarding in architecture, the control function and the forwarding function are concentrated in the same network equipment, and the whole network is fixed, inconvenient to adjust and incapable of being controlled in a centralized way.
Disclosure of Invention
The invention aims to provide a data center fault alarm signal positioning method and system, and aims to solve the technical problems that a data center in the prior art is not separated in structure from forwarding, a control function and a forwarding function are concentrated in the same network equipment, and the whole network is fixed, inconvenient to adjust and incapable of being controlled in a centralized manner.
In order to achieve the above objective, the present invention provides a data center fault alarm signal positioning system, where the data center fault alarm signal positioning system includes a protocol analysis module, a flow analysis module, an equipment analysis module, a management interface and an SDN network architecture, where the protocol analysis module, the flow analysis module and the equipment analysis module are all connected with the SDN network architecture through the management interface, and the management interface includes SNMP, netconf and FTP interfaces;
the SDN network architecture is responsible for resource allocation, redundancy management, error management and elastic adjustment, and realizes automatic opening and deployment of network resources in cloud services;
the protocol analysis module is used for checking the trend and the loss problem of the data packet by taking the analysis of the switch flow table as a main line;
the flow analysis module analyzes through statistical information in a statistical domain to realize fault location;
the device analysis module checks the actual condition of the data packet by checking the physical network device.
The switch is an OpenFlow switch, and comprises a flow table, a group table and an OpenFlow channel connected with the SND controller, and the vSwitch is used as a VXLAN tunnel.
The physical network equipment comprises an OpenFlow switch, a switch for access or convergence and a VXLAN gateway.
The invention also provides a data center fault alarm signal positioning method, which adopts the data center fault alarm signal positioning system and comprises the following steps:
The data center fault alarm signal positioning system is deployed on a physical server or a virtual machine;
After the fault occurs, checking the trend and the loss problem of the data packet through the flow analysis module and the flow analysis module;
And when the protocol analysis and the traffic analysis do not trace back to the fault source, performing fault investigation of the physical network equipment.
Wherein, in the step of deploying the data center fault alarm signal positioning system on a physical server or a virtual machine:
When the data center fault alarm signal positioning system is deployed, a network channel with an SDN network is required to be opened on the network, or the data center fault alarm signal positioning system is directly deployed in the SDN network; after deployment is completed, a firewall is arranged for isolation, and remote access of an external network is limited.
After the fault occurs, through the flow analysis module and the flow analysis module, the data packet trend and the loss problem are checked in the steps of: when the protocol analysis module performs VTEP fault detection, the VXLAN message flow direction is taken as a main line, and the trend and the loss problem of the data packet are checked.
After the fault occurs, through the flow analysis module and the flow analysis module, the data packet trend and the loss problem are checked in the steps of: the flow analysis module analyzes from the dimensionalities of the load, the time delay, the packet loss rate and the flow distribution, and achieves fault positioning.
According to the data center fault alarm signal positioning method and system, the system is deployed on a physical server or a virtual machine, the protocol analysis module is used for analyzing the trend and the loss problem of the data packet by taking the flow table analysis of the switch as a main line, the flow analysis module is used for analyzing the statistical information in the statistical domain to realize fault positioning, the equipment analysis module is used for checking physical network equipment to check the actual condition of the data packet, and the system replaces the splitting analysis of the end point of the network equipment by the end-to-end concept through the protocol, the flow and the equipment to develop the multi-dimensional network fault positioning analysis, so that the network fault positioning speed is accelerated, and the operation and maintenance fault positioning pressure can be well relieved.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a functional block diagram of a data center fault alarm signal locating system provided by the present invention.
FIG. 2 is a flow chart of steps of a method for locating a data center fault alarm signal provided by the invention.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative and intended to explain the present invention and should not be construed as limiting the invention.
Referring to fig. 1, the invention provides a method and a system for positioning a fault alarm signal of a data center, wherein the system for positioning the fault alarm signal of the data center comprises a protocol analysis module, a flow analysis module, an equipment analysis module, a management interface and an SDN network architecture, wherein the protocol analysis module, the flow analysis module and the equipment analysis module are all connected with the SDN network architecture through the management interface, and the management interface comprises SNMP, netconf and FTP interfaces; the SDN network architecture is responsible for resource allocation, redundancy management, error management and elastic adjustment, and realizes automatic opening and deployment of network resources in cloud services; the protocol analysis module is used for checking the trend and the loss problem of the data packet by taking the analysis of the switch flow table as a main line; the flow analysis module analyzes through statistical information in a statistical domain to realize fault location; the device analysis module checks the actual condition of the data packet by checking the physical network device; the switch is an OpenFlow switch and comprises a flow table, a group table and an OpenFlow channel connected with the SND controller, and the vSwitch is used as a VXLAN tunnel; the physical network device comprises an OpenFlow switch, a switch for access or convergence and a VXLAN gateway.
In this embodiment, the system is deployed on a physical server or a virtual machine, the protocol analysis module uses the analysis of the switch flow table as a main line to check the trend and loss problem of the data packet, the flow analysis module analyzes the statistical information in the statistical domain to realize fault location, the device analysis module checks the physical network device to check the actual condition of the data packet, and performs multi-dimensional network fault location analysis by the protocol, the flow and the device, and the system replaces the splitting analysis of the end point of the network device with the end-to-end concept, thereby accelerating the network fault location rate and better relieving the operation and maintenance fault location pressure.
Furthermore, the data center fault alarm signal positioning system also comprises a third party adapter interface, and the third party adapter interface is used for docking the third party system and supporting protocols such as SNMP, FTP and the like.
In this embodiment, the application range of the system is enlarged through the third party adaptation interface.
Further, the SDN network architecture includes an application layer, an orchestration layer, and a control layer, where the application layer is connected to the control layer, and the orchestration layer is connected to the control layer; the application layer comprises a cloud management platform, an SDN application program and an SDN management program, and SDN network management is carried out through software; the arrangement layer distributes resources, redundancy management, error management and elastic adjustment through the SDN arrangement device to realize automatic opening and deployment of network resources in cloud service, and the control layer ensures that the intelligent network meets the requirements of the cloud service on the network resources through flow control by the SDN controller.
In this embodiment, the application layer, the orchestration layer, and the control layer construct the SDN network architecture, and under the SDN networking architecture, the data center core protocol is the OpenFlow protocol. In addition, VXLAN networking is also the most common technology. Compared with VLAN networking, VXLAN networking breaks through the limit of 4000+ subnets, and has higher expansibility because the VXLAN protocol is erected on the UDP protocol. Therefore, in VDC fault localization based on SDN technology, protocol analysis based on OpenFlow and VXLAN is the preferred solution. The fault locating technology based on protocol analysis focuses on fault locating on two protocol carriers of an OpenFlow switch and VXLAN and VTEP.
Further, the SDN network architecture further includes a forwarding layer, where the forwarding layer is connected to the application layer, and the forwarding layer implements connection of a physical layer through SDN forwarding devices.
In this embodiment, the forwarding layer is used to connect the physical layer, so that the forwarding layer is convenient to deploy to the physical server.
Referring to fig. 2, the invention further provides a data center fault alarm signal positioning method, which adopts the data center fault alarm signal positioning system, and comprises the following steps:
S1: the data center fault alarm signal positioning system is deployed on a physical server or a virtual machine;
S2: after the fault occurs, checking the trend and the loss problem of the data packet through the flow analysis module and the flow analysis module;
S3: and when the protocol analysis and the traffic analysis do not trace back to the fault source, performing fault investigation of the physical network equipment.
In step S1, when the data center fault alarm signal positioning system is deployed, a network channel with an SDN network needs to be opened on the network, or the data center fault alarm signal positioning system is directly deployed in the SDN network; after deployment is completed, a firewall is arranged for isolation, and remote access of an external network is limited.
In step S2, the OpenFlow switch, i.e. a switch supporting the OpenFlow protocol, includes a flow table, a group table, and an OpenFlow channel connected to the SDN controller. In the existing commercial products, the OpenFlow switch can be an Openvswitch or a hybrid physical OpenFlow switch after the traditional switch is modified, and fault investigation and positioning take OpenFlow switch flow table analysis as a main line to check the trend and loss problems of data packets; VXLAN VTEP is the network device that encapsulates and decapsulates VXLAN. In data center networking, there are generally three roles: VXLAN VTEP, VXLAN GW, VXLAN IP GW. VTEP is the device directly connected to the Virtual Machine (VM) responsible for the encapsulation and decapsulation of the VXLAN enclosed by the original ethernet. The VXLAN GW converts the VXLAN message into a corresponding traditional two-layer network and sends the corresponding traditional two-layer network to the traditional Ethernet, and the VXLAN GW is suitable for two-layer interconnection of a server and a remote server in the VXLAN network. The VXLAN IP GW converts the VXLAN message into a traditional three-layer message and sends the traditional three-layer message to the IP network, is suitable for three-layer mutual access between a server and a remote terminal in the VXLAN network, and is also used for intercommunication of different VXLAN networks. The VTEP fault detection takes VXLAN message flow direction as a main line to check the trend and the loss problem of the data packet; the OpenFlow switch supports the OpenFlow protocol, and processes (forwards, loses, buffers) traffic according to flow table rules. The flow table is divided into three parts: header field, statistics field, action field. The statistical domain stores basic statistical information of the message matched with the flow table item, and comprises flow table, flow, interface and queue statistical information, and the value of the statistical item is automatically updated when the OpenFlow switch operates; fault location can be analyzed from the dimensions of load, delay, packet loss rate, traffic distribution, etc.
In step S3, in the current VDC network networking based on SDN technology, there are physical network devices, such as a physical OpenFlow switch, a switch for access or aggregation, a VXLAN gateway, etc.; the physical equipment is used for checking the actual condition of the interface data packet and the packet forwarding routing state, which are also a direction of fault location, and generally can determine the hardware or software fault of the physical equipment; by means of protocols, flow, equipment and multi-dimensional network fault location analysis, the system replaces the fracturing analysis of the end points of the network equipment by the end-to-end concept, the network fault location speed is accelerated, and the operation and maintenance fault location pressure can be well relieved.
The above disclosure is only a preferred embodiment of the present invention, and it should be understood that the scope of the invention is not limited thereto, and those skilled in the art will appreciate that all or part of the procedures described above can be performed according to the equivalent changes of the claims, and still fall within the scope of the present invention.
Claims (3)
1. A data center fault alarm signal positioning system is characterized in that,
The data center fault alarm signal positioning system comprises a protocol analysis module, a flow analysis module, a device analysis module, a management interface and an SDN network architecture, wherein the protocol analysis module, the flow analysis module and the device analysis module are all connected with the SDN network architecture through the management interface, and the management interface comprises SNMP, netconf and an FTP interface;
the SDN network architecture is responsible for resource allocation, redundancy management, error management and elastic adjustment, and realizes automatic opening and deployment of network resources in cloud services;
the protocol analysis module is used for checking the trend and the loss problem of the data packet by taking the analysis of the switch flow table as a main line;
the flow analysis module analyzes through statistical information in a statistical domain to realize fault location;
The device analysis module checks the actual condition of the data packet by checking the physical network device;
the data center fault alarm signal positioning system also comprises a third party adapter interface, wherein the third party adapter interface is used for docking a third party system and supporting SNMP and FTP protocols;
The SDN network architecture comprises an application layer, an arrangement layer and a control layer, wherein the application layer is connected with the control layer, and the arrangement layer is connected with the control layer; the application layer comprises a cloud management platform, an SDN application program and an SDN management program, and SDN network management is carried out through software; the arrangement layer distributes resources, redundancy management, error management and elastic adjustment through the SDN arrangement device to realize automatic opening and deployment of network resources in cloud service, and the control layer ensures that the intelligent network meets the requirements of the cloud service on the network resources through flow control by the SDN controller;
the data center fault alarm signal positioning system comprises the following steps:
The data center fault alarm signal positioning system is deployed on a physical server or a virtual machine;
after the fault occurs, checking the trend and the loss problem of the data packet through the flow analysis module and the protocol analysis module;
when the protocol analysis and the flow analysis do not trace back to the fault source, performing fault troubleshooting of the physical network equipment;
in the step of deploying the data center fault alarm signal positioning system on a physical server or virtual machine:
When the data center fault alarm signal positioning system is deployed, a network channel with an SDN network is required to be opened on the network, or the data center fault alarm signal positioning system is directly deployed in the SDN network; after deployment is completed, setting a firewall to isolate, and limiting an external network to remotely access;
the switch is an OpenFlow switch and comprises a flow table, a group table and an OpenFlow channel connected with the SND controller, and the vSwitch is used as a VXLAN tunnel; in the protocol analysis process, performing fault positioning based on protocols of OpenFlow and VXLAN;
after the fault occurs, through the flow analysis module and the protocol analysis module, the steps of checking the trend and the loss problem of the data packet are as follows:
The flow analysis module analyzes from the dimensionalities of the load, the time delay, the packet loss rate and the flow distribution, and achieves fault positioning.
2. A data center fault alert signal location system as claimed in claim 1, wherein,
The physical network device comprises an OpenFlow switch, a switch for access or convergence and a VXLAN gateway.
3. The data center fault alert signal location system of claim 1, wherein after a fault occurs, the steps of checking packet trends and loss problems by the traffic analysis module and the protocol analysis module:
When the protocol analysis module performs VTEP fault detection, the VXLAN message flow direction is taken as a main line, and the trend and the loss problem of the data packet are checked.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111586005.3A CN114338366B (en) | 2021-12-20 | 2021-12-20 | Data center fault alarm signal positioning method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111586005.3A CN114338366B (en) | 2021-12-20 | 2021-12-20 | Data center fault alarm signal positioning method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114338366A CN114338366A (en) | 2022-04-12 |
CN114338366B true CN114338366B (en) | 2024-10-22 |
Family
ID=81054406
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111586005.3A Active CN114338366B (en) | 2021-12-20 | 2021-12-20 | Data center fault alarm signal positioning method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114338366B (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106130767A (en) * | 2016-09-23 | 2016-11-16 | 深圳灵动智网科技有限公司 | The system and method that a kind of service path failure monitoring and fault solve |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7360124B2 (en) * | 2005-02-09 | 2008-04-15 | Viasat Geo-Technologie Inc. | Autonomous network fault detection and management system |
CN106027626A (en) * | 2016-05-12 | 2016-10-12 | 赛特斯信息科技股份有限公司 | SDN-based system for realizing virtualization data center |
CN107819596B (en) * | 2016-09-12 | 2022-07-29 | 中兴通讯股份有限公司 | SDN network fault diagnosis method, device and system |
CN110113205B (en) * | 2019-05-06 | 2021-07-30 | 南京大学 | A network troubleshooting system based on software-defined network technology and its working method |
CN111147516B (en) * | 2019-12-31 | 2020-11-24 | 中南民族大学 | SDN-based dynamic interconnection and intelligent routing decision system and method for security equipment |
CN112769632A (en) * | 2020-11-30 | 2021-05-07 | 锐捷网络股份有限公司 | Method and system for detecting network fault of data center |
CN112787959B (en) * | 2020-12-03 | 2023-12-26 | 观脉科技(北京)有限公司 | Flow scheduling method and system |
-
2021
- 2021-12-20 CN CN202111586005.3A patent/CN114338366B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106130767A (en) * | 2016-09-23 | 2016-11-16 | 深圳灵动智网科技有限公司 | The system and method that a kind of service path failure monitoring and fault solve |
Also Published As
Publication number | Publication date |
---|---|
CN114338366A (en) | 2022-04-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9544182B2 (en) | Monitoring gateway systems and methods for openflow type networks | |
CN106612233B (en) | A multi-channel network switching method and system | |
CN102984057B (en) | A kind of Multi-service integration dual-redundancy network system | |
US20160248600A1 (en) | Virtual cable modem termination system | |
US8787396B2 (en) | Centralized control and management planes for different independent switching domains | |
KR20140072343A (en) | Method for handling fault in softwate defined networking networks | |
EP2553870B1 (en) | An operations, administrations and management proxy and a method for handling operations, administrations and management messages | |
CN107104832B (en) | Method and equipment for automatically discovering cross-node service topology on transoceanic multiplexing section ring network | |
CN101883117B (en) | Interface business focuses on method and system | |
CN102326370B (en) | Message processing method, apparatus and system | |
CN105704068B (en) | Service mixing centralized processing method and device | |
CN113630318B (en) | Message transmission method and frame type communication equipment | |
CN114338366B (en) | Data center fault alarm signal positioning method and system | |
WO2008097105A1 (en) | Methods, systems and apparatus for monitoring and/or generating communications in a communications network | |
Wang et al. | A SDN-based heterogeneous networking scheme for profinet and Modbus Networks | |
Mustafa et al. | Using SDN to enhance cyber resiliency in IEC 61850-based substation OT networks | |
CN106533771B (en) | Network equipment and control information transmission method | |
US20060288092A1 (en) | Xml over tcp management protocol with tunneled proxy support and connection management | |
CN107835109B (en) | Method and system for testing packet transport network defined by software | |
CN111885433B (en) | Network system, method and equipment capable of realizing end-to-end monitoring | |
CN114039810B (en) | Flexible automatic control system based on Ethernet | |
CN1426169A (en) | Method for improving route repeat liability of access server | |
CN106888105A (en) | A kind of three layers of discovery method and device of virtual link end to end | |
EP3684011B1 (en) | Method for an improved and simplified operation and architecture of a central office point of delivery within a broadband access network of a telecommunications network, for the enhanced execution of network attachment tasks, further functional or configuration tasks within the central office point of delivery, telecommunications system, program and computer-readable medium | |
CN113037622A (en) | System and method for preventing BFD oscillation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |