[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN114338366B - Data center fault alarm signal positioning method and system - Google Patents

Data center fault alarm signal positioning method and system Download PDF

Info

Publication number
CN114338366B
CN114338366B CN202111586005.3A CN202111586005A CN114338366B CN 114338366 B CN114338366 B CN 114338366B CN 202111586005 A CN202111586005 A CN 202111586005A CN 114338366 B CN114338366 B CN 114338366B
Authority
CN
China
Prior art keywords
analysis module
network
fault
data center
alarm signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111586005.3A
Other languages
Chinese (zh)
Other versions
CN114338366A (en
Inventor
梁晓东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Dingxinwang Technology Co ltd
Original Assignee
Nanjing Dingxinwang Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Dingxinwang Technology Co ltd filed Critical Nanjing Dingxinwang Technology Co ltd
Priority to CN202111586005.3A priority Critical patent/CN114338366B/en
Publication of CN114338366A publication Critical patent/CN114338366A/en
Application granted granted Critical
Publication of CN114338366B publication Critical patent/CN114338366B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention relates to the technical field of fault positioning, in particular to a data center fault alarm signal positioning method and system; the data center fault alarm signal positioning system comprises a protocol analysis module, a flow analysis module, a device analysis module, a management interface and an SDN network architecture, wherein the management interface comprises an SNMP (simple network management protocol), a Netconf (network conf) and an FTP (file transfer protocol) interface, the protocol analysis module, the flow analysis module and the device analysis module are all connected with the SDN network architecture through the management interface, and the system is deployed on a physical server or a virtual machine to perform multidimensional network fault positioning analysis through protocols, flow and devices, and replaces the splitting analysis of network equipment endpoints by an end-to-end concept, so that the network fault positioning speed is accelerated, and the operation and maintenance fault positioning pressure can be well relieved.

Description

Data center fault alarm signal positioning method and system
Technical Field
The invention relates to the technical field of fault positioning, in particular to a data center fault alarm signal positioning method and system.
Background
With the rapid increase of the number of cloud service users and service contents, the scale of a data center is also larger and larger, the number of contained servers is exponentially increasing, and the data exchange volume is also increased in a super-linear mode. Within a data center, fiber optic communication technology is used to connect a large number of switches and servers, providing an efficient solution for high capacity, high performance, scalable, survivable services of the data center. However, since the optical fiber is easily damaged, once the optical fiber link in the network is damaged, all traffic on the link is interrupted. The data center optical network adopts a parallel mode to transmit data, and the transmission rate is quite high, so that even if only one optical fiber link is destroyed, a large amount of service interruption and data loss can be caused.
The traditional data center is not separated from the control and forwarding in architecture, the control function and the forwarding function are concentrated in the same network equipment, and the whole network is fixed, inconvenient to adjust and incapable of being controlled in a centralized way.
Disclosure of Invention
The invention aims to provide a data center fault alarm signal positioning method and system, and aims to solve the technical problems that a data center in the prior art is not separated in structure from forwarding, a control function and a forwarding function are concentrated in the same network equipment, and the whole network is fixed, inconvenient to adjust and incapable of being controlled in a centralized manner.
In order to achieve the above objective, the present invention provides a data center fault alarm signal positioning system, where the data center fault alarm signal positioning system includes a protocol analysis module, a flow analysis module, an equipment analysis module, a management interface and an SDN network architecture, where the protocol analysis module, the flow analysis module and the equipment analysis module are all connected with the SDN network architecture through the management interface, and the management interface includes SNMP, netconf and FTP interfaces;
the SDN network architecture is responsible for resource allocation, redundancy management, error management and elastic adjustment, and realizes automatic opening and deployment of network resources in cloud services;
the protocol analysis module is used for checking the trend and the loss problem of the data packet by taking the analysis of the switch flow table as a main line;
the flow analysis module analyzes through statistical information in a statistical domain to realize fault location;
the device analysis module checks the actual condition of the data packet by checking the physical network device.
The switch is an OpenFlow switch, and comprises a flow table, a group table and an OpenFlow channel connected with the SND controller, and the vSwitch is used as a VXLAN tunnel.
The physical network equipment comprises an OpenFlow switch, a switch for access or convergence and a VXLAN gateway.
The invention also provides a data center fault alarm signal positioning method, which adopts the data center fault alarm signal positioning system and comprises the following steps:
The data center fault alarm signal positioning system is deployed on a physical server or a virtual machine;
After the fault occurs, checking the trend and the loss problem of the data packet through the flow analysis module and the flow analysis module;
And when the protocol analysis and the traffic analysis do not trace back to the fault source, performing fault investigation of the physical network equipment.
Wherein, in the step of deploying the data center fault alarm signal positioning system on a physical server or a virtual machine:
When the data center fault alarm signal positioning system is deployed, a network channel with an SDN network is required to be opened on the network, or the data center fault alarm signal positioning system is directly deployed in the SDN network; after deployment is completed, a firewall is arranged for isolation, and remote access of an external network is limited.
After the fault occurs, through the flow analysis module and the flow analysis module, the data packet trend and the loss problem are checked in the steps of: when the protocol analysis module performs VTEP fault detection, the VXLAN message flow direction is taken as a main line, and the trend and the loss problem of the data packet are checked.
After the fault occurs, through the flow analysis module and the flow analysis module, the data packet trend and the loss problem are checked in the steps of: the flow analysis module analyzes from the dimensionalities of the load, the time delay, the packet loss rate and the flow distribution, and achieves fault positioning.
According to the data center fault alarm signal positioning method and system, the system is deployed on a physical server or a virtual machine, the protocol analysis module is used for analyzing the trend and the loss problem of the data packet by taking the flow table analysis of the switch as a main line, the flow analysis module is used for analyzing the statistical information in the statistical domain to realize fault positioning, the equipment analysis module is used for checking physical network equipment to check the actual condition of the data packet, and the system replaces the splitting analysis of the end point of the network equipment by the end-to-end concept through the protocol, the flow and the equipment to develop the multi-dimensional network fault positioning analysis, so that the network fault positioning speed is accelerated, and the operation and maintenance fault positioning pressure can be well relieved.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a functional block diagram of a data center fault alarm signal locating system provided by the present invention.
FIG. 2 is a flow chart of steps of a method for locating a data center fault alarm signal provided by the invention.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative and intended to explain the present invention and should not be construed as limiting the invention.
Referring to fig. 1, the invention provides a method and a system for positioning a fault alarm signal of a data center, wherein the system for positioning the fault alarm signal of the data center comprises a protocol analysis module, a flow analysis module, an equipment analysis module, a management interface and an SDN network architecture, wherein the protocol analysis module, the flow analysis module and the equipment analysis module are all connected with the SDN network architecture through the management interface, and the management interface comprises SNMP, netconf and FTP interfaces; the SDN network architecture is responsible for resource allocation, redundancy management, error management and elastic adjustment, and realizes automatic opening and deployment of network resources in cloud services; the protocol analysis module is used for checking the trend and the loss problem of the data packet by taking the analysis of the switch flow table as a main line; the flow analysis module analyzes through statistical information in a statistical domain to realize fault location; the device analysis module checks the actual condition of the data packet by checking the physical network device; the switch is an OpenFlow switch and comprises a flow table, a group table and an OpenFlow channel connected with the SND controller, and the vSwitch is used as a VXLAN tunnel; the physical network device comprises an OpenFlow switch, a switch for access or convergence and a VXLAN gateway.
In this embodiment, the system is deployed on a physical server or a virtual machine, the protocol analysis module uses the analysis of the switch flow table as a main line to check the trend and loss problem of the data packet, the flow analysis module analyzes the statistical information in the statistical domain to realize fault location, the device analysis module checks the physical network device to check the actual condition of the data packet, and performs multi-dimensional network fault location analysis by the protocol, the flow and the device, and the system replaces the splitting analysis of the end point of the network device with the end-to-end concept, thereby accelerating the network fault location rate and better relieving the operation and maintenance fault location pressure.
Furthermore, the data center fault alarm signal positioning system also comprises a third party adapter interface, and the third party adapter interface is used for docking the third party system and supporting protocols such as SNMP, FTP and the like.
In this embodiment, the application range of the system is enlarged through the third party adaptation interface.
Further, the SDN network architecture includes an application layer, an orchestration layer, and a control layer, where the application layer is connected to the control layer, and the orchestration layer is connected to the control layer; the application layer comprises a cloud management platform, an SDN application program and an SDN management program, and SDN network management is carried out through software; the arrangement layer distributes resources, redundancy management, error management and elastic adjustment through the SDN arrangement device to realize automatic opening and deployment of network resources in cloud service, and the control layer ensures that the intelligent network meets the requirements of the cloud service on the network resources through flow control by the SDN controller.
In this embodiment, the application layer, the orchestration layer, and the control layer construct the SDN network architecture, and under the SDN networking architecture, the data center core protocol is the OpenFlow protocol. In addition, VXLAN networking is also the most common technology. Compared with VLAN networking, VXLAN networking breaks through the limit of 4000+ subnets, and has higher expansibility because the VXLAN protocol is erected on the UDP protocol. Therefore, in VDC fault localization based on SDN technology, protocol analysis based on OpenFlow and VXLAN is the preferred solution. The fault locating technology based on protocol analysis focuses on fault locating on two protocol carriers of an OpenFlow switch and VXLAN and VTEP.
Further, the SDN network architecture further includes a forwarding layer, where the forwarding layer is connected to the application layer, and the forwarding layer implements connection of a physical layer through SDN forwarding devices.
In this embodiment, the forwarding layer is used to connect the physical layer, so that the forwarding layer is convenient to deploy to the physical server.
Referring to fig. 2, the invention further provides a data center fault alarm signal positioning method, which adopts the data center fault alarm signal positioning system, and comprises the following steps:
S1: the data center fault alarm signal positioning system is deployed on a physical server or a virtual machine;
S2: after the fault occurs, checking the trend and the loss problem of the data packet through the flow analysis module and the flow analysis module;
S3: and when the protocol analysis and the traffic analysis do not trace back to the fault source, performing fault investigation of the physical network equipment.
In step S1, when the data center fault alarm signal positioning system is deployed, a network channel with an SDN network needs to be opened on the network, or the data center fault alarm signal positioning system is directly deployed in the SDN network; after deployment is completed, a firewall is arranged for isolation, and remote access of an external network is limited.
In step S2, the OpenFlow switch, i.e. a switch supporting the OpenFlow protocol, includes a flow table, a group table, and an OpenFlow channel connected to the SDN controller. In the existing commercial products, the OpenFlow switch can be an Openvswitch or a hybrid physical OpenFlow switch after the traditional switch is modified, and fault investigation and positioning take OpenFlow switch flow table analysis as a main line to check the trend and loss problems of data packets; VXLAN VTEP is the network device that encapsulates and decapsulates VXLAN. In data center networking, there are generally three roles: VXLAN VTEP, VXLAN GW, VXLAN IP GW. VTEP is the device directly connected to the Virtual Machine (VM) responsible for the encapsulation and decapsulation of the VXLAN enclosed by the original ethernet. The VXLAN GW converts the VXLAN message into a corresponding traditional two-layer network and sends the corresponding traditional two-layer network to the traditional Ethernet, and the VXLAN GW is suitable for two-layer interconnection of a server and a remote server in the VXLAN network. The VXLAN IP GW converts the VXLAN message into a traditional three-layer message and sends the traditional three-layer message to the IP network, is suitable for three-layer mutual access between a server and a remote terminal in the VXLAN network, and is also used for intercommunication of different VXLAN networks. The VTEP fault detection takes VXLAN message flow direction as a main line to check the trend and the loss problem of the data packet; the OpenFlow switch supports the OpenFlow protocol, and processes (forwards, loses, buffers) traffic according to flow table rules. The flow table is divided into three parts: header field, statistics field, action field. The statistical domain stores basic statistical information of the message matched with the flow table item, and comprises flow table, flow, interface and queue statistical information, and the value of the statistical item is automatically updated when the OpenFlow switch operates; fault location can be analyzed from the dimensions of load, delay, packet loss rate, traffic distribution, etc.
In step S3, in the current VDC network networking based on SDN technology, there are physical network devices, such as a physical OpenFlow switch, a switch for access or aggregation, a VXLAN gateway, etc.; the physical equipment is used for checking the actual condition of the interface data packet and the packet forwarding routing state, which are also a direction of fault location, and generally can determine the hardware or software fault of the physical equipment; by means of protocols, flow, equipment and multi-dimensional network fault location analysis, the system replaces the fracturing analysis of the end points of the network equipment by the end-to-end concept, the network fault location speed is accelerated, and the operation and maintenance fault location pressure can be well relieved.
The above disclosure is only a preferred embodiment of the present invention, and it should be understood that the scope of the invention is not limited thereto, and those skilled in the art will appreciate that all or part of the procedures described above can be performed according to the equivalent changes of the claims, and still fall within the scope of the present invention.

Claims (3)

1. A data center fault alarm signal positioning system is characterized in that,
The data center fault alarm signal positioning system comprises a protocol analysis module, a flow analysis module, a device analysis module, a management interface and an SDN network architecture, wherein the protocol analysis module, the flow analysis module and the device analysis module are all connected with the SDN network architecture through the management interface, and the management interface comprises SNMP, netconf and an FTP interface;
the SDN network architecture is responsible for resource allocation, redundancy management, error management and elastic adjustment, and realizes automatic opening and deployment of network resources in cloud services;
the protocol analysis module is used for checking the trend and the loss problem of the data packet by taking the analysis of the switch flow table as a main line;
the flow analysis module analyzes through statistical information in a statistical domain to realize fault location;
The device analysis module checks the actual condition of the data packet by checking the physical network device;
the data center fault alarm signal positioning system also comprises a third party adapter interface, wherein the third party adapter interface is used for docking a third party system and supporting SNMP and FTP protocols;
The SDN network architecture comprises an application layer, an arrangement layer and a control layer, wherein the application layer is connected with the control layer, and the arrangement layer is connected with the control layer; the application layer comprises a cloud management platform, an SDN application program and an SDN management program, and SDN network management is carried out through software; the arrangement layer distributes resources, redundancy management, error management and elastic adjustment through the SDN arrangement device to realize automatic opening and deployment of network resources in cloud service, and the control layer ensures that the intelligent network meets the requirements of the cloud service on the network resources through flow control by the SDN controller;
the data center fault alarm signal positioning system comprises the following steps:
The data center fault alarm signal positioning system is deployed on a physical server or a virtual machine;
after the fault occurs, checking the trend and the loss problem of the data packet through the flow analysis module and the protocol analysis module;
when the protocol analysis and the flow analysis do not trace back to the fault source, performing fault troubleshooting of the physical network equipment;
in the step of deploying the data center fault alarm signal positioning system on a physical server or virtual machine:
When the data center fault alarm signal positioning system is deployed, a network channel with an SDN network is required to be opened on the network, or the data center fault alarm signal positioning system is directly deployed in the SDN network; after deployment is completed, setting a firewall to isolate, and limiting an external network to remotely access;
the switch is an OpenFlow switch and comprises a flow table, a group table and an OpenFlow channel connected with the SND controller, and the vSwitch is used as a VXLAN tunnel; in the protocol analysis process, performing fault positioning based on protocols of OpenFlow and VXLAN;
after the fault occurs, through the flow analysis module and the protocol analysis module, the steps of checking the trend and the loss problem of the data packet are as follows:
The flow analysis module analyzes from the dimensionalities of the load, the time delay, the packet loss rate and the flow distribution, and achieves fault positioning.
2. A data center fault alert signal location system as claimed in claim 1, wherein,
The physical network device comprises an OpenFlow switch, a switch for access or convergence and a VXLAN gateway.
3. The data center fault alert signal location system of claim 1, wherein after a fault occurs, the steps of checking packet trends and loss problems by the traffic analysis module and the protocol analysis module:
When the protocol analysis module performs VTEP fault detection, the VXLAN message flow direction is taken as a main line, and the trend and the loss problem of the data packet are checked.
CN202111586005.3A 2021-12-20 2021-12-20 Data center fault alarm signal positioning method and system Active CN114338366B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111586005.3A CN114338366B (en) 2021-12-20 2021-12-20 Data center fault alarm signal positioning method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111586005.3A CN114338366B (en) 2021-12-20 2021-12-20 Data center fault alarm signal positioning method and system

Publications (2)

Publication Number Publication Date
CN114338366A CN114338366A (en) 2022-04-12
CN114338366B true CN114338366B (en) 2024-10-22

Family

ID=81054406

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111586005.3A Active CN114338366B (en) 2021-12-20 2021-12-20 Data center fault alarm signal positioning method and system

Country Status (1)

Country Link
CN (1) CN114338366B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106130767A (en) * 2016-09-23 2016-11-16 深圳灵动智网科技有限公司 The system and method that a kind of service path failure monitoring and fault solve

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7360124B2 (en) * 2005-02-09 2008-04-15 Viasat Geo-Technologie Inc. Autonomous network fault detection and management system
CN106027626A (en) * 2016-05-12 2016-10-12 赛特斯信息科技股份有限公司 SDN-based system for realizing virtualization data center
CN107819596B (en) * 2016-09-12 2022-07-29 中兴通讯股份有限公司 SDN network fault diagnosis method, device and system
CN110113205B (en) * 2019-05-06 2021-07-30 南京大学 A network troubleshooting system based on software-defined network technology and its working method
CN111147516B (en) * 2019-12-31 2020-11-24 中南民族大学 SDN-based dynamic interconnection and intelligent routing decision system and method for security equipment
CN112769632A (en) * 2020-11-30 2021-05-07 锐捷网络股份有限公司 Method and system for detecting network fault of data center
CN112787959B (en) * 2020-12-03 2023-12-26 观脉科技(北京)有限公司 Flow scheduling method and system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106130767A (en) * 2016-09-23 2016-11-16 深圳灵动智网科技有限公司 The system and method that a kind of service path failure monitoring and fault solve

Also Published As

Publication number Publication date
CN114338366A (en) 2022-04-12

Similar Documents

Publication Publication Date Title
US9544182B2 (en) Monitoring gateway systems and methods for openflow type networks
CN106612233B (en) A multi-channel network switching method and system
CN102984057B (en) A kind of Multi-service integration dual-redundancy network system
US20160248600A1 (en) Virtual cable modem termination system
US8787396B2 (en) Centralized control and management planes for different independent switching domains
KR20140072343A (en) Method for handling fault in softwate defined networking networks
EP2553870B1 (en) An operations, administrations and management proxy and a method for handling operations, administrations and management messages
CN107104832B (en) Method and equipment for automatically discovering cross-node service topology on transoceanic multiplexing section ring network
CN101883117B (en) Interface business focuses on method and system
CN102326370B (en) Message processing method, apparatus and system
CN105704068B (en) Service mixing centralized processing method and device
CN113630318B (en) Message transmission method and frame type communication equipment
CN114338366B (en) Data center fault alarm signal positioning method and system
WO2008097105A1 (en) Methods, systems and apparatus for monitoring and/or generating communications in a communications network
Wang et al. A SDN-based heterogeneous networking scheme for profinet and Modbus Networks
Mustafa et al. Using SDN to enhance cyber resiliency in IEC 61850-based substation OT networks
CN106533771B (en) Network equipment and control information transmission method
US20060288092A1 (en) Xml over tcp management protocol with tunneled proxy support and connection management
CN107835109B (en) Method and system for testing packet transport network defined by software
CN111885433B (en) Network system, method and equipment capable of realizing end-to-end monitoring
CN114039810B (en) Flexible automatic control system based on Ethernet
CN1426169A (en) Method for improving route repeat liability of access server
CN106888105A (en) A kind of three layers of discovery method and device of virtual link end to end
EP3684011B1 (en) Method for an improved and simplified operation and architecture of a central office point of delivery within a broadband access network of a telecommunications network, for the enhanced execution of network attachment tasks, further functional or configuration tasks within the central office point of delivery, telecommunications system, program and computer-readable medium
CN113037622A (en) System and method for preventing BFD oscillation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant