
CN107918573A - Automatic analysis system for SAS card fault information in Linux kernel logs - Google Patents

Automatic analysis system for SAS card fault information in Linux kernel logs

Info

Publication number
CN107918573A
Authority
CN
China
Prior art keywords
code
sas
sas card
module
linux kernel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711102483.6A
Other languages
Chinese (zh)
Inventor
徐亚洲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201711102483.6A
Publication of CN107918573A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/22 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2268 Logging of test results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/34 Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466 Performance evaluation by tracing or monitoring
    • G06F11/3476 Data logging

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The present application discloses an automatic analysis system for SAS card fault information in Linux kernel logs. Based on the SAS card error code reference document, the system implements in program form the process of manually analyzing SAS card error messages. It automatically resolves the cause of SAS card faults and thereby automates the diagnosis of SAS card error messages in Linux kernel logs. This reduces labor cost, improves the efficiency of SAS card fault analysis, and lets technical support personnel locate the cause of a fault conveniently.

Description

Automatic analysis system for SAS card fault information in Linux kernel logs
Technical field
The present invention relates to the field of server testing, and in particular to an automatic analysis system for SAS card fault information in Linux kernel logs.
Background
SAS RAID cards first appeared with Intel's worldwide launch of the Bensley server platform on May 26, 2006, which introduced new technologies such as FB-DIMM and SAS. SAS (Serial Attached SCSI) has three advantages: first, the new-generation interface increases the number of devices that can be installed; second, devices are simpler to install; finally, the point-to-point technology gives each device its own signal path, which ensures maximum bandwidth and a higher transmission speed. As SAS technology has come into wide use, the analysis of SAS card fault information has become particularly important. The official websites of the major hardware vendors all provide documents explaining SAS card fault codes, to guide customers in analyzing and locating the cause of a fault.
On a server fitted with a SAS RAID card, when the SAS card encounters a problem the Linux kernel prints a message of the following form:
“Jul 20 12:00:11 node21 kernel: [123432.553213] mpt2sas1: log_info(0x31120303): originator(PL), code(0x12), sub_code(0x0303)”. By consulting the vendor's official error code guide, the user can locate the cause of the problem. However, consulting the document is time-consuming and laborious, so the cause of a fault cannot be determined promptly and effectively.
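The three printed fields can be related to the raw log_info word with a short sketch. The patent does not state the bit layout; the code below assumes the layout used by the Linux mpt2sas driver (4-bit bus type, 4-bit originator, 8-bit code, 16-bit sub_code), which is consistent with the sample message above. The function name `decode_log_info` is hypothetical.

```python
def decode_log_info(log_info: int) -> dict:
    """Split a 32-bit log_info word into its fields.

    Assumed layout (not stated in the patent):
    bits 31-28 bus type, 27-24 originator, 23-16 code, 15-0 sub_code.
    """
    # Originator names as printed in the kernel message (assumption).
    originators = {0: "IOP", 1: "PL", 2: "IR"}
    originator = (log_info >> 24) & 0xF
    return {
        "bus_type": (log_info >> 28) & 0xF,
        "originator": originators.get(originator, "unknown"),
        "code": (log_info >> 16) & 0xFF,
        "sub_code": log_info & 0xFFFF,
    }


fields = decode_log_info(0x31120303)
print(fields["originator"], hex(fields["code"]), hex(fields["sub_code"]))
```

Under this assumption, the word 0x31120303 yields the same originator, code, and sub_code that the kernel prints in the sample message.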
To address this, the present application discloses an automatic analysis system for SAS card fault information in Linux kernel logs. It aims to reduce manual effort by turning the manual analysis process into a program, so that SAS card error messages in the log are diagnosed automatically.
Summary of the invention
Specifically, the application claims an automatic analysis system for SAS card fault information in Linux kernel logs, characterized in that the system comprises a system log SAS card information retrieval module, an error code analysis module, and a fault explanation output module. The system log SAS card information retrieval module traverses the system log according to the key phrases of the SAS card driver's error messages and, when it finds a matching entry, passes it to the error code analysis module. The error code analysis module implements the SAS card error code guide document in program form and locates the cause of the problem according to specific values. The fault explanation output module outputs the diagnosis produced by the error code analysis module to the terminal for display.
The automatic analysis system for SAS card fault information in Linux kernel logs as described above is further characterized in that the key phrases may be log_info, originator, code, and sub_code.
The automatic analysis system for SAS card fault information in Linux kernel logs as described above is further characterized in that the specific values are the three values originator, code, and sub_code.
The automatic analysis system for SAS card fault information in Linux kernel logs as described above is further characterized in that, to locate the cause of the problem, the error code analysis module defines a structure and, according to the different originator and code values, initializes multiple structure arrays into which the content of the SAS card error code document is translated and stored. It then obtains the log entry passed by the system log SAS card information retrieval module and parses out the originator, code, and sub_code values together with the time information. It then locates the specific structure array according to the originator and code values. Finally, it sends the resolved cause and the time information to the fault explanation output module.
Brief description of the drawings
Fig. 1 is a framework diagram of the SAS card fault log analysis of the present invention.
Detailed description
The present invention can be divided into three parts: the system log SAS card information retrieval module, the error code analysis module, and the fault explanation output module.
1. System log SAS card information retrieval module
The system log SAS card information retrieval module traverses the system log (/var/log/messages) according to the key phrases of the SAS card driver's error messages, "log_info, originator, code, sub_code", and passes each matching entry to the error code analysis module.
2. Error code analysis module
The error code analysis module implements the official SAS card error code guide document in program form and locates the cause of the problem according to the three values originator, code, and sub_code.
3. Fault explanation output module
The fault explanation output module outputs the diagnosis produced by the error code analysis module to the terminal for display.
The flow is shown in Fig. 1. Based on this framework, error code parsing of SAS card fault information can be implemented.
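The three-module framework can be sketched as a small pipeline. This is only an illustrative outline under stated assumptions, not the patented implementation: the names `KEYWORDS`, `retrieve`, and `run_pipeline` are hypothetical, the analysis step is passed in as a callable, and the output step defaults to printing to the terminal.

```python
from typing import Callable, Iterable, Iterator, Optional

# Key phrases named in the description.
KEYWORDS = ("log_info", "originator", "code", "sub_code")


def retrieve(lines: Iterable[str]) -> Iterator[str]:
    """Module 1: yield only the log lines that contain every key phrase."""
    for line in lines:
        if all(keyword in line for keyword in KEYWORDS):
            yield line.rstrip("\n")


def run_pipeline(
    lines: Iterable[str],
    analyze: Callable[[str], Optional[str]],
    output: Callable[[str], None] = print,
) -> int:
    """Pass each retrieved line to module 2 (analyze) and each non-empty
    result to module 3 (output). Returns the number of results emitted."""
    emitted = 0
    for line in retrieve(lines):
        result = analyze(line)
        if result is not None:
            output(result)
            emitted += 1
    return emitted
```

A caller could, for example, open /var/log/messages and pass the file object as `lines`, since iterating over a text file yields its lines one at a time.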
The specific implementation steps of the automatic analysis method of the present invention are illustrated below through one embodiment.
The present invention implements the parsing of SAS card fault error codes on the CentOS 7.1 operating system with kernel 3.10.0-229. The SAS card error code reference used as the standard is "SAS Generation 2 Error Codes Systems Engineering Note" V2.2 from Infineon Technologies Corp.
The system log SAS card information retrieval module traverses the system log /var/log/messages using the key phrases "log_info, originator, code, sub_code", finds log lines of the form "Jul 20 12:00:11 node21 kernel: [123432.553213] mpt2sas1: log_info(0x31120303): originator(PL), code(0x12), sub_code(0x0303)", and sends them to the error code analysis module.
The SAS card error code reference document explains originator, code, and sub_code in detail. The relation between originator and code is one-to-many, and so is the relation between code and sub_code. For example, originator = 0 denotes the IOP (I/O Processor) message codes; when code is 0x01, the sub_code values and the corresponding error explanations are as in the following table (Table 2-1), whose Description column gives the specific cause of the problem.
Table 1. SAS card error code resolution table
The error code analysis module defines the structure struct sas_code_msg_t, whose members include sub_code and description. The main flow is:
According to the different originator and code values, initialize multiple structure arrays of type struct sas_code_msg_t, and translate and store the content of the SAS card error code document in these arrays;
Obtain the log entry passed by the system log SAS card information retrieval module, and parse out the originator, code, and sub_code values together with the time information;
Locate the specific sas_code_msg_t structure array according to the originator and code values, then obtain the content of the corresponding description member according to sub_code;
Finally, send the resolved cause and the time information to the fault explanation output module.
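The flow above can be sketched as follows. The text describes a C-style struct; this sketch uses a Python dataclass as a stand-in, with one list of entries per (originator, code) pair. The names `SasCodeMsg`, `TABLES`, and `diagnose` are hypothetical, and the description strings are placeholders, not text from the vendor's error code document.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class SasCodeMsg:
    """Stand-in for struct sas_code_msg_t: a sub_code and its explanation."""
    sub_code: int
    description: str


# One array per (originator, code) pair. The descriptions are PLACEHOLDERS;
# the real text would be transcribed from the error code document.
TABLES: Dict[Tuple[str, int], List[SasCodeMsg]] = {
    ("IOP", 0x01): [SasCodeMsg(0x0000, "<placeholder for IOP/0x01/0x0000>")],
    ("PL", 0x12): [SasCodeMsg(0x0303, "<placeholder for PL/0x12/0x0303>")],
}


def diagnose(originator: str, code: int, sub_code: int, time: str) -> Optional[str]:
    """Locate the array by (originator, code), then the entry by sub_code.
    Returns the time and description, or None if the code is unknown."""
    table = TABLES.get((originator, code))
    if table is None:
        return None
    for entry in table:
        if entry.sub_code == sub_code:
            return f"{time}: {entry.description}"
    return None
```

Keying the arrays by the (originator, code) pair mirrors the one-to-many relations described in the text: each originator has several codes, and each code has several sub_code entries.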
The fault explanation output module receives the analysis result passed on by the error code analysis module and presents it on the terminal.
The technical solution of the present invention can also be applied to monitoring whether the Linux kernel log has produced SAS card fault information. Running in the operating system as a daemon, it detects fault messages as soon as a SAS card fails and provides the analysis result for the error code.
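One possible way to watch the log continuously is to poll an open log file for newly appended lines. The patent does not say how the daemon is built; the sketch below is an assumption, with the hypothetical names `follow`, `poll_interval`, and `should_stop`. It does not handle log rotation or partially written lines.

```python
import time
from typing import Callable, Iterator, TextIO


def follow(
    stream: TextIO,
    poll_interval: float = 1.0,
    should_stop: Callable[[], bool] = lambda: False,
) -> Iterator[str]:
    """Yield lines as they are appended to an open text stream.

    When no new line is available, sleep for poll_interval seconds and
    try again, until should_stop() returns True.
    """
    while not should_stop():
        line = stream.readline()
        if line:
            yield line
        else:
            time.sleep(poll_interval)
```

Each yielded line could then be filtered by the key phrases and passed to the error code analysis step, so that a diagnosis appears shortly after the kernel logs the fault.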
Obviously, the above is only a specific embodiment of the present invention. A person of ordinary skill in the art can obtain other technical solutions from the above embodiment without creative effort, and equivalent variations made within the scope of protection of the invention shall fall within that scope.
Based on the SAS card error code reference document, the present invention implements in program form the process of manually analyzing SAS card error messages. It automatically resolves the cause of SAS card faults and thereby automates the diagnosis of SAS card error messages in Linux kernel logs. This reduces labor cost, improves the efficiency of SAS card fault analysis, and lets technical support personnel locate the cause of a fault conveniently and efficiently.

Claims (4)

1. An automatic analysis system for SAS card disk array fault information in Linux kernel logs, characterized in that the system comprises: a system log SAS card information retrieval module, an error code analysis module, and a fault explanation output module; wherein the system log SAS card information retrieval module traverses the system log according to the key phrases of the SAS card driver's error messages and, when it finds a matching entry, passes it to the error code analysis module; the error code analysis module implements the SAS card error code guide document in program form and locates the cause of the problem according to specific values; and the fault explanation output module outputs the diagnosis produced by the error code analysis module to the terminal for display.
2. The automatic analysis system for SAS card disk array fault information in Linux kernel logs as claimed in claim 1, further characterized in that the key phrases may be log_info, originator, code, and sub_code.
3. The automatic analysis system for SAS card disk array fault information in Linux kernel logs as claimed in claim 2, further characterized in that the specific values are the three values originator, code, and sub_code.
4. The automatic analysis system for SAS card disk array fault information in Linux kernel logs as claimed in claim 3, further characterized in that, to locate the cause of the problem, the error code analysis module defines a structure and, according to the different originator and code values, initializes multiple structure arrays into which the content of the SAS card error code document is translated and stored; it then obtains the log entry passed by the system log SAS card information retrieval module and parses out the originator, code, and sub_code values together with the time information; it then locates the specific structure array according to the originator and code values; and finally it sends the resolved cause and the time information to the fault explanation output module.
CN201711102483.6A 2017-11-10 2017-11-10 Automatic analysis system for SAS card fault information in Linux kernel logs Pending CN107918573A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711102483.6A CN107918573A (en) 2017-11-10 2017-11-10 Automatic analysis system for SAS card fault information in Linux kernel logs

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711102483.6A CN107918573A (en) 2017-11-10 2017-11-10 Automatic analysis system for SAS card fault information in Linux kernel logs

Publications (1)

Publication Number Publication Date
CN107918573A true CN107918573A (en) 2018-04-17

Family

ID=61895312

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711102483.6A Pending CN107918573A (en) 2017-11-10 2017-11-10 Automatic analysis system for SAS card fault information in Linux kernel logs

Country Status (1)

Country Link
CN (1) CN107918573A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108920297A (en) * 2018-07-13 2018-11-30 郑州云海信息技术有限公司 A kind of method and system diagnosing PCIE error information in black box log
CN109582612A (en) * 2018-12-24 2019-04-05 郑州云海信息技术有限公司 A kind of device and its design, application method obtaining SAS card log
CN109684525A (en) * 2018-12-20 2019-04-26 Oppo(重庆)智能科技有限公司 Document display method, device, storage medium and test equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106250258A (en) * 2016-07-29 2016-12-21 北京云集智造科技有限公司 A kind of disk failure localization method and device
CN106886470A (en) * 2017-02-22 2017-06-23 郑州云海信息技术有限公司 A kind of fault detection method and system based on disk in linux

Similar Documents

Publication Publication Date Title
US8660995B2 (en) Flexible event data content management for relevant event and alert analysis within a distributed processing system
US7664986B2 (en) System and method for determining fault isolation in an enterprise computing system
US9419650B2 (en) Flexible event data content management for relevant event and alert analysis within a distributed processing system
US10037238B2 (en) System and method for encoding exception conditions included at a remediation database
US9256482B2 (en) Determining whether to send an alert in a distributed processing system
US20220050765A1 (en) Method for processing logs in a computer system for events identified as abnormal and revealing solutions, electronic device, and cloud server
CN108984332A (en) A kind of device and method of location-server delay machine failure
US11113142B2 (en) Early risk detection and management in a software-defined data center
CN107918573A (en) Automatic analysis system for SAS card fault information in Linux kernel logs
US20240020017A1 (en) Monitoring method and apparatus for electronic device, and electronic device
CN1960297A (en) System and method for monitoring device unit on network
CN109783315A (en) A kind of database platform automation method for inspecting and system
CN108108445A (en) A kind of data intelligence processing method and system
EP3264291A1 (en) Data block processing method and device
CN105005572A (en) Database mapping method and device
CN107247505B (en) Cloud server power supply blackbox design method easy to view
CN114390112A (en) Rail transit emergency disposal method and device, electronic equipment and storage medium
CN113590405A (en) Hard disk error detection method and device, storage medium and electronic device
US10986230B1 (en) Method and apparatus to capture, analyze, organize, and present support chat information
US11816210B2 (en) Risk-based alerting for computer security
CN116340115A (en) Method, system, equipment and medium for monitoring and early warning of error of front end of applet
CN116566810A (en) System exception handling method, device and equipment based on Spring MVC framework
CN115543707A (en) Hard disk fault detection method, system and device, storage medium and electronic device
Stearley et al. A {State-Machine} Approach to Disambiguating Supercomputer Event Logs
CN111639098A (en) Cloud log management method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20180417)