CN107918573A - The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records - Google Patents
The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records Download PDFInfo
- Publication number
- CN107918573A CN107918573A CN201711102483.6A CN201711102483A CN107918573A CN 107918573 A CN107918573 A CN 107918573A CN 201711102483 A CN201711102483 A CN 201711102483A CN 107918573 A CN107918573 A CN 107918573A
- Authority
- CN
- China
- Prior art keywords
- code
- sas
- sas card
- module
- linux kernel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2268—Logging of test results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3466—Performance evaluation by tracing or monitoring
- G06F11/3476—Data logging
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records of the present application.The system is based on SAS cards error code parsing document, the process of manual analysis SAS card error messages is realized in the form of program, the producing cause of SAS card failures can automatically be parsed, realize the automated diagnostic of the SAS card error messages to Linux kernel daily records, reduce human cost, the efficiency of analysis SAS card failures is improved, technical support personnel can conveniently position failure cause.
Description
Technical field
The present invention relates to server test field, and in particular to SAS card failure information in a kind of Linux kernel daily records
Automatic analysis system.
Background technology
The appearance of SAS RAID cards comes from the starting Bensley server platforms in Intel on May 26th, 2006 whole world, this is flat
The issue of platform brings the brand-new technology such as FB-DIMM, SAS.Use SAS (Serial Attached SCSI, serial SCSI skills
Art) advantage be that the interface of a new generation can improve the installation number of equipment, the installation of its secondary device is simpler, rearmost point
Technology to point allows the signal path of each equipment to ensure that maximum bandwidth, and the transmission speed of higher.With SAS technologies
Widely use, the analysis to SAS card failure information also it is particularly important that, major hardware vendor official website can all provide corresponding SAS cards
The explanation document of failure code, with this come instruct customer analysis position failure cause.
On the server of SAS RAID cards is mounted with, when SAS card generation problems, Linux kernel can print as follows
The information of form:
“Jul 20 12:00:11node21kernel:[123432.553213]mpt2sas1:log_info
(0x31120303):originator(PL),code(0x12),sub_code(0x0303)”;By consulting official's error code
Document is instructed, user can be with orientation problem reason.But the process for consulting document is time-consuming and laborious, it is impossible to timely and effectively
Failure judgement reason.
For such case, SAS card failure information automatically analyzes in a kind of Linux kernel daily records of the present application
System, it is intended to reduce human input, the process programming of manual analysis, realize automated diagnostic SAS card daily record error informations.
The content of the invention
Specifically, the application is claimed automatically analyzing for SAS card failure information in a kind of Linux kernel daily records and is
System, it is characterised in that the system includes:System log SAS card informations retrieve module, error code analysis module, and failure is explained defeated
Go out module;Wherein, system log SAS card informations retrieval module is used to drive the crucial phrase of error information to travel through according to SAS cards
System log, when finding information, is transferred to error code analysis module;Error code analysis module is used to SAS cards error code instruct
Documentor, and according to particular value come orientation problem reason;Failure explains that output module is used to error code analysis module examine
The result information that breaks is output to terminal and shows.
The automatic analysis system of SAS card failure information, is further characterized in that in Linux kernel daily records as described above,
Crucial phrase can be log_info, originator, code, sub_code.
The automatic analysis system of SAS card failure information, is further characterized in that in Linux kernel daily records as described above,
Particular value is tri- values of originator, code, sub_code.
The automatic analysis system of SAS card failure information, is further characterized in that in Linux kernel daily records as described above,
Error code analysis module orientation problem reason specifically includes definition structure body, and according to the difference of originator and code values,
Multiple Array for structural body are initialized, the content translation in SAS card error code documents is stored into Array for structural body, then obtains system
The log information that system daily record SAS card information retrievals module transmits, parses the value of originator, code, sub_code three
And temporal information, specific Array for structural body is navigated to further according to originator and code values, finally the original parsed
Cause and temporal information are sent to failure and explain output module.
Brief description of the drawings
Fig. 1, SAS card failures log information analytical framework figure of the present invention
Embodiment
The present invention can be divided into three parts:System log SAS card informations retrieve module, error code analysis module, failure solution
Release output module.
1st, system log SAS card informations retrieval module
System log SAS card informations retrieve module according to SAS cards driving error information crucial phrase " log_info,
Originator, code, sub_code " Ergodic Theory daily record (/var/log/messages), when finding information, are transferred to mistake
Bit error analyzing module;
2nd, error code analysis module
Official SAS card error codes are instructed documentor by error code analysis module, and according to originator, code,
Tri- values of sub_code carry out orientation problem reason;
3rd, failure explains output module
Failure explains that the result information that error code analysis module is diagnosed to be is output to terminal and shown by output module.
Flow is as shown in Figure 1.Based on this frame, it is possible to achieve the error code parsing of SAS card failure information.
Automatic analysis method of the present invention specific implementation step is illustrated below by one embodiment.
The present invention is realized in centos7.1 operating systems, kernel 3.10.0-229 to SAS card failure information errors codes
Resolving.SAS cards error code parses document with Infineon Technologies Corp.《SAS Generation 2 Error Codes Systems
Engineering Note》V2.2 is standard.
System log SAS card informations retrieval module passes through crucial phrase " log_info, originator, code, sub_
Code " Ergodic Theories daily record/var/log/messages, finds shaped like " Jul 20 12:00:11node21 kernel:
[123432.553213]mpt2sas1:log_info(0x31120303):originator(PL),code(0x12),sub_
The daily record row information of code (0x0303) ", and it is sent to error code analysis module.
Originator, code, sub_code are explained in detail in SAS cards error code parsing document, originator
It is one-to-many relation with code, code and sub_code are also one-to-many relation.Citing, originator=0, represents
IOP (I/O Processor) message code, when code is 0x01, sub_code and corresponding explanation of error such as following table 2-1,
Description row just describe the problem of specific reason.
Table 1SAS card error code resolution tables
Error code analysis module definition structure body struct sas_code_msg_t, member include sub_code and
description.Main flow includes:
According to the difference of originator and code values, the knot of multiple struct sas_code_msg_t types is initialized
Structure body array, stores the content translation in SAS card error code documents into Array for structural body;
Obtain the log information that system log SAS card informations retrieval module transmits, parse originator, code,
The value and temporal information of sub_code three;
Specific sas_code_msg_t Array for structural body is navigated to according to originator and code values, further according to sub_
Code obtains corresponding description member's content;
Finally the reason for parsing and temporal information be sent to failure explain output module.
Failure explains that output module receives the analysis result that error code analysis module passes over, and is presented to terminal and shows
Show.
Technical solutions according to the invention can also be applied to whether monitoring Linux kernel daily records generate the event of SAS cards
In the function of hindering information, run in an operating system in the form of finger daemon, when SAS card failures, monitor failure in time
Information produces, and provides the analysis result of error code.
It should be evident that illustrated above is only the specific embodiment of the present invention, for the common skill in this area
For art personnel, without creative efforts, other technical solutions can also be obtained according to above-described embodiment,
And the equivalent variations made in the scope of protection of the invention should all be fallen within the scope of protection of the present invention, and belong to the present invention
The scope of protection.
The present invention is based on SAS cards error code parsing document, the processes of manual analysis SAS card error messages with the shape of program
Formula is realized, can automatically be parsed the producing cause of SAS card failures, be realized the SAS card mistakes to Linux kernel daily records
The automated diagnostic of information, reduces human cost, improves the efficiency of analysis SAS card failures, technical support personnel can be square
Just failure cause is efficiently positioned.
Claims (4)
1. the automatic analysis system of SAS cards disk array fault message in a kind of Linux kernel daily records, it is characterised in that should
System includes:System log SAS card informations retrieve module, error code analysis module, and failure explains output module;Wherein, system
Daily record SAS card informations retrieval module is used for the crucial phrase Ergodic Theory daily record that error information is driven according to SAS cards, finds information
When, it is transferred to error code analysis module;Error code analysis module is used to SAS card error codes instruct documentor, and according to
Particular value carrys out orientation problem reason;Failure explains that the result information that output module is used to error code analysis module be diagnosed to be exports
Shown to terminal.
2. the automatic analysis system of SAS cards disk array fault message in Linux kernel daily records as claimed in claim 1,
It is further characterized in that, crucial phrase can be log_info, originator, code, sub_code.
3. the automatic analysis system of SAS cards disk array fault message in Linux kernel daily records as claimed in claim 2,
It is further characterized in that, tri- values of particular value originator, code, sub_code.
4. the automatic analysis system of SAS cards disk array fault message in Linux kernel daily records as claimed in claim 3,
Be further characterized in that, error code analysis module orientation problem reason specifically includes definition structure body, and according to originator and
The difference of code values, initializes multiple Array for structural body, and the content translation in SAS card error code documents is deposited into Array for structural body
Storage, then obtains the log information that system log SAS card informations retrieval module transmits, parses originator, code, sub_
The value and temporal information of code three, navigates to specific Array for structural body, finally further according to originator and code values
The reason for parsing and temporal information be sent to failure explain output module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711102483.6A CN107918573A (en) | 2017-11-10 | 2017-11-10 | The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711102483.6A CN107918573A (en) | 2017-11-10 | 2017-11-10 | The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107918573A true CN107918573A (en) | 2018-04-17 |
Family
ID=61895312
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711102483.6A Pending CN107918573A (en) | 2017-11-10 | 2017-11-10 | The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107918573A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108920297A (en) * | 2018-07-13 | 2018-11-30 | 郑州云海信息技术有限公司 | A kind of method and system diagnosing PCIE error information in black box log |
CN109582612A (en) * | 2018-12-24 | 2019-04-05 | 郑州云海信息技术有限公司 | A kind of device and its design, application method obtaining SAS card log |
CN109684525A (en) * | 2018-12-20 | 2019-04-26 | Oppo(重庆)智能科技有限公司 | Document display method, device, storage medium and test equipment |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106250258A (en) * | 2016-07-29 | 2016-12-21 | 北京云集智造科技有限公司 | A kind of disk failure localization method and device |
CN106886470A (en) * | 2017-02-22 | 2017-06-23 | 郑州云海信息技术有限公司 | A kind of fault detection method and system based on disk in linux |
-
2017
- 2017-11-10 CN CN201711102483.6A patent/CN107918573A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106250258A (en) * | 2016-07-29 | 2016-12-21 | 北京云集智造科技有限公司 | A kind of disk failure localization method and device |
CN106886470A (en) * | 2017-02-22 | 2017-06-23 | 郑州云海信息技术有限公司 | A kind of fault detection method and system based on disk in linux |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108920297A (en) * | 2018-07-13 | 2018-11-30 | 郑州云海信息技术有限公司 | A kind of method and system diagnosing PCIE error information in black box log |
CN109684525A (en) * | 2018-12-20 | 2019-04-26 | Oppo(重庆)智能科技有限公司 | Document display method, device, storage medium and test equipment |
CN109582612A (en) * | 2018-12-24 | 2019-04-05 | 郑州云海信息技术有限公司 | A kind of device and its design, application method obtaining SAS card log |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8660995B2 (en) | Flexible event data content management for relevant event and alert analysis within a distributed processing system | |
US7664986B2 (en) | System and method for determining fault isolation in an enterprise computing system | |
US9419650B2 (en) | Flexible event data content management for relevant event and alert analysis within a distributed processing system | |
US10037238B2 (en) | System and method for encoding exception conditions included at a remediation database | |
US9256482B2 (en) | Determining whether to send an alert in a distributed processing system | |
US20220050765A1 (en) | Method for processing logs in a computer system for events identified as abnormal and revealing solutions, electronic device, and cloud server | |
CN108984332A (en) | A kind of device and method of location-server delay machine failure | |
US11113142B2 (en) | Early risk detection and management in a software-defined data center | |
CN107918573A (en) | The automatic analysis system of SAS card failure information in a kind of Linux kernel daily records | |
US20240020017A1 (en) | Monitoring method and apparatus for electronic device, and electronic device | |
CN1960297A (en) | System and method for monitoring device unit on network | |
CN109783315A (en) | A kind of database platform automation method for inspecting and system | |
CN108108445A (en) | A kind of data intelligence processing method and system | |
EP3264291A1 (en) | Data block processing method and device | |
CN105005572A (en) | Database mapping method and device | |
CN107247505B (en) | Cloud server power supply blackbox design method easy to view | |
CN114390112A (en) | Rail transit emergency disposal method and device, electronic equipment and storage medium | |
CN113590405A (en) | Hard disk error detection method and device, storage medium and electronic device | |
US10986230B1 (en) | Method and apparatus to capture, analyze, organize, and present support chat information | |
US11816210B2 (en) | Risk-based alerting for computer security | |
CN116340115A (en) | Method, system, equipment and medium for monitoring and early warning of error of front end of applet | |
CN116566810A (en) | System exception handling method, device and equipment based on Spring MVC framework | |
CN115543707A (en) | Hard disk fault detection method, system and device, storage medium and electronic device | |
Stearley et al. | A {State-Machine} Approach to Disambiguating Supercomputer Event Logs | |
CN111639098A (en) | Cloud log management method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180417 |
|
RJ01 | Rejection of invention patent application after publication |