[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN112966003A - Data quality problem troubleshooting method based on neural network algorithm - Google Patents

Data quality problem troubleshooting method based on neural network algorithm Download PDF

Info

Publication number
CN112966003A
CN112966003A CN202110229700.8A CN202110229700A CN112966003A CN 112966003 A CN112966003 A CN 112966003A CN 202110229700 A CN202110229700 A CN 202110229700A CN 112966003 A CN112966003 A CN 112966003A
Authority
CN
China
Prior art keywords
event
data
troubleshooting
checking
scheduling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110229700.8A
Other languages
Chinese (zh)
Inventor
李保平
谢超
王辉
陈�峰
李启航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Huitong Guoxin Technology Co Ltd
Original Assignee
Guangzhou Huitong Guoxin Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Huitong Guoxin Technology Co Ltd filed Critical Guangzhou Huitong Guoxin Technology Co Ltd
Priority to CN202110229700.8A priority Critical patent/CN112966003A/en
Publication of CN112966003A publication Critical patent/CN112966003A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Resources & Organizations (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Primary Health Care (AREA)
  • Marketing (AREA)
  • Water Supply & Treatment (AREA)
  • Databases & Information Systems (AREA)
  • Public Health (AREA)
  • Tourism & Hospitality (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a data quality problem checking method based on a neural network algorithm, which comprises the following steps: s1, compiling a field scheduling experience set; s2, collecting system data and auditing the system data; s3, establishing an event investigation tree structure diagram; s4, reasoning step S3 event troubleshooting tree structure chart each event credibility; s5, calculating the weight distribution of each event in the event troubleshooting tree graph; s6, establishing a checking rule of the event checking tree structure chart; and S7, checking according to the checking rule. The beneficial effects are that: by depending on system data, combining with scheduling experience of a field dispatcher, using a big data algorithm and a neural network algorithm to establish an event troubleshooting tree structure chart, further formulating a troubleshooting rule, analyzing abnormal data and then troubleshooting according to the troubleshooting rule when the system gives an alarm, realizing primary positioning of the alarm event, simultaneously feeding back a positioning result to the field personnel, and improving the automation degree.

Description

Data quality problem troubleshooting method based on neural network algorithm
Technical Field
The invention relates to the technical field of data processing, in particular to a data quality problem troubleshooting method based on a neural network algorithm.
Background
After the power plant or the transformer substation receives the requirement of the investigation data, the investigation process only depends on the experience of field personnel. Different dispatchers have different experiences, and the investigation sequence is different from the steps, so that only suspected problem points can be provided, and then substation operators or distribution network operators are contacted to investigate the problem of abnormal data in a coordinated manner, which requires huge investment in labor and is tedious and time-consuming in work.
Disclosure of Invention
The invention aims to overcome the problems in the prior art and provides a data quality problem troubleshooting method based on a neural network algorithm.
In order to achieve the technical purpose and achieve the technical effect, the invention is realized by the following technical scheme:
a data quality problem checking method based on a neural network algorithm comprises the following steps:
s1, enabling each dispatcher to tidy personal field dispatching experiences, and summarizing and compiling the dispatching experiences to form a field dispatching experience set;
s2, collecting system data and auditing the system data;
s3, performing data characteristic analysis on the system data after auditing is completed, and establishing an event investigation tree structure diagram;
s4, the credibility of each event in the tree structure diagram is checked by using the site scheduling experience set inference step S3 made in the step S1;
s5, calculating the weight distribution of each event in the event troubleshooting tree graph;
s6, establishing a checking rule of the event checking tree structure chart when abnormal data occur in the system;
and S7, when the system gives an alarm, analyzing the abnormal data and then checking according to the checking rule of the event checking tree-shaped structure chart in the step S6.
Wherein the scheduling experience in step S1 includes exception data of the scheduled event, an event name of the scheduled event, and scheduling content of the scheduled event.
In step S1, the field scheduling experience set summarizes according to the event names of the scheduling events, sorts according to the sequence of the system service flow, performs data characteristic analysis and qualification on the abnormal data belonging to the same scheduling event, and merges and summarizes the scheduling contents belonging to the same scheduling event.
The system data in step S2 includes ledger and measurement data, system operation and maintenance data, system alarm data, and system business process.
The auditing of the system data in step S2 includes rechecking integrity of ledgers and metering data issued by the provincial power grid data platform, re-extracting system operation and maintenance data, checking quantity and authenticity of alarm information, and re-combing system business processes.
In step S3, the event-troubleshooting tree structure diagram constructs a frame according to the system service flow, and a single event is used as an independent node, where each node includes an event name, an event data storage location link, and a qualitative word of an event data feature.
In step S4, the inference step S3 of the reliability of each event in the event-finding tree structure diagram by using the field scheduling experience set created in step S1 specifically includes: and calling scheduling experience information which is in the same field scheduling experience set as the event names of the nodes in the event-troubleshooting tree-shaped structure chart, comparing whether the data, the data characteristics and the qualitative words are consistent, if so, informing a dispatcher of carrying out manual reasoning and carrying out manual modification on the label with standard credibility of the corresponding node of the event-troubleshooting tree-shaped structure chart, and if not, informing the dispatcher of carrying out manual reasoning and carrying out manual modification.
In step S5, the calculating the weight distribution of each event in the event-based tree view specifically includes: and carrying out weight distribution from large to small according to the negative influence degree of the event on the normal operation of the power grid.
The invention has the beneficial effects that: by depending on system data, combining with scheduling experience of a field dispatcher, using a big data algorithm and a neural network algorithm to establish an event troubleshooting tree structure chart, further formulating a troubleshooting rule, analyzing abnormal data and then troubleshooting according to the troubleshooting rule when the system gives an alarm, realizing primary positioning of the alarm event, simultaneously feeding back a positioning result to the field personnel, and improving the automation degree.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
fig. 1 is a schematic structural diagram of a data quality problem troubleshooting method in an embodiment of the present invention.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
In the description of the present invention, it is to be understood that the terms "opening," "upper," "lower," "thickness," "top," "middle," "length," "inner," "peripheral," and the like are used in an orientation or positional relationship that is merely for convenience in describing and simplifying the description, and do not indicate or imply that the referenced component or element must have a particular orientation, be constructed and operated in a particular orientation, and thus should not be considered as limiting the present invention.
As shown in fig. 1, a method for troubleshooting data quality problems based on a neural network algorithm includes the following steps:
s1, enabling each dispatcher to tidy personal field dispatching experiences, and summarizing and compiling the dispatching experiences to form a field dispatching experience set;
s2, collecting system data and auditing the system data;
s3, performing data characteristic analysis on the system data after auditing is completed, and establishing an event investigation tree structure diagram;
s4, the credibility of each event in the tree structure diagram is checked by using the site scheduling experience set inference step S3 made in the step S1;
s5, calculating the weight distribution of each event in the event troubleshooting tree graph;
s6, establishing a checking rule of the event checking tree structure chart when abnormal data occur in the system;
and S7, when the system gives an alarm, analyzing the abnormal data and then checking according to the checking rule of the event checking tree-shaped structure chart in the step S6.
By depending on system data, combining with scheduling experience of a field dispatcher, using a big data algorithm and a neural network algorithm to establish an event troubleshooting tree structure chart, further formulating a troubleshooting rule, analyzing abnormal data and then troubleshooting according to the troubleshooting rule when the system gives an alarm, realizing primary positioning of the alarm event, simultaneously feeding back a positioning result to the field personnel, and improving the automation degree.
The scheduling experience in step S1 includes the exception data of the scheduled event, the event name of the scheduled event, and the scheduling content of the scheduled event.
In step S1, the field scheduling experience set is summarized according to the event names of the scheduling events, sorted according to the sequence of the system service flow, subjected to data characteristic analysis and qualification on the abnormal data belonging to the same scheduling event, and merged and summarized in the scheduling contents belonging to the same scheduling event.
The system data in step S2 includes ledger and measurement data, system operation and maintenance data, system alarm data, and system business process.
The auditing of the system data in step S2 includes rechecking the integrity of ledgers and metering data issued by the provincial power grid data platform, re-extracting system operation and maintenance data, checking the number and authenticity of alarm information, and rechecking the system business process.
In step S3, the event-troubleshooting tree structure diagram constructs a framework according to the system service flow, and a single event is used as an independent node, where each node includes an event name, an event data storage location link, and a qualitative word of an event data feature.
In step S4, the inference step S3 of the reliability of each event in the event-finding tree structure diagram by using the field scheduling experience set created in step S1 specifically includes: and calling scheduling experience information which is in the same field scheduling experience set as the event names of the nodes in the event-troubleshooting tree-shaped structure chart, comparing whether the data, the data characteristics and the qualitative words are consistent, if so, informing a dispatcher of carrying out manual reasoning and carrying out manual modification on the label with standard credibility of the corresponding node of the event-troubleshooting tree-shaped structure chart, and if not, informing the dispatcher of carrying out manual reasoning and carrying out manual modification.
In step S5, the calculating the weight distribution of each event in the event-based tree includes: and carrying out weight distribution from large to small according to the negative influence degree of the event on the normal operation of the power grid. When one or more data characteristics are close to a plurality of nodes at the same time, the data characteristics are compared in sequence from large to small according to the weight, so that the delay time of an event with large negative influence degree on the normal operation of the power grid is avoided.
The first example of the investigation: IF 2 substation transformation ratio error = is
AND imbalance = mild
AND persistent present = is
AND has not previously occurred = is
THEN determines that the ratio data is not updated
Investigation example two: IF 2 substation transformation ratio error = is
AND imbalance = moderate
AND persistent present = is
AND not before present = no
THEN judges that the transformation ratio data is not standard and the value range is wrong
In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The foregoing shows and describes the general principles, essential features, and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed.

Claims (8)

1. A data quality problem checking method based on a neural network algorithm is characterized by comprising the following steps:
s1, enabling each dispatcher to tidy personal field dispatching experiences, and summarizing and compiling the dispatching experiences to form a field dispatching experience set;
s2, collecting system data and auditing the system data;
s3, performing data characteristic analysis on the system data after auditing is completed, and establishing an event investigation tree structure diagram;
s4, the credibility of each event in the tree structure diagram is checked by using the site scheduling experience set inference step S3 made in the step S1;
s5, calculating the weight distribution of each event in the event troubleshooting tree graph;
s6, establishing a checking rule of the event checking tree structure chart when abnormal data occur in the system;
and S7, when the system gives an alarm, analyzing the abnormal data and then checking according to the checking rule of the event checking tree-shaped structure chart in the step S6.
2. The data quality problem investigation method of claim 1, wherein: the scheduling experience in step S1 includes the exception data of the scheduled event, the event name of the scheduled event, and the scheduling content of the scheduled event.
3. The data quality problem investigation method of claim 2, wherein: in the step S1, the field scheduling experience set is summarized according to the event names of the scheduling events, sorted according to the sequence of the system service flow, subjected to data characteristic analysis and qualification on the abnormal data belonging to the same scheduling event, and merged and summarized in the scheduling contents belonging to the same scheduling event.
4. The data quality problem investigation method of claim 1, wherein: the system data in the step S2 includes ledger and measurement data, system operation and maintenance data, system alarm data, and system business process.
5. The data quality problem investigation method of claim 4, wherein: the auditing of the system data in step S2 includes rechecking the integrity of the ledger and metering data issued by the provincial power grid data platform, re-extracting the system operation and maintenance data, checking the number and authenticity of alarm information, and re-combing the system business process.
6. The data quality problem investigation method of claim 1, wherein: in the step S3, the event-finding tree structure diagram constructs a framework according to the system service flow, and a single event is used as an independent node, and each node includes an event name, an event data storage location link, and a qualitative word of an event data feature.
7. The method for troubleshooting data quality as recited in claim 6, wherein the step S4 of inferring the credibility of each event in the event troubleshooting tree structure diagram in the step S3 using the field scheduling experience set created in the step S1 specifically comprises: and calling scheduling experience information which is in the same field scheduling experience set as the event names of the nodes in the event-troubleshooting tree-shaped structure chart, comparing whether the data, the data characteristics and the qualitative words are consistent, if so, informing a dispatcher of carrying out manual reasoning and carrying out manual modification on the label with standard credibility of the corresponding node of the event-troubleshooting tree-shaped structure chart, and if not, informing the dispatcher of carrying out manual reasoning and carrying out manual modification.
8. The method for troubleshooting of data quality as claimed in claim 1, wherein the calculating the weight assignment of each event in the event troubleshooting tree in the step S5 specifically includes: and carrying out weight distribution from large to small according to the negative influence degree of the event on the normal operation of the power grid.
CN202110229700.8A 2021-03-02 2021-03-02 Data quality problem troubleshooting method based on neural network algorithm Pending CN112966003A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110229700.8A CN112966003A (en) 2021-03-02 2021-03-02 Data quality problem troubleshooting method based on neural network algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110229700.8A CN112966003A (en) 2021-03-02 2021-03-02 Data quality problem troubleshooting method based on neural network algorithm

Publications (1)

Publication Number Publication Date
CN112966003A true CN112966003A (en) 2021-06-15

Family

ID=76276255

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110229700.8A Pending CN112966003A (en) 2021-03-02 2021-03-02 Data quality problem troubleshooting method based on neural network algorithm

Country Status (1)

Country Link
CN (1) CN112966003A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115222371A (en) * 2022-09-08 2022-10-21 联信弘方(北京)科技股份有限公司 Problem troubleshooting method and device, electronic equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040015906A1 (en) * 2001-04-30 2004-01-22 Goraya Tanvir Y. Adaptive dynamic personal modeling system and method
US20080086363A1 (en) * 2006-10-06 2008-04-10 Accenture Global Services Gmbh Technology event detection, analysis, and reporting system
CN102819239A (en) * 2011-06-08 2012-12-12 同济大学 Intelligent fault diagnosis method of numerical control machine tool
US20140172919A1 (en) * 2012-12-18 2014-06-19 Cisco Technology, Inc. Automatic correlation of dynamic system events within computing devices
CN106022583A (en) * 2016-05-12 2016-10-12 中国电力科学研究院 Electric power communication service risk calculation method and system based on fuzzy decision tree
CN106154209A (en) * 2016-07-29 2016-11-23 国电南瑞科技股份有限公司 Electrical energy meter fault Forecasting Methodology based on decision Tree algorithms
CN106961249A (en) * 2017-03-17 2017-07-18 广西大学 A kind of diagnosing failure of photovoltaic array and method for early warning
US20190286660A1 (en) * 2018-03-13 2019-09-19 HCA Holdings, Inc. Techniques for generating investigatory-event mappings using graph-structure trajectories
US20200177608A1 (en) * 2018-12-04 2020-06-04 International Business Machines Corporation Ontology Based Persistent Attack Campaign Detection

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040015906A1 (en) * 2001-04-30 2004-01-22 Goraya Tanvir Y. Adaptive dynamic personal modeling system and method
US20080086363A1 (en) * 2006-10-06 2008-04-10 Accenture Global Services Gmbh Technology event detection, analysis, and reporting system
CN102819239A (en) * 2011-06-08 2012-12-12 同济大学 Intelligent fault diagnosis method of numerical control machine tool
US20140172919A1 (en) * 2012-12-18 2014-06-19 Cisco Technology, Inc. Automatic correlation of dynamic system events within computing devices
CN106022583A (en) * 2016-05-12 2016-10-12 中国电力科学研究院 Electric power communication service risk calculation method and system based on fuzzy decision tree
CN106154209A (en) * 2016-07-29 2016-11-23 国电南瑞科技股份有限公司 Electrical energy meter fault Forecasting Methodology based on decision Tree algorithms
CN106961249A (en) * 2017-03-17 2017-07-18 广西大学 A kind of diagnosing failure of photovoltaic array and method for early warning
US20190286660A1 (en) * 2018-03-13 2019-09-19 HCA Holdings, Inc. Techniques for generating investigatory-event mappings using graph-structure trajectories
US20200177608A1 (en) * 2018-12-04 2020-06-04 International Business Machines Corporation Ontology Based Persistent Attack Campaign Detection

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
杨剑 等: "基于告警加权的智能光网络故障诊断算法", 《湘潭大学自然科学学报》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115222371A (en) * 2022-09-08 2022-10-21 联信弘方(北京)科技股份有限公司 Problem troubleshooting method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN110674189B (en) Method for monitoring secondary state and positioning fault of intelligent substation
CN106557991B (en) Voltage monitoring data platform
CN110707692A (en) Online load analysis and modeling system and method for power system
CN103501503B (en) A kind of network problem analysis method and apparatus
CN104933631A (en) Power distribution network operation online analysis and evaluation system
CN110647567B (en) Power failure plan auxiliary decision system based on end data fusion
CN112464995A (en) Power grid distribution transformer fault diagnosis method and system based on decision tree algorithm
CN101883015A (en) Method and system for filtering construction alarms
CN106897779A (en) A kind of processing method of data center's operational system event
CN112865316A (en) Power supply service analysis command system and method based on big data
CN112465167A (en) Intelligent decision management system and method for regional power grid equipment maintenance
CN118279086B (en) Method and system for automatically generating interval protection configuration model data by SCD (stream control device) file
CN112966003A (en) Data quality problem troubleshooting method based on neural network algorithm
CN108710566A (en) A kind of distribution scheduling station integration testing framework and method
CN116743079A (en) Photovoltaic string fault processing method and device, photovoltaic management system and medium
CN115409264A (en) Power distribution network emergency repair stagnation point position optimization method based on feeder line fault prediction
CN109995856A (en) A kind of grid operation data wide area collects method and system
CN113256074A (en) Automatic process operation system and method for electric robot
CN112256922A (en) Fault power failure rapid identification method and system
CN111639839A (en) Micro-service-based power grid fault analysis method and system
CN113904440B (en) Topology analysis-based power system morning operation test scheme generation method
CN110738427A (en) electric power part work quality scoring system
CN112446619B (en) Power distribution network rush-repair processing method and device
CN107748701A (en) A kind of analysis method for reliability of electric energy measurement automation system
CN115542070A (en) Distribution network line fault positioning method and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination