Disclosure of Invention
The invention aims to overcome the problems in the prior art and provides a data quality problem troubleshooting method based on a neural network algorithm.
In order to achieve the technical purpose and achieve the technical effect, the invention is realized by the following technical scheme:
a data quality problem checking method based on a neural network algorithm comprises the following steps:
s1, enabling each dispatcher to tidy personal field dispatching experiences, and summarizing and compiling the dispatching experiences to form a field dispatching experience set;
s2, collecting system data and auditing the system data;
s3, performing data characteristic analysis on the system data after auditing is completed, and establishing an event investigation tree structure diagram;
s4, the credibility of each event in the tree structure diagram is checked by using the site scheduling experience set inference step S3 made in the step S1;
s5, calculating the weight distribution of each event in the event troubleshooting tree graph;
s6, establishing a checking rule of the event checking tree structure chart when abnormal data occur in the system;
and S7, when the system gives an alarm, analyzing the abnormal data and then checking according to the checking rule of the event checking tree-shaped structure chart in the step S6.
Wherein the scheduling experience in step S1 includes exception data of the scheduled event, an event name of the scheduled event, and scheduling content of the scheduled event.
In step S1, the field scheduling experience set summarizes according to the event names of the scheduling events, sorts according to the sequence of the system service flow, performs data characteristic analysis and qualification on the abnormal data belonging to the same scheduling event, and merges and summarizes the scheduling contents belonging to the same scheduling event.
The system data in step S2 includes ledger and measurement data, system operation and maintenance data, system alarm data, and system business process.
The auditing of the system data in step S2 includes rechecking integrity of ledgers and metering data issued by the provincial power grid data platform, re-extracting system operation and maintenance data, checking quantity and authenticity of alarm information, and re-combing system business processes.
In step S3, the event-troubleshooting tree structure diagram constructs a frame according to the system service flow, and a single event is used as an independent node, where each node includes an event name, an event data storage location link, and a qualitative word of an event data feature.
In step S4, the inference step S3 of the reliability of each event in the event-finding tree structure diagram by using the field scheduling experience set created in step S1 specifically includes: and calling scheduling experience information which is in the same field scheduling experience set as the event names of the nodes in the event-troubleshooting tree-shaped structure chart, comparing whether the data, the data characteristics and the qualitative words are consistent, if so, informing a dispatcher of carrying out manual reasoning and carrying out manual modification on the label with standard credibility of the corresponding node of the event-troubleshooting tree-shaped structure chart, and if not, informing the dispatcher of carrying out manual reasoning and carrying out manual modification.
In step S5, the calculating the weight distribution of each event in the event-based tree view specifically includes: and carrying out weight distribution from large to small according to the negative influence degree of the event on the normal operation of the power grid.
The invention has the beneficial effects that: by depending on system data, combining with scheduling experience of a field dispatcher, using a big data algorithm and a neural network algorithm to establish an event troubleshooting tree structure chart, further formulating a troubleshooting rule, analyzing abnormal data and then troubleshooting according to the troubleshooting rule when the system gives an alarm, realizing primary positioning of the alarm event, simultaneously feeding back a positioning result to the field personnel, and improving the automation degree.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
In the description of the present invention, it is to be understood that the terms "opening," "upper," "lower," "thickness," "top," "middle," "length," "inner," "peripheral," and the like are used in an orientation or positional relationship that is merely for convenience in describing and simplifying the description, and do not indicate or imply that the referenced component or element must have a particular orientation, be constructed and operated in a particular orientation, and thus should not be considered as limiting the present invention.
As shown in fig. 1, a method for troubleshooting data quality problems based on a neural network algorithm includes the following steps:
s1, enabling each dispatcher to tidy personal field dispatching experiences, and summarizing and compiling the dispatching experiences to form a field dispatching experience set;
s2, collecting system data and auditing the system data;
s3, performing data characteristic analysis on the system data after auditing is completed, and establishing an event investigation tree structure diagram;
s4, the credibility of each event in the tree structure diagram is checked by using the site scheduling experience set inference step S3 made in the step S1;
s5, calculating the weight distribution of each event in the event troubleshooting tree graph;
s6, establishing a checking rule of the event checking tree structure chart when abnormal data occur in the system;
and S7, when the system gives an alarm, analyzing the abnormal data and then checking according to the checking rule of the event checking tree-shaped structure chart in the step S6.
By depending on system data, combining with scheduling experience of a field dispatcher, using a big data algorithm and a neural network algorithm to establish an event troubleshooting tree structure chart, further formulating a troubleshooting rule, analyzing abnormal data and then troubleshooting according to the troubleshooting rule when the system gives an alarm, realizing primary positioning of the alarm event, simultaneously feeding back a positioning result to the field personnel, and improving the automation degree.
The scheduling experience in step S1 includes the exception data of the scheduled event, the event name of the scheduled event, and the scheduling content of the scheduled event.
In step S1, the field scheduling experience set is summarized according to the event names of the scheduling events, sorted according to the sequence of the system service flow, subjected to data characteristic analysis and qualification on the abnormal data belonging to the same scheduling event, and merged and summarized in the scheduling contents belonging to the same scheduling event.
The system data in step S2 includes ledger and measurement data, system operation and maintenance data, system alarm data, and system business process.
The auditing of the system data in step S2 includes rechecking the integrity of ledgers and metering data issued by the provincial power grid data platform, re-extracting system operation and maintenance data, checking the number and authenticity of alarm information, and rechecking the system business process.
In step S3, the event-troubleshooting tree structure diagram constructs a framework according to the system service flow, and a single event is used as an independent node, where each node includes an event name, an event data storage location link, and a qualitative word of an event data feature.
In step S4, the inference step S3 of the reliability of each event in the event-finding tree structure diagram by using the field scheduling experience set created in step S1 specifically includes: and calling scheduling experience information which is in the same field scheduling experience set as the event names of the nodes in the event-troubleshooting tree-shaped structure chart, comparing whether the data, the data characteristics and the qualitative words are consistent, if so, informing a dispatcher of carrying out manual reasoning and carrying out manual modification on the label with standard credibility of the corresponding node of the event-troubleshooting tree-shaped structure chart, and if not, informing the dispatcher of carrying out manual reasoning and carrying out manual modification.
In step S5, the calculating the weight distribution of each event in the event-based tree includes: and carrying out weight distribution from large to small according to the negative influence degree of the event on the normal operation of the power grid. When one or more data characteristics are close to a plurality of nodes at the same time, the data characteristics are compared in sequence from large to small according to the weight, so that the delay time of an event with large negative influence degree on the normal operation of the power grid is avoided.
The first example of the investigation: IF 2 substation transformation ratio error = is
AND imbalance = mild
AND persistent present = is
AND has not previously occurred = is
THEN determines that the ratio data is not updated
Investigation example two: IF 2 substation transformation ratio error = is
AND imbalance = moderate
AND persistent present = is
AND not before present = no
THEN judges that the transformation ratio data is not standard and the value range is wrong
In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The foregoing shows and describes the general principles, essential features, and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed.