CN117648543A

CN117648543A - Self-evolving substation equipment learning method

Info

Publication number: CN117648543A
Application number: CN202410127488.8A
Authority: CN
Inventors: 周洪海; 金志浩; 谢丽萍; 赵玉薇
Original assignee: Jinshu Information Technology Suzhou Co ltd
Current assignee: Jinshu Information Technology Suzhou Co ltd
Priority date: 2024-01-30
Filing date: 2024-01-30
Publication date: 2024-03-05

Abstract

The invention discloses a self-evolving substation equipment learning method, which comprises the steps of S1, collecting data from a substation by using a plurality of sensors, and establishing a substation data set; s2, processing the collected data set; s3, collecting data in a fixed time interval; s4, data desensitization is carried out by adopting a differential privacy technology; s5, performing time sequence analysis on the equipment operation parameters and the external parameters by adopting an autoregressive integrated moving average model; s6, performing fault prediction by using a decision tree model; s7, collecting the latest operation data from the transformer substation sensor at fixed time intervals; s8, carrying out a time sequence analysis method by the continuation model, and updating key features periodically; s9, adjusting a splitting criterion and a pruning strategy of the decision tree; s10, calculating the accuracy, recall rate and F1 score of the decision tree model; s11, identifying the change trend of the model performance; s12, fine tuning of the model is performed. The invention improves the accuracy and timeliness of fault prediction.

Description

Self-evolving substation equipment learning method

Technical Field

The invention relates to the technical field of information, in particular to a self-evolving substation equipment learning method.

Background

In the operation of a power system, a transformer substation is used as a key node for electric energy transmission, and the stability and safety of the whole power grid are directly influenced by the health condition of equipment. Conventional substation equipment monitoring and fault prediction methods rely on periodic physical inspection and empirically driven predictive models, which have significant limitations in facing complex, dynamically changing operating environments.

First, conventional methods often fail to fully utilize the large amount of data generated by the substation. With the development of sensor technology and data acquisition systems, substation equipment can generate a large amount of data about the running state in real time, including parameters such as temperature, pressure, current and the like. However, conventional methods fail to integrate and analyze such data effectively, thereby missing the opportunity to use such data for in-depth analysis and prediction. Second, conventional fault prediction methods often lack adaptivity. These methods are typically designed based on a fixed set of rules or models that are difficult to adapt to widely varying actual operating conditions. Due to the complexity of the substation environment, such as seasonal changes, weather effects, and equipment aging, these fixed models often fail to accurately predict equipment failure. Furthermore, the prior art has a disadvantage in terms of privacy protection. In collecting and analyzing sensitive operational data, how to ensure data security and privacy protection is an important challenge. Conventional methods often lack effective privacy protection measures in processing sensitive data, which may lead to risks of data disclosure or abuse. Therefore, how to provide a self-evolving substation equipment learning method is a problem that needs to be solved by those skilled in the art.

Disclosure of Invention

The invention aims to provide a self-evolving substation equipment learning method, equipment faults can be predicted more accurately by comprehensively utilizing operation data collected in real time, and the accuracy and timeliness of fault prediction are improved by combining time sequence analysis and a machine learning model.

According to the embodiment of the invention, the self-evolving substation equipment learning method comprises the following steps of:

s1, collecting temperature from a transformer substation by using a plurality of sensorsPressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->And build up a substation data set；

S2, collecting data setsPerforming outlier processing by using the Z-Score, and adopting Min-Max standardization to process data in a data set;

s3, processing the high-speed data stream by adopting a time window method, and collecting data in a fixed time interval;

s4, adding random noise in the data set, and performing data desensitization by adopting a differential privacy technology;

s5, performing time sequence analysis on equipment operation parameters and external parameters by adopting an autoregressive integrated moving average model, and extracting trend, seasonal and random components;

S6, performing fault prediction by using a decision tree model, and predicting whether the equipment is likely to be in fault or not based on the characteristics extracted from the time sequence analysis;

s7, configuring a high-frequency acquisition module, and collecting the latest operation data from the transformer substation sensor at fixed time intervals;

s8, transmitting the latest operation data to a central data warehouse in real time, marking the latest operation data with a time stamp, continuing the model to perform a time sequence analysis method, and updating key features regularly;

s9, adopting an online learning algorithm increment version of the decision tree to enable the model to learn and self-adjust based on the latest data, and adjusting the splitting criterion and pruning strategy of the decision tree;

s10, regularly calculating the accuracy, recall rate and F1 score of the decision tree model;

s11, comparing the current performance index with preset reference or historical performance data, and identifying the change trend of the model performance;

and S12, periodically collecting user feedback including accuracy and practicability evaluation of model prediction, and carrying out fine adjustment of the model by combining the user feedback and performance monitoring results, modifying a decision threshold or introducing new data features.

Optionally, the S2 specifically includes:

s21, regarding transformer substation data sets In relation to temperature->Pressure->Current->Record of operations->Historical maintenance dataExternal environmental information->And weather conditions->Data of->Calculate each data +.>Mean->And standard deviation->For each data point->Calculate Z-Score value:

；

s22, for each dataDefining a specific threshold for identifying and eliminating outliers of corresponding data dimensionsFor some data->When->The data point is regarded as an outlier;

s23, regarding the data points identified as abnormalTaking an average of the data points from which the anomaly was removed or replaced with neighboring data points of the dimension;

s24, repeating S21-S23 until the substation data setMiddle temperature->Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->No more abnormal values exceeding the respective threshold value exist for the data of (a), the processed data set is defined as +.>；

S25 forIs +.>Calculate its minimum +.>Maximum->；

S26, applying Min-Max standardization to each data point：

；

Wherein,is the data dimension +.>Normalized data points in the middle, the normalized data set after processing is defined as。

Optionally, the step S4 specifically includes:

S41, data setIs +.>Definition of +.>Pressure->Current->Operation recordHistorical maintenance data->External environmental information->And weather conditions->Privacy budget for each data type of (2)>And calculates the corresponding data type +.>Sensitivity reflecting maximum range of variation of substation operation data in differential privacy processing>；

S42, each data point：

；

Wherein,represents->Data points after desensitization in dimensions;

s43, finally generating a temperaturePressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->Desensitization dataset->。

Optionally, the step S5 specifically includesAn autoregressive integrated moving average model for analyzing and predicting the operating state of the substation equipment is constructed, and trend, seasonal and random components are extracted from the time series data:

；

wherein,represents time t>Predictive value of type data->、And->Constant term, autoregressive coefficient and moving average coefficient of model respectively, < >>、And->The long-term trend, periodic variation of seasonal components, and irregular fluctuation of random components, which are trend components of the disclosure data, are coordinated.

Optionally, the step S6 specifically includes:

s61, constructing a decision tree model based on key features extracted by time sequence analysis, wherein the decision tree model uses data pointsAs input->Representative dataset +.>Trend, seasonal and random components extracted in the time series analysis:

；

wherein,parameters representing the decision tree, including split point and threshold value +.>Is based on data set->For evaluating whether a specific condition is met;

s62, each internal node of the decision tree represents a test condition:

；

wherein,、and->For determining the current temperature->Pressure->And current->Whether the set corresponding thresholds t, p and i are exceeded;

s63, each branch of the decision tree represents the result of the test condition and points to a further node or a final leaf node if、And->If a certain item exceeds a threshold value, the path points to a fault class, otherwise, the path points to a normal class;

s64, each leaf node of the decision tree represents a fault or a normal state based on the data pointsAnd a predefined fault criterion, meaning that the substation is at risk of a fault according to the current analysis if the path of the decision tree points to the fault.

Optionally, the step S8 specifically includes:

S81, defining data acquisition frequency f, and in each acquisition periodIn, for all collection about temperature +.>Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->Data points>Adding a timestamp mark;

s82, data points with time stampsThe updated data set is formed by outlier detection, min-Max standardization and differential privacy technology processing>；

S83, useUpdating a comprehensive time series model which comprehensively considers all data points and time stamps thereof:

；

s84, according to the decision tree modelUpdating the structure and the condition correspondingly;

s85, continuously updating a time sequence analysis model of key operation parameters of the substation equipment according to the latest data collected by the high-frequency acquisition module;

s86, for each running parameter, the autoregressive integrated moving average model periodically uses the newly acquired dataTo update the autoregressive coefficients in the model parameters +.>Moving average coefficient->And model constant term->：

。

Optionally, the step S9 specifically includes:

s91, applying an incremental learning version of the decision tree model to process a new data set in real timeThe latest operation parameter data of the transformer substation are collected from the real-time data collection module;

Wherein, the time series characteristic: time-dependent features such as trends, seasonal patterns are extracted from the temperature, pressure and current data. Maintenance and operation features: key features such as maintenance frequency, operational anomalies are extracted from historical repair and operational records. Environmental characteristics: the effect of weather and external environment on the performance of the device is considered.

S92, the decision tree model is based on the new data setDynamically updating, wherein the updating rule is based on a time sequence analysis result:

；

wherein,is data point +.>Middle temperature->Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->The amount of change at successive time points, +.>Is a preset update threshold;

s93, the update operation includes splitting and pruning of the nodes for adapting to the new data pattern.

Optionally, the step S10 specifically includes:

s101, periodically using the slave data setAnd evaluating the performance of the model with the updated model;

s102, calculating accuracy and data variationAnd (3) associating:

；

wherein each data pointRepresentative includes temperature->Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->Is a different data type of (a);

S103, the recall rate is related to updating of the decision tree model:

；

the recall reflects the capability of the model to identify the actual faults, and changes of various operation parameters are considered;

s104, F1 score combines accuracy and recall, and is closely related to a model updating mechanism:

；

wherein each ofThe calculation is as follows:

；

s105, according to the accuracy, recall and F1 score results, adjusting the splitting criterion and pruning strategy of the decision tree and updating the parameters of the time sequence model, and carrying out real-time fault prediction of the transformer substation.

Optionally, the step S11 specifically includes:

s111, periodically evaluating and comparing indexes of the accuracy rate, the recall rate and the F1 score in the current model with preset reference or historical performance data;

s112, in the change trend, a mathematical formula is used for quantifying the change of the performance index and the performance changeExpressed as the difference between the current index value and the historical index value:

；

wherein,accuracy, recall and F1 score;

s113, identifying the improvement or reduction of the model performance and the field possibly needing further adjustment by analyzing the accuracy, recall and F1 score change trend;

s114, if the recall rate is reduced, indicating that the decision tree model needs to be further adjusted or the time sequence analysis needs to be updated to reflect the latest trend and mode;

If the accuracy rate is reduced, indicating that the overall performance of the model is reduced in distinguishing between the fault and non-fault states of the substation, properly processing the abnormal values by checking the data collection and integration steps, properly selecting and standardizing the characteristics, and re-evaluating the initial model construction process, including whether the training data set needs to be adjusted, the model parameters need to be changed or different algorithms need to be tried;

if F1 score is reduced, the defect of the model on the recognition capability of the fault state is indicated, and the model can not only recognize the fault but also correctly distinguish the non-fault condition by optimizing the splitting criterion and pruning strategy of the decision tree model.

Optionally, the step 12 specifically includes:

s121, user feedback from substation operators and maintenance teams is collected regularly, and accuracy and practicability evaluation of model prediction are focused on;

s122, combining user feedback with performance monitoring results of accuracy, recall and F1 score to comprehensively evaluate the performance and practicability of the model;

s123, based on the collected feedback and performance data, performing fine adjustment for modifying decision threshold or introducing new data features on the model, wherein a specific adjustment formula is expressed as an adjusted performance index Coefficients associated with user feedback +.>Is the product of:

；

wherein,is based on the userFeedback quantization score, representing improvement or decrease of model prediction accuracy or practicality, ++>Is an adjustment coefficient for controlling the degree of influence of user feedback on the model performance adjustment;

the adjustment process includes modifying splitting criteria or time series model parameters of the decision tree model;

s124, the effect of model adjustment is evaluated by recalculating the accuracy, recall and F1 score.

The beneficial effects of the invention are as follows:

the invention can more accurately predict the equipment faults by comprehensively utilizing the operation data collected in real time, and improves the accuracy and timeliness of fault prediction by combining time sequence analysis and a machine learning model. Meanwhile, the self-evolution learning mechanism enables the model to learn and self-adjust based on the latest data, and the model can adapt to operation changes such as ageing, seasonal changes, weather influences and the like of substation equipment, so that the prediction performance of the model is kept.

According to the invention, the accuracy, recall rate and F1 score of the model are regularly calculated, and the performance indexes are compared with the historical data, so that the degradation of the performance of the model can be timely identified and solved, the prediction model is ensured to always maintain an optimal state, and meanwhile, the model is regularly collected and adjusted by combining the feedback, so that the model is ensured to be not only theoretically effective, but also has high practicability and reliability in actual operation. This enhances the consistency between the model and the actual operation.

The invention can reduce the downtime caused by equipment faults, and enables a maintenance team to plan and execute maintenance tasks more effectively by predicting and identifying potential problems in time, thereby improving the overall operation and maintenance efficiency.

Drawings

The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention. In the drawings:

fig. 1 is a flowchart of a self-evolving substation equipment learning method according to the present invention;

fig. 2 is a flowchart of initial model construction in a self-evolving substation equipment learning method provided by the invention;

fig. 3 is a flowchart of a performance evaluation model in a self-evolving substation equipment learning method provided by the invention.

Description of the embodiments

The invention will now be described in further detail with reference to the accompanying drawings. The drawings are simplified schematic representations which merely illustrate the basic structure of the invention and therefore show only the structures which are relevant to the invention.

Referring to fig. 1-3, a self-evolving substation equipment learning method includes the steps of:

s1, collecting temperature from a transformer substation by using a plurality of sensors Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->And build up a substation data set；

in this embodiment, S2 specifically includes:

s21, regarding transformer substation data setsIn relation to temperature->Pressure->Current->Record of operations->Historical maintenance dataExternal environmental information->And weather conditions->Data of->Calculate each data +.>Mean->And standard deviation->For each data point->Calculation of Z-Score value:

；

s24, repeating S21-S23 until the substation data setMiddle temperature->Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->No more abnormal values exceeding the respective threshold value exist for the data of (a), the processed data set is defined as +. >The consistency and reliability of the data are ensured;

the multi-dimensional data, including the key operation parameter temperature, relied on by the substation equipment learning method can be ensured through the steps S21-S24Pressure->Current->And other important information operation records->Historical maintenance dataExternal environmental information->And weather conditions->Proper outlier processing is carried out before analysis and modeling, so that the quality of data and the accuracy of a model are improved.

S25 forIs +.>Calculate its minimum +.>Maximum->；

S26, applying Min-Max standardization to each data point：

；

Wherein,is the data dimension +.>Normalized data points in the middle, the normalized data set after processing is defined as，For training of self-evolving learning models.

According to the method, through the step S25-the step S26, the multi-dimensional data collected by the transformer substation is ensured to keep the data quality before the subsequent self-evolution learning model training, and meanwhile, the data privacy is effectively protected.

In this embodiment, S4 specifically includes:

S42, each data point：

；

Wherein,represents->Data points after desensitization in dimensions;

s43, finally generating a temperaturePressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->Desensitization dataset->The method and the device keep the integral statistical property of the data while protecting the privacy of the data, and are suitable for training of a self-evolution learning model.

The present embodiment considersThe characteristics and the importance of each data dimension ensure that effective learning and prediction are performed, and meanwhile, possible privacy leakage is prevented, so that the method is particularly suitable for transformer substation data containing sensitive information, the privacy protection of the key data of the transformer substation in the self-evolution learning process is ensured, meanwhile, the effectiveness and the reliability of the data are maintained, and a solid data base is provided for the efficient operation and fault prediction of transformer substation equipment.

in the present embodiment, S5 specifically includes, forAn autoregressive integrated moving average model for analyzing and predicting the operating state of the substation equipment is constructed, and trend, seasonal and random components are extracted from the time series data:

；

wherein,represents time t>Predictive value of type data->、And->Constant term, autoregressive coefficient and moving average coefficient of model respectively, < >>、And->The long-term trend, periodic variation of seasonal components, and irregular fluctuation of random components, which cooperate as trend components of the revealed data, reflect contributions of different data types, such as environmental information and historical maintenance records, to model predictions.

ModelThe self-evolution learning model of the transformer substation can process and analyze different types of data, such as physical measured values of temperature, pressure, current and the like, operation records, historical maintenance data and other types of data, simultaneously can help predict and understand behaviors of transformer substation equipment under different conditions, such as temperature rise or current fluctuation can indicate impending equipment faults, and maintenance records and environmental factors, such as weather changes, can influence the performance and maintenance requirements of the equipment, so that the self-evolution learning model can accurately predict future states according to the historical and current data, deep data driving insight is provided for the transformer substation, preventive measures or optimized maintenance plans can be adopted in advance by the transformer substation, unexpected shutdown and fault risks are reduced, and the reliability and efficiency of the whole system are improved.

in this embodiment, S6 specifically includes:

；

s62, each internal node of the decision tree represents a test condition:

；

s63, each branch of the decision tree represents the result of the test condition and points to a further node or a final leaf node if、And->Something exceedingA threshold value, the path points to the fault class, otherwise, the path points to the normal class;

According to the method, new operation data and user feedback are continuously received, and the model periodically updates the structure and the splitting criterion of the decision tree to adapt to the change of the operation environment of the transformer substation, so that the accuracy and the efficiency of fault prediction are improved.

in this embodiment, S8 specifically includes:

；

S84, according to the decision tree modelCorrespondingly updating the structure and the condition of the device to better reflect the latest fault prediction and the device state;

according to the method, the model can be self-adjusted and optimized according to the latest operation data and time sequence characteristics of the transformer substation through the steps S81-S84, the self-evolution learning model of the transformer substation can be used for learning and predicting by effectively utilizing the latest time sequence coherent data, and the accuracy and efficiency of monitoring and fault prediction of transformer substation equipment are improved. By associating each data point with its corresponding timestamp, the model is able to more accurately track and analyze changes in device status over time, thereby more effectively predicting potential faults and guiding maintenance decisions.

。

According to the method, the continuous updating mechanism of the step S85 and the step S86 can adapt to the dynamic changes of the environment and the equipment performance of the transformer substation, real-time data can be processed and analyzed, the prediction mechanism of the transformer substation can be dynamically adjusted according to the latest data trend, the transformer substation can quickly respond to the changes of the equipment state, the maintenance decision is optimized, the running efficiency and the reliability of the whole power grid are finally improved, faults can be prevented better by continuously updating key features, the downtime is reduced, and therefore the running efficiency and the service life of the equipment are improved.

in this embodiment, S9 specifically includes:

；/>

；

wherein,is data point +.>Middle temperature->Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->The amount of change at successive time points, +.>Is presetUpdating a threshold value;

s93, the update operation includes splitting and pruning of nodes to accommodate new data patterns, for example, if the amount of change in temperature exceeds a threshold, which may mean that the device temperature is abnormally rising, the decision tree needs to be adjusted to better predict potential failures.

Each update of the decision tree also considers the latest key features of the time series model, ensures that the decision tree structure and conditions reflect the latest running state of the device, and in the update process, the model follows a differential privacy technology to ensure the privacy of data processing and model update.

The self-evolution learning model of the transformer substation can adapt to data changes in real time, effectively predicts and deals with equipment faults, for example, a decision tree node updated according to current and temperature data can accurately predict the possibility of faults, and a decision tree node updated according to weather conditions and external environment information can accurately predict equipment fault risks caused by extreme weather.

Through continuous updating and optimizing of the model, the transformer substation can respond to the change of the state of the equipment in time, and equipment faults and unexpected shutdown are reduced.

in this embodiment, S10 specifically includes:

s102, calculating accuracy and data variationAnd (3) associating:

；

s103, the recall rate is related to updating of the decision tree model:

；

wherein each ofThe calculation is as follows:

；

The method ensures that the self-evolution learning model of the transformer substation can accurately reflect the latest equipment state and running conditions, and the transformer substation can timely discover and correct the defects in the model and keep the prediction capability and the actual running environment synchronous by periodically calculating and monitoring the key performance indexes.

in this embodiment, S11 specifically includes:

；

wherein,accuracy, recall and F1 score;

when the accuracy drops significantly, this indicates an overall performance degradation of the model in distinguishing between positive and negative classes (i.e., fault and non-fault conditions). In this case, the following needs to be reviewed:

data preprocessing and feature selection: checking the data collection and integration steps, ensuring that the data is accurate, the outliers are processed correctly, and the features are properly selected and standardized.

Training process of model: the model build process is reinitialized, including whether the training dataset needs to be adjusted, the model parameters changed, or a different algorithm tried.

A significant drop in F1 score indicates that the model has a problem in the balance of accuracy and recall. This typically involves the ability of the model to identify a minority class (e.g., a fault condition). In this case, a re-inspection is required:

constructing and optimizing a decision tree model: in particular, the splitting criteria and pruning strategy of the decision tree model, so as to ensure that the model not only can identify faults, but also can correctly distinguish non-fault conditions.

Unbalanced data processing: considering whether the distribution of the classes in the dataset is balanced, resampling is taken or class-specific weights are used as necessary to adjust the model's attention to the different classes.

The transformer substation can continuously optimize the prediction model, keep the consistency with the state of actual equipment, and the continuous performance evaluation and optimization strategy based on the data is beneficial to maintaining the long-term stable operation of the transformer substation.

In this embodiment, step 12 specifically includes:

s123, modifying the decision threshold for the model based on the collected feedback and performance dataOr introducing new fine adjustment of data characteristics, the specific adjustment formula is expressed as an adjusted performance index Coefficients associated with user feedback +.>Is the product of:

；

wherein,based on the quantitative score of the user feedback, representing the improvement or decrease of the model prediction accuracy or practicality, +.>Is an adjustment coefficient for controlling the degree of influence of user feedback on the model performance adjustment;

s124, the effect of model adjustment is evaluated by recalculating the accuracy, the recall rate and the F1 score, so that the transformer substation can flexibly adjust the model thereof to adapt to the continuously-changing running conditions and the user demands, and better performance and usability are achieved in practical application.

According to the method and the system, the prediction model of the transformer substation can be ensured to continuously adapt to the actual operation requirement through the model adjustment strategy based on the user feedback and the performance data, and a dynamic and interactive mode is provided for the transformer substation to optimize the prediction capability of the transformer substation.

Examples:

in summer 2023 (6 months to 8 months), a self-evolving substation equipment learning method is adopted to improve the efficiency and accuracy of equipment monitoring in a certain large-scale substation located in the eastern China. The main challenges faced by this substation are equipment aging and environmental factor changes (often facing weather conditions with large high temperature and humidity changes), which lead to an increase in equipment failure rate and an increase in prediction difficulty.

To address these issues, workers implement the following strategies:

the staff installs the key parameter such as high accuracy sensor real-time supervision temperature, pressure, electric current to data collection and integration have been carried out in three months:

month of month	Data volume	Data type
			6 months of	0.91TB	Temperature, pressure, current, operational records, historical maintenance data, environmental information, weather conditions
7 months of	1.63TB	Temperature, pressure, current, operational records, historical maintenance data, environmental information, weather conditions
			8 months of	0.92TB	Temperature, pressure, current, operational records, historical maintenance data, environmental information, weather conditions

Table 1 data collection and integration

As can be seen from table 1 above, over three months, approximately 2.46TB of data was collected, including equipment operational data, maintenance history, and weather-related environmental data. The time series data of temperature and current were analyzed using an autoregressive integrated moving average model, and some periodic wave patterns were found. Through the decision tree model, workers can predict potential equipment faults according to historical fault data and real-time parameters.

The staff is provided with an automatic data acquisition system, the latest operation data is collected every one hour, and the model is continuously updated through an online learning algorithm. Throughout the summer, the model underwent multiple iterations and optimizations to accommodate environmental changes and fluctuations in device status, while weekly evaluations were made of model accuracy, recall, and F1 score:

TABLE 2 Performance monitoring and optimization before and after use

Performance index	6 months old (reference)	7 middle ten days of month (after improvement)	8 months bottom (end)
				Accuracy rate (ACC)	85.2%	95.12%	98.34%
Recall (Recall)	88.33%	993.32%	95.89%
				F1 Score of	86.16%	94.65%	95.92%

As can be seen from the above Table 2, in the middle of 7 months, the accuracy rate is improved to 95.12%, the recall rate is 93.32%, and the F1 score reaches 94.65%. The accuracy is improved to 98.34% at the end of 8 months, the recall rate is 95.89%, the F1 score reaches 95.92%, and by comparing the performance indexes with the reference data at the beginning of 6 months, the performance of the model is obviously improved by workers.

Meanwhile, staff periodically collect feedback from substation operators, and particularly, the model prediction accuracy and the practicability are evaluated. At the end of 7 months, a series of valuable advice and observations were collected through a detailed user feedback meeting. Based on these feedback, the staff adjusts some parameters in the decision tree model and adds new data features such as grid load changes over a specific period of time.

In three months of implementation, the model successfully predicts the failure of one transformer on a high-temperature and high-humidity weekend (on the day of 7 months), and the potential failure of the cooling system of the primary transformer at the beginning of 8 months. This prediction enables the maintenance team to take action in advance, avoiding long downtime and potential loss. Compared with 6 months, the equipment failure rate of 8 months is reduced by about 71%, and the effect of model optimization is obviously demonstrated. This reduction not only means less maintenance costs, but also means a significant improvement in the stability and reliability of the overall operation. And meanwhile, according to the model after the user feedback adjustment, higher robustness is shown when data under extreme weather conditions are processed. In a thunderstorm weather of 8 months, the model successfully predicts and avoids multiple equipment failures caused by voltage fluctuations.

By implementing the self-evolving substation equipment learning method, the substation achieves remarkable results in high-efficiency fault prediction, equipment maintenance and environmental change coping. The integration of the user feedback enables the model to be more fit with actual operation requirements, and the practicality of the model and the satisfaction degree of the operation and maintenance team are improved. In general, the method has obvious advantages in the aspects of improving the prediction accuracy, reducing the failure rate, reducing the maintenance cost, improving the operation and maintenance efficiency and the like, and provides powerful technical support for the development of the intelligent power grid.

The foregoing is only a preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art, who is within the scope of the present invention, should make equivalent substitutions or modifications according to the technical scheme of the present invention and the inventive concept thereof, and should be covered by the scope of the present invention.

Claims

1. The self-evolving substation equipment learning method is characterized by comprising the following steps of:

s1, makingCollecting temperature from a substation with multiple sensorsPressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->And establishes a substation data set +.>；

2. The self-evolving substation equipment learning method according to claim 1, wherein S2 specifically comprises:

s21, regarding transformer substation data setsIn relation to temperature->Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->Data of->Calculate each data +.>Mean->And standard deviation->For each data point->Calculate Z-Score value:

；

S25 forIs +.>Calculate its minimum +.>Maximum->；

S26, applying Min-Max standardization to each data point ：

；

Wherein,is the data dimension +.>Normalized data points in (1), the normalized data set after processing is defined as +.>。

3. The self-evolving substation equipment learning method according to claim 2, wherein S4 specifically comprises:

s41, data setIs +.>Definition of +.>Pressure->Current->Record of operations->Historical maintenance data->External environmental information->And weather conditions->Privacy budget for each data type of (2)>And calculates the corresponding data type +.>Sensitivity reflecting maximum range of variation of substation operation data in differential privacy processing>；

S42, each data point：

；

Wherein,represents->Data points after desensitization in dimensions;

s43, finally generating a temperaturePressure->Current->Record of operations->Historical maintenance data->External environment informationAnd weather conditions->Desensitization dataset->。

4. A self-evolving substation equipment learning method according to claim 3, characterized in that S5 comprises in particular forAn autoregressive integrated moving average model for analyzing and predicting the operating state of the substation equipment is constructed, and trend, seasonal and random components are extracted from the time series data:

；

5. The self-evolving substation equipment learning method according to claim 4, wherein S6 specifically comprises:

；

s62, each internal node of the decision tree represents a test condition:

；

s63, each branch of the decision tree represents the result of the test condition and points to a further node or a final leaf node if 、And->If a certain item exceeds a threshold value, the path points to a fault class, otherwise, the path points to a normal class;

6. The self-evolving substation equipment learning method according to claim 5, wherein S8 specifically comprises:

；

。

7. The self-evolving substation equipment learning method according to claim 1, wherein S9 specifically comprises:

；

8. The self-evolving substation equipment learning method according to claim 7, wherein S10 specifically comprises:

S102, calculating accuracy and data variationAnd (3) associating:

；

s103, the recall rate is related to updating of the decision tree model:

；

wherein each ofThe calculation is as follows:

；

9. The self-evolving substation equipment learning method according to claim 8, wherein S11 specifically comprises:

；

Wherein,accuracy, recall and F1 score;

10. The self-evolving substation equipment learning method according to claim 9, wherein the step 12 specifically includes:

s123, based on the collected feedback and performance data, performing fine adjustment for modifying decision threshold or introducing new data features on the model, wherein a specific adjustment formula is expressed as an adjusted performance indexAssociated with user feedbackCoefficient of->Is the product of:

；