CN117371553A

CN117371553A - Machine learning model iteration method and system of intelligent operation and maintenance system

Info

Publication number: CN117371553A
Application number: CN202311328158.7A
Authority: CN
Inventors: 章海兵; 汪中原; 李林; 刁节武; 张艺博; 卞磊
Original assignee: Nanchang Keneng Urban Rail Technology Co ltd; Hefei Technological University Intelligent Robot Technology Co ltd; CSG Smart Science and Technology Co Ltd
Current assignee: Nanchang Keneng Urban Rail Technology Co ltd; Hefei Technological University Intelligent Robot Technology Co ltd; CSG Smart Science and Technology Co Ltd
Priority date: 2023-10-13
Filing date: 2023-10-13
Publication date: 2024-01-09

Abstract

The invention relates to a machine learning model iteration method and system for an intelligent operation and maintenance system. The method comprises two stages, offline modeling followed by online model inference and iterative updating, and the system comprises an offline machine learning system, an edge computing device set and the intelligent operation and maintenance system. The offline machine learning system provides sample labeling, data management, model training, model management, model verification and model release; the edge computing device set comprises robots, environment monitoring sensors and equipment monitoring sensors. The offline machine learning system performs offline modeling and provides the trained models to the intelligent operation and maintenance system, which acquires perception data from the edge devices in real time, performs inference using the models constructed offline and the models iterated autonomously online, and displays the results in the intelligent display system. By combining short-term manual assistance with long-term autonomous model iteration and evaluation, the invention realizes a new machine learning model iteration method that can greatly reduce manual effort and continuously improve the accuracy of the system.

Description

Machine learning model iteration method and system of intelligent operation and maintenance system

Technical Field

The invention relates to the technical field of rail transit intelligent operation and maintenance, in particular to a machine learning model iteration method and system of an intelligent operation and maintenance system.

Background

With the rapid development of the rail transit industry, the operation and maintenance workload of subway companies is becoming increasingly heavy. In recent years, subway companies have focused on improving the safety, reliability and maintainability of rail transit power supply equipment, promoted efficient coordination of operation and maintenance management, pursued cost reduction and efficiency gains, and carried out a series of construction and application projects. For example, intelligent equipment such as intelligent inspection robots and online monitoring devices has been deployed, intelligent operation and maintenance systems have been built, and the digital transformation and intelligent upgrading of operation and maintenance have been promoted.

The intelligent operation and maintenance system can address the following problems. (1) Reducing the workload of operation and maintenance staff. For example, the inspection robot can be remotely controlled and can carry out inspections at any time, its inspection content covers everything in a manual inspection, and the workload of operation and maintenance personnel is greatly reduced. (2) Improving operation and maintenance efficiency. By deploying intelligent robots, online monitoring and the like, and using technologies such as artificial intelligence, big data and the Internet of Things, equipment conditions are monitored in real time and fault causes are analyzed immediately. (3) Ensuring the operational safety of the subway. The system comprehensively covers the key parameter indexes of the equipment and uses technologies such as artificial intelligence, knowledge graphs and expert libraries to discover potential equipment hazards in advance and to issue alarms and early warnings proactively. (4) Improving control over equipment assets. The system establishes a full life cycle file for each piece of equipment, including basic data, inspection data, test data and maintenance data, and supports information maintenance, operation analysis and health management of the equipment. (5) Improving emergency handling capability. The power supply intelligent operation and maintenance system establishes a closed-loop mechanism of prediction before an event, handling during the event and review after the event. With online monitoring sensors and intelligent analysis functions, the location and cause of a fault can be quickly determined and personnel dispatched to the site immediately.

As can be seen from the above, the core of the intelligent operation and maintenance system is the set of machine learning models that perform intelligent diagnosis and defect identification. In the rail transit industry, intelligent operation and maintenance systems cover power supply, vehicles, track sections, stations and so on. Because the sensing layer comprises robots and various sensors, these systems involve the collection of 2D images, 3D point clouds, voiceprints, partial discharge, lightning arrester data, currents, voltages, oil chromatography, temperatures and the like, together with multiple corresponding machine learning models for diagnosis and identification. Furthermore, if daily data is used for continuous learning and iteration, and in particular if correctly identified defects are added to the models for continued iterative learning, the models can learn more of the defect feature distribution space, their inference accuracy can be continuously improved, and the goal of replacing manual work can be achieved. However, most current intelligent operation and maintenance systems cannot be effectively iterated once the machine learning models have been integrated and formally delivered, even when the inference accuracy of the models is unsatisfactory. The main reasons are: 1. the system has no function for continued model iteration; 2. the system has an autonomous iteration function, but because actual defects are rare, the initial accuracy of the models is low and the autonomously identified categories cannot be fed into iterative learning, so the function cannot be applied in intelligent operation and maintenance systems of the rail transit industry.

Disclosure of Invention

The machine learning model iteration method and system of the intelligent operation and maintenance system provided by the invention can be applied to the intelligent operation and maintenance system facing rail traffic, effectively manage the full life cycle of the machine learning model, continuously improve the reasoning accuracy of the model and truly realize intelligent operation and maintenance.

In order to achieve the above purpose, the present invention adopts the following technical scheme:

The machine learning model iteration method of the intelligent operation and maintenance system first performs offline modeling and then performs online model inference and iterative updating. It is based on an offline machine learning system, an edge computing device set and the intelligent operation and maintenance system. The offline machine learning system provides functions such as sample labeling, data management, model training, model management, model verification and model release; the edge computing device set comprises edge devices such as robots, environment monitoring sensors and equipment monitoring sensors; and the intelligent operation and maintenance system comprises an online machine learning system and an intelligent display system. The offline machine learning system performs offline modeling and provides the trained models to the intelligent operation and maintenance system, which acquires perception data from the edge devices in real time, performs inference using the models constructed offline and the models iterated online, and displays the results in the intelligent display system.

Further, the offline modeling stage includes the following steps:

Step one: collect the data acquired by the field robots and sensors and preprocess it, including data cleaning and data format unification. For image and point cloud data, remove duplicate, overexposed, underexposed and blurred samples as well as samples of non-target areas; for numeric and character data, unify the list format and remove abnormal data; for audio data, remove abnormal data, unify the specification and compress the data.

Step two: define the labels and corresponding categories, and manually annotate the cleaned data. For image and point cloud data, rectangular boxes or contour annotations are drawn on the images and point clouds, and text label information is generated; for numeric and character data, a column of label information is added to the original format; for audio data, the files are sorted into different directories or given text label information, and the category labels are read by software. Categories with few defect samples can be expanded with a data augmentation algorithm.

Step three: using the prepared data, train with different machine learning algorithms according to the data types and their identification requirements, constructing different machine learning models that together form a machine learning model set. For image and point cloud data, models such as target detection and defect detection are constructed; for numeric and character data, models such as regression and classification; for audio data, models such as anomaly detection and fault diagnosis.

Step four: verify the accuracy of the machine learning model set on a validation set. If the accuracy is greater than S1 and the false detection rate is less than W1, release the models for online inference; otherwise, repeat steps one to three until the accuracy meets the requirement.
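The image cleaning described in step one can be illustrated with a minimal sketch. It covers only exact-duplicate removal and a crude exposure check; blur and non-target filtering are left out. The function names, the grayscale list-of-rows image representation and the brightness thresholds `low` and `high` are all assumptions made for illustration and do not come from the source.

```python
import hashlib
from statistics import mean


def is_badly_exposed(pixels, low=30, high=225):
    """Flag a grayscale image (rows of 0-255 ints) whose mean brightness is extreme.

    The thresholds are illustrative assumptions, not values from the source.
    """
    brightness = mean(v for row in pixels for v in row)
    return brightness < low or brightness > high


def clean_images(images, low=30, high=225):
    """Drop exact duplicates and over/underexposed images, keeping first occurrences."""
    seen, kept = set(), []
    for pixels in images:
        digest = hashlib.sha256(repr(pixels).encode()).hexdigest()
        if digest in seen or is_badly_exposed(pixels, low, high):
            continue
        seen.add(digest)
        kept.append(pixels)
    return kept
```

In practice the thresholds would have to be tuned per camera and scene, and near-duplicate detection would need something more tolerant than an exact hash.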

Further, the online inference and model iteration stage is as follows.

Because defects are rare across the various scenes of the rail transit industry, the recognition accuracy of the initially trained models is not high, and there are many false detections and missed detections. Model iteration is therefore divided into two stages: manually assisted labeling with model iteration, and autonomous algorithm labeling with model iteration. The main steps are as follows:

Step one: deploy the offline-trained machine learning model set R to the intelligent operation and maintenance system. Run the intelligent operation and maintenance system, collect data from the robots or sensors, and automatically preprocess the data according to prior knowledge.

Step two: input the processed data into the corresponding machine learning models, each of which performs inference to obtain its result. The interface of the intelligent operation and maintenance system presents all inference results so that they can be viewed and listened to; the presentation differs by data type and includes image and point cloud annotations, data distributions and audio playback.

Step three: switch to the manual annotation page of the intelligent operation and maintenance system and import the inference results and corresponding data. Image and point cloud data are judged by visually checking the targets boxed or marked on the images and point clouds; numeric and character data are judged according to prior knowledge of the data distribution of the abnormal categories; audio data are judged according to prior knowledge of the sound and by listening. Manually check each inference result and its corresponding data; if the accuracy is less than S2, execute step four; otherwise, go to step six.

Step four: perform the manually assisted labeling operation. 1. Where the inference result is correct: for image and point cloud data, fine-tune the autonomously identified rectangular boxes and contours whose deviation exceeds a threshold, and otherwise retain the autonomously identified annotation; for numeric, character or audio data, the inference result has no box-selection deviation of the kind found in images and point clouds, so the autonomously identified label is retained. 2. Where a target is missed: for image and point cloud data, add the annotation box or contour of the target; for character or audio data, change the normal label to the label of the corresponding fault category. 3. Where a detection is false: for image and point cloud data, delete the false annotation box or contour; for character or audio data, change the erroneous category to the normal label. The corrected data are merged into an iterative training set X.

step five, the training system calls the latest machine learning model set, incremental iterative training is carried out on the model by utilizing an iterative training set X of manual auxiliary labeling, various indexes such as Loss, accuracy, false detection rate and the like are monitored in the training process, training is stopped after the various indexes meet the set rules, the model is saved and updated, and the step two is returned;

Step six: enter the stage of autonomous algorithm labeling and autonomous model iteration. The data collected online is processed as in step one and input into the machine learning model set for inference, and the results are output as in step two; each model autonomously identifies targets and completes the annotation, and the annotation results are kept together with the original data. The interface of the intelligent operation and maintenance system presents all inference results so that they can be viewed and listened to, and the presentation differs by data type.

Step seven: the training system calls the latest machine learning model set and performs incremental iterative training on each model using the data labeled autonomously by the algorithm. The indexes of the training process are monitored; in particular, when the accuracy on the validation set is not lower than the accuracy before the iteration and the false detection rate is not higher than the false detection rate before the iteration, the current autonomous iteration is considered qualified and the iterated machine learning model is adopted as the update; otherwise, the machine learning model from before the iteration is still used.

and step eight, continuing the step six to the step seven until the life cycle of the intelligent operation and maintenance system is ended.
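The two decisions that drive the online stage, the routing in step three and the acceptance rule in step seven, can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `Metrics`, `next_step` and `accept_iteration` are hypothetical, and the step-seven condition is read here as comparing the new false detection rate with the false detection rate before the iteration.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    accuracy: float
    false_detection_rate: float


def next_step(reviewed_accuracy: float, s2: float) -> str:
    """Step three routing: manual assistance below S2, otherwise autonomous iteration."""
    if reviewed_accuracy < s2:
        return "manual_assisted_labeling"  # step four
    return "autonomous_labeling"  # step six


def accept_iteration(before: Metrics, after: Metrics) -> bool:
    """Step seven rule: adopt the iterated model only if it is no worse on the validation set."""
    return (
        after.accuracy >= before.accuracy
        and after.false_detection_rate <= before.false_detection_rate
    )
```

Keeping the previous model whenever either index regresses means an autonomous iteration can never lower the measured validation performance, which is what makes unattended long-term iteration tolerable.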

In yet another aspect, the invention also discloses a computer readable storage medium storing a computer program which, when executed by a processor, causes the processor to perform the steps of the method as described above.

In yet another aspect, the invention also discloses a computer device comprising a memory and a processor, the memory storing a computer program which, when executed by the processor, causes the processor to perform the steps of the method as above.

According to the above technical scheme, the machine learning model iteration method and system of the intelligent operation and maintenance system manage the full life cycle of the machine learning models and continuously improve their inference accuracy through online iteration.

Compared with the prior art, the invention has the beneficial effects that:

Existing intelligent operation and maintenance systems generally fall into two types, and the invention addresses both:

1. For systems that have no machine learning model iteration function, the technical scheme provides an autonomous iteration scheme and system, injecting self-learning capability into the intelligent operation and maintenance system.

2. For systems whose autonomous iteration cannot be applied effectively, the method takes the characteristics of the rail transit industry into account and combines short-term manual assistance with long-term autonomous model iteration and evaluation to realize a new machine learning model iteration method, which can greatly reduce manual effort and continuously improve the accuracy of the system.

Drawings

FIG. 1 is a flow chart of the main steps of the present invention;

FIG. 2 is a flowchart showing the steps in detail in accordance with the present invention;

FIG. 3 is a schematic diagram of a system framework of the present invention.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention.

As shown in FIG. 1 and FIG. 2, the machine learning model iteration method of the intelligent operation and maintenance system according to this embodiment is divided into two stages: the first stage is offline modeling, and the second stage is online model inference and iterative updating.

The method of this embodiment is implemented under the system framework shown in FIG. 3, which comprises an offline machine learning system (with functions such as sample labeling, data management, model training, model management, model verification and model release), an edge computing device set (comprising edge devices such as robots, environment monitoring sensors and equipment monitoring sensors), and an intelligent operation and maintenance system (comprising an online machine learning system and an intelligent display system). The offline machine learning system performs offline modeling and provides the trained models to the intelligent operation and maintenance system, which collects perception data from the edge devices in real time, performs inference using the models constructed offline and the models iterated online, and displays the results in the intelligent display system. The specific implementation steps are divided into two parts:

Part one: offline modeling stage

Step one, collecting data collected by a field robot (such as a train intelligent inspection robot, a power distribution room station inspection robot, a track area inspection robot and the like), and a sensor (such as a main transformer on-line monitoring sensor, a GIS on-line monitoring sensor, a high-voltage cable on-line monitoring sensor, a train track side comprehensive detection system sensor and the like), preprocessing the data, including data cleaning, data format unification and the like. For the image and the point cloud data types, eliminating repeated, overexposed, underexposed, blurred, non-target areas and other data; for the categories of the digital data and the character data, unifying list formats, and eliminating abnormal data; for the audio data category, eliminating abnormal data, unifying specification, data compression and the like;

Step two: define the labels and corresponding categories, and manually annotate the cleaned data (open-source or self-developed annotation software may be used). For image and point cloud data, rectangular boxes or contour annotations are used and text label information is generated; for character data, a column of label information is added to the original format; for audio data, the files are sorted into different directories or given text label information, and the category labels are read by software. Categories with few defect samples can be expanded with a data augmentation algorithm.

Step three: using 70% of the prepared data, train with different machine learning algorithms according to the data types and their identification requirements, constructing different machine learning models that together form a machine learning model set.

1) For image and point cloud data, models such as target detection and defect detection are constructed. In this example, a deep learning algorithm based on instance segmentation is used to train a bolt loosening detection model on the train component images acquired by the robot. The loss function is defined in terms of the following quantities (the formula itself is not reproduced in this text):

the Euclidean distance between the two samples of a pair; Y, the label indicating whether the two samples match; N, the number of samples; m, the distance threshold; and the parameter matrix of the model.
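The quantities listed above (a pairwise Euclidean distance, a match label Y, a sample count N, a distance threshold m and a parameter matrix) correspond to the widely used contrastive loss for sample pairs. As a hedged reconstruction, assuming the omitted expression takes that standard form, it would read as below. The symbols $D_W$, $X_1$, $X_2$, $G_W$ and the convention that $Y = 1$ marks a matching pair are assumptions introduced here, not taken from the source.

```latex
L(W) = \frac{1}{2N} \sum_{n=1}^{N}
       \Big[ Y_n \, D_{W,n}^{2} + (1 - Y_n) \, \max\big(m - D_{W,n},\, 0\big)^{2} \Big],
\qquad
D_{W} = \big\lVert G_W(X_1) - G_W(X_2) \big\rVert_2
```

Under this form, matching pairs are pulled together, while non-matching pairs contribute to the loss only when they are closer than the threshold m.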

2) For the character data category, models such as regression and classification are constructed. In this embodiment, a classification model is obtained by training a deep-learning-based classification algorithm. During data generation, a standard Gaussian distribution is used to generate abnormal gradient values, which are added to the gradients computed by the model to strengthen its one-class discrimination capability.

The calculation mode of the abnormal gradient value generated by Gaussian distribution is shown as follows:

where x is the raw data, σ is the standard deviation (here taken as 1), and μ is the mean (here taken as 0).
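The symbols x, σ and μ are those of the Gaussian probability density. Assuming the omitted expression is the standard density, which the source does not confirm, the generated value would be computed as follows; the name h(x) is chosen here only for illustration.

```latex
h(x) = \frac{1}{\sigma \sqrt{2\pi}} \exp\!\left( -\frac{(x - \mu)^{2}}{2\sigma^{2}} \right),
\qquad
\sigma = 1,\ \mu = 0 \;\Rightarrow\; h(x) = \frac{1}{\sqrt{2\pi}} \, e^{-x^{2}/2}
```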

The algorithm adds the learning process of the abnormal gradient value in the training, and can be expressed as the following formula:

where θ denotes the model parameters, x the original value, h the generated gradient, l the loss function, and η a parameter (taking a value between 0 and 1) that controls the range of the generated data.

3) And constructing models such as anomaly detection, fault diagnosis and the like for the audio data types. In this example, the fault diagnosis algorithm performs feature learning by using a deep CNN network, and the loss function is represented by the following formula:

where N is the total number of samples, d is the Euclidean distance between the samples of a pair (entering the loss as d²), and y ∈ {0, 1}: y = 0 means the two samples do not belong to the same class, and y = 1 means they belong to the same class. Thus, the larger the distance between samples of different classes, the smaller the loss, and the smaller the distance between samples of the same class, the smaller the loss. This achieves the goal of reducing the intra-class distance while expanding the inter-class distance to at least margin.
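The behavior described above matches the standard contrastive loss over sample pairs. The following minimal sketch assumes that standard form; the function name, the pair representation and the default margin are illustrative choices, not details from the source.

```python
import math


def contrastive_loss(pairs, margin=1.0):
    """Average contrastive loss over (embedding_a, embedding_b, y) pairs.

    y = 1 marks a same-class pair, y = 0 a different-class pair.
    Assumes the standard form: (1 / 2N) * sum(y*d^2 + (1-y)*max(margin-d, 0)^2).
    """
    total, n = 0.0, 0
    for a, b, y in pairs:
        d = math.dist(a, b)  # Euclidean distance between the two embeddings
        total += y * d ** 2 + (1 - y) * max(margin - d, 0.0) ** 2
        n += 1
    if n == 0:
        raise ValueError("at least one pair is required")
    return total / (2 * n)
```

Same-class pairs are penalized by their squared distance, and different-class pairs are penalized only while they are closer than the margin, which is the intra-class and inter-class effect the text describes.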

Step four, adopting 30% of data as a verification set to verify the accuracy of the machine learning model set, and if the accuracy is more than 90% and the false detection rate is less than 5%, issuing a model for online reasoning; otherwise, repeating the first to third steps until the accuracy meets the requirement.
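The 70/30 split and the release check of step four can be sketched as follows. The source does not define how the false detection rate is computed; this sketch assumes it is the share of truly normal samples that the model flags as abnormal. All function names and the `"normal"` label string are hypothetical.

```python
import random


def split_70_30(samples, seed=0):
    """Shuffle deterministically and split into 70% training and 30% validation."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 7 // 10
    return shuffled[:cut], shuffled[cut:]


def evaluate(labels, predictions, normal="normal"):
    """Return (accuracy, false detection rate) for parallel label/prediction lists.

    False detection rate is assumed to be: normal samples predicted as
    abnormal, divided by the number of normal samples.
    """
    correct = sum(l == p for l, p in zip(labels, predictions))
    on_normals = [p for l, p in zip(labels, predictions) if l == normal]
    false_detections = sum(p != normal for p in on_normals)
    accuracy = correct / len(labels)
    fdr = false_detections / len(on_normals) if on_normals else 0.0
    return accuracy, fdr


def ready_for_release(accuracy, fdr, min_accuracy=0.90, max_fdr=0.05):
    """Step four gate: accuracy above 90% and false detection rate below 5%."""
    return accuracy > min_accuracy and fdr < max_fdr
```

Both comparisons are strict, following the wording "more than 90%" and "less than 5%".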

Part two: online inference and model iteration stage

Step one: deploy the offline-trained machine learning model set R to the online machine learning system in the intelligent operation and maintenance system. The online machine learning system may be a software library invoked by the intelligent display system, or a standalone binary that communicates with the intelligent display system. Run the intelligent operation and maintenance system, collect data from the robots, sensors and the like, and automatically preprocess the data according to prior knowledge, including data normalization and automatic removal of abnormal data.

Step two: input the processed data into the corresponding machine learning models, each of which performs inference to obtain its result. The interface of the intelligent operation and maintenance system presents all inference results so that they can be viewed and listened to; the presentation differs by data type and includes image and point cloud annotations, data distributions and audio playback.

In this example, bolt loosening of train components is identified from the images collected by the robot in two steps: registration and model inference.

In the registration stage, the feature matching relationship between every two feature points consists of two parts, a similarity score and a matchability score (the formulas themselves are not reproduced in this text).

Similarity score: defined for each pair consisting of a pixel point i of the image A to be identified and a pixel point j of the template image B; it measures the similarity between the two points.

Matchability score: defined for each pixel point; it is the matching score corresponding to that pixel point.

Combining the similarity score with the matchability score gives the local matching matrix. Its terms are: the similarity score of pixel point k in A and pixel point j in B; the similarity score of pixel point k in B and pixel point i in A; the matchability score of pixel point i of A; the matchability score of pixel point j of B; and the resulting matching value of the candidate pair. When this matching value is greater than the threshold t and greater than the other values in its row and column, that is, when the similarity of the two points is higher than that of any other point in the two images, the two points are in a matching relationship. After the matching relationships of all points are determined, the transformation between the two images can be determined, and the image to be identified is then transformed into the frame of the template image to obtain an image Ix.
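The selection rule stated above (a matching value greater than the threshold t and greater than the other values in its row and column) can be sketched as follows. The sketch takes the local matching matrix as given and does not reproduce how the similarity and matchability scores are combined; the function name `select_matches` and the nested-list matrix representation are assumptions.

```python
def select_matches(p, t):
    """Return (i, j) pairs whose value exceeds t and is the strict maximum of row i and column j.

    p is the local matching matrix as a list of rows: rows index points of
    image A, columns index points of template image B.
    """
    matches = []
    for i, row in enumerate(p):
        for j, score in enumerate(row):
            if score <= t:
                continue
            row_ok = all(score > v for jj, v in enumerate(row) if jj != j)
            col_ok = all(score > p[ii][j] for ii in range(len(p)) if ii != i)
            if row_ok and col_ok:
                matches.append((i, j))
    return matches
```

Requiring the maximum along both the row and the column makes every match mutual, so no point of either image can be assigned to more than one partner.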

In the model reasoning stage, ix is input into an offline trained model for reasoning, and a reasoning result is obtained.

Step three: switch to the manual annotation page of the intelligent operation and maintenance system and import the inference results and corresponding data. Image and point cloud data are judged by visually checking the targets boxed or marked on the images and point clouds; character data are judged according to prior knowledge of the data distribution of the abnormal categories; audio data are judged according to prior knowledge of the sound and by listening. Manually check each inference result and its corresponding data; if the accuracy is less than 95%, execute step four; otherwise, go to step six.

Step four: perform the manually assisted labeling operation. 1. Where the inference result is correct: for image and point cloud data, fine-tune the autonomously identified rectangular boxes and contours whose deviation exceeds a threshold, and otherwise retain the autonomously identified annotation; for character or audio data, the inference result has no box-selection deviation of the kind found in images and point clouds, so the autonomously identified label is retained. 2. Where a target is missed: for image and point cloud data, add the annotation box or contour of the target; for character or audio data, change the normal label to the label of the corresponding fault category. 3. Where a detection is false: for image and point cloud data, delete the false annotation box or contour; for character or audio data, change the erroneous category to the normal label. The corrected data are merged into an iterative training set X.

and step eight, continuing the step six to the step seven until the life cycle of the intelligent operation and maintenance system is finished, wherein the reasoning accuracy of the model can approach or even reach 100%.
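For character or audio data, the three manual correction cases of step four reduce to a relabeling rule. The following minimal sketch shows one way to build the iterative training set X from reviewed samples. The function names, the `"normal"` label string and the tuple layout are hypothetical, and the final branch (a fault predicted as a different fault) is not covered by the source and is an assumption.

```python
def correct_label(predicted, reviewed_truth, normal="normal"):
    """Return (final_label, case) for one manually reviewed sample."""
    if predicted == reviewed_truth:
        return predicted, "correct"  # keep the autonomously identified label
    if predicted == normal:
        return reviewed_truth, "missed_detection"  # normal label -> fault label
    if reviewed_truth == normal:
        return normal, "false_detection"  # erroneous category -> normal label
    # Not described in the source: assume the reviewer's fault class wins.
    return reviewed_truth, "wrong_fault_class"


def build_iterative_training_set(samples):
    """samples: iterable of (data, predicted, reviewed_truth); returns training set X."""
    return [(data, correct_label(pred, truth)[0]) for data, pred, truth in samples]
```

Image and point cloud data would additionally need box or contour edits, which this sketch does not model.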

In summary, the technical scheme provides an autonomous iteration scheme and system, and self-learning capability is injected into an intelligent operation and maintenance system; according to the rail transit industry characteristics, the novel machine model iteration method is realized by combining short-term manual assistance and long-term autonomous model iteration and evaluation, so that the manual investment can be greatly reduced, and the accuracy of a system can be continuously improved.

In yet another embodiment provided herein, there is also provided a computer program product containing instructions that, when run on a computer, cause the computer to perform the machine learning model iteration method of any one of the intelligent operation and maintenance systems of the above embodiments.

It may be understood that the system provided by the embodiment of the present invention corresponds to the method provided by the embodiment of the present invention, and explanation, examples and beneficial effects of the related content may refer to corresponding parts in the above method.

The embodiment of the application also provides an electronic device, which comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory are communicated with each other through the communication bus,

a memory for storing a computer program;

and the processor is used for realizing the machine learning model iteration method of the intelligent operation and maintenance system when executing the program stored in the memory.

The communication bus of the above electronic device may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be classified as an address bus, a data bus, a control bus, or the like.

The communication interface is used for communication between the electronic device and other devices.

The memory may include random access memory (RAM) or non-volatile memory (NVM), such as at least one disk memory. Optionally, the memory may also be at least one storage device located remotely from the aforementioned processor.

The processor may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), etc.; it may also be a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or discrete hardware components.

The above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer instructions are loaded and executed on a computer, the flows or functions according to the embodiments of the present application are produced in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable apparatus. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another, for example by wired means (e.g., coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless means (e.g., infrared, radio, microwave). The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or data center that integrates one or more available media. The available medium may be a magnetic medium (e.g., floppy disk, hard disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., solid state disk (SSD)).

It is noted that relational terms such as first and second are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element.

In this specification, the embodiments are described in a related manner; for identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, since the system embodiments are substantially similar to the method embodiments, they are described relatively briefly; for relevant details, see the description of the method embodiments.

The above embodiments are intended only to illustrate the technical solution of the present invention, not to limit it; although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments can still be modified, or some of their technical features can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims

1. A machine learning model iteration method for an intelligent operation and maintenance system, comprising first performing offline modeling and then performing online inference and iterative updating of the model, characterized in that the method is based on an offline machine learning system, an edge computing device set and the intelligent operation and maintenance system;

the offline machine learning system comprises sample labeling, data management, model training, model management, model verification and model release functions;

the edge computing device set comprises a robot, an environment monitoring sensor and a device monitoring sensor;

the intelligent operation and maintenance system comprises an online machine learning system and an intelligent display system;

the offline machine learning system performs the offline modeling and provides the trained models to the intelligent operation and maintenance system; the intelligent operation and maintenance system acquires perception data from the edge devices in real time, performs inference using the models constructed offline and the models iterated autonomously online, and displays the results in the intelligent display system.

2. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 1, wherein the offline modeling stage comprises the following steps:

S11, collecting the data acquired by the field robots and sensors, and preprocessing the data;

S12, defining labels and corresponding categories, and manually labeling the cleaned data;

S13, training on the prepared data with different machine learning algorithms chosen according to the data types and their recognition requirements, constructing different machine learning models, and forming a machine learning model set;

S14, verifying the accuracy of the machine learning model set on a verification set, and, if the accuracy and the false detection rate meet the set requirements, releasing the models for online inference; otherwise, repeating steps S11 to S13 until the accuracy meets the requirement.
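The loop formed by steps S11 to S14 can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the function name `offline_modeling`, the callables `collect`, `train` and `evaluate`, the two thresholds and the round limit are all hypothetical and do not appear in the claims.

```python
from typing import Callable, Optional, Tuple


def offline_modeling(
    collect: Callable[[], list],
    train: Callable[[list], object],
    evaluate: Callable[[object], Tuple[float, float]],
    min_accuracy: float = 0.95,    # assumed threshold, not given in the claims
    max_false_rate: float = 0.05,  # assumed threshold, not given in the claims
    max_rounds: int = 10,          # assumed safety limit on repetitions
) -> Optional[object]:
    """Repeat S11 to S13 until the S14 check passes; return the released model."""
    dataset: list = []
    for _ in range(max_rounds):
        dataset.extend(collect())                # S11/S12: collect, clean, label
        model = train(dataset)                   # S13: train the model set
        accuracy, false_rate = evaluate(model)   # S14: check on the verification set
        if accuracy >= min_accuracy and false_rate <= max_false_rate:
            return model                         # release for online inference
    return None                                  # requirements never met
```

Returning `None` after `max_rounds` is a design choice of this sketch; the claim itself simply repeats the steps until the requirement is met.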

3. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 1, wherein the online model inference and iterative update stage comprises:

S21, deploying the offline-trained machine learning model set R to the intelligent operation and maintenance system; running the intelligent operation and maintenance system, collecting data from the robots or sensors, and automatically preprocessing the data according to prior knowledge;

S22, inputting each type of processed data into its corresponding machine learning model, each model performing inference to obtain a corresponding result; the interface of the intelligent operation and maintenance system presents all results in visual, audible and inference-result form, the presentation differing by data type and including image and point cloud recognition, data distribution and audio effects;

S23, switching to the manual annotation page of the intelligent operation and maintenance system and importing the inference results and the corresponding data; for image and point cloud data, judging by visually checking the targets boxed or marked on the image and point cloud; for character data, judging according to prior knowledge of the data distribution of the abnormal classes; for audio data, judging according to prior knowledge of the sound and a listening test;

manually checking each inference result and the corresponding data, and executing S24 if the accuracy is less than 95%; otherwise going to S26;

S24, performing the manual auxiliary labeling operation and merging the corrected data into an iterative training set X;

S25, the training system calling the latest machine learning model set and performing incremental iterative training on the models using the manually assisted iterative training set X, monitoring the loss, accuracy and false detection rate during training, terminating training once these indexes satisfy the set rules, saving and updating the models, and returning to step S22;

S26, entering the stage of autonomous algorithm labeling and autonomous model iteration; processing the data collected online according to step S21, inputting it into the machine learning model set for inference, and outputting results in the same way as step S22, including the labeling results autonomously produced by each model together with the original data; the interface of the intelligent operation and maintenance system presents all results in visual, audible and inference-result form, the presentation differing by data type;

S27, the training system calling the latest machine learning model set and performing incremental iterative training on each model using the data autonomously labeled by the algorithm; monitoring the indexes of the training process, and in particular, when the accuracy on the verification set is not lower than the accuracy before iteration and the false detection rate is not higher than the false detection rate before iteration, regarding the current autonomous iteration as qualified and updating to the iterated machine learning model; otherwise, continuing to use the machine learning model from before the iteration;

S28, repeating steps S26 to S27 until the end of the life cycle of the intelligent operation and maintenance system, whereby the inference accuracy of the model can approach or even reach 100%.
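The two decisions that drive this stage, the 95% check that selects between manual assistance and autonomous iteration and the acceptance check of step S27, can be sketched as follows. This is an illustrative sketch only. The names `Metrics`, `next_stage`, `accept_iteration` and `autonomous_step` are hypothetical, as are the `retrain` and `evaluate` callables, and the sketch assumes that the false detection rate after iteration is compared with the false detection rate before iteration.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    accuracy: float
    false_rate: float


MANUAL_THRESHOLD = 0.95  # from S23: below this, manual assistance continues


def next_stage(checked_accuracy: float) -> str:
    """S23 decision: 'S24' (manual auxiliary labeling) or 'S26' (autonomous)."""
    return "S24" if checked_accuracy < MANUAL_THRESHOLD else "S26"


def accept_iteration(before: Metrics, after: Metrics) -> bool:
    """S27 gate: accept only if neither metric got worse (assumed comparison)."""
    return (after.accuracy >= before.accuracy
            and after.false_rate <= before.false_rate)


def autonomous_step(model, metrics, retrain, evaluate):
    """One S26-S27 round: retrain on self-labeled data, then gate the update."""
    candidate = retrain(model)               # hypothetical incremental training call
    candidate_metrics = evaluate(candidate)  # hypothetical verification-set evaluation
    if accept_iteration(metrics, candidate_metrics):
        return candidate, candidate_metrics  # qualified: adopt the iterated model
    return model, metrics                    # otherwise keep the previous model
```

Because a rejected candidate is simply discarded, a round of autonomous labeling that degrades the model leaves the deployed model unchanged.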

4. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 3, wherein the manual auxiliary labeling operation in step S24 specifically comprises:

S241, for the case where the inference result is correct: for image and point cloud data, fine-tuning the autonomously identified rectangular boxes and contours whose deviation is larger than a threshold, and otherwise retaining the autonomously identified labeling results; for character or audio data, the inference result has no deviation of the kind that arises in box selection of image and point cloud targets, so the autonomously identified labeling results are retained;

S242, for the case of missed detection: for image and point cloud data, adding a labeling box or contour for the target; for character or audio data, changing the normal label to the label of the corresponding fault class;

S243, for the case of false detection: for image and point cloud data, deleting the falsely labeled boxes or contours; for character or audio data, changing the erroneous class to the normal label.
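For image data, the three cases S241 to S243 can be illustrated by merging the predicted boxes with the boxes confirmed by a reviewer. The sketch below is an assumption-laden illustration: measuring "deviation" by intersection over union (IoU), the thresholds `keep_iou` and `match_iou`, and the names `iou` and `merge_review` are all hypothetical choices that the claims do not specify.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def merge_review(predicted: List[Box], reviewed: List[Box],
                 keep_iou: float = 0.9, match_iou: float = 0.5) -> List[Box]:
    """Apply S241 to S243 to one image and return the corrected boxes."""
    corrected: List[Box] = []
    unmatched = list(reviewed)
    for p in predicted:
        best = max(unmatched, key=lambda r: iou(p, r), default=None)
        if best is None or iou(p, best) < match_iou:
            continue                  # S243: false detection, drop the box
        unmatched.remove(best)
        # S241: keep a close prediction, otherwise use the fine-tuned box
        corrected.append(p if iou(p, best) >= keep_iou else best)
    corrected.extend(unmatched)       # S242: missed targets, add their boxes
    return corrected
```

The returned list is what would be merged into the iterative training set X for that image.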

5. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 1, wherein the field robot comprises a train intelligent inspection robot, a power distribution station inspection robot and a track area inspection robot;

the sensor comprises a main transformer on-line monitoring sensor, a GIS on-line monitoring sensor, a high-voltage cable on-line monitoring sensor and a train trackside comprehensive detection system sensor.

6. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 2, wherein step S13 comprises training a bolt loosening detection model on the train component images acquired by the robot using a deep learning algorithm based on instance segmentation, the loss function being given by the following formula:

7. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 1, wherein step S13 further comprises:

for a classification model, training with a deep-learning-based classification algorithm, generating abnormal gradient values by applying a standard Gaussian distribution during data generation, and adding these gradient values to the computed gradient of the model to improve the model's one-class discrimination capability;

where x is the original data, σ is the standard deviation, equal to 1, and μ is the mean, equal to 0;

during training, the algorithm additionally learns from the abnormal gradient values, as expressed by the following formula:

where θ is the model parameter, x is the original value, h is the generated gradient, l is the loss function, and η is a parameter controlling the range of the generated data, taking a value between 0 and 1.
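The standard Gaussian distribution referred to in this claim can be written out as below, assuming the expression intended is the usual normal density with the stated mean and standard deviation. This is an assumption; the formula itself is not reproduced in this text.

```latex
f(x) = \frac{1}{\sigma\sqrt{2\pi}}
       \exp\!\left(-\frac{(x-\mu)^{2}}{2\sigma^{2}}\right),
\qquad \mu = 0,\ \sigma = 1
\;\Longrightarrow\;
f(x) = \frac{1}{\sqrt{2\pi}}\, e^{-x^{2}/2}
```

How the generated gradient h and the parameter η enter the training update is not recoverable from this text, so no expression is offered for that step.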

8. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 2, wherein step S13 further comprises constructing a fault diagnosis model for audio data, the fault diagnosis algorithm using a deep CNN for feature learning, with the loss function given by the following formula:

where N is the total number of samples, d² is the squared Euclidean distance between the samples of a pair, and y ∈ {0, 1}, with y = 0 meaning that the sample pair does not belong to the same class and y = 1 meaning that the sample pair belongs to the same class; thus, the larger the distance between different classes, the smaller the loss, and the smaller the distance within the same class, the smaller the loss.
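The behavior described here matches the widely used contrastive loss, L = (1/2N) Σ [ y·d² + (1 − y)·max(m − d, 0)² ]. The sketch below implements that form as an assumption, since the formula itself is not reproduced in this text; the margin m (`margin`) and the function name `contrastive_loss` are hypothetical and do not appear in the claim.

```python
from typing import Sequence


def contrastive_loss(distances: Sequence[float], labels: Sequence[int],
                     margin: float = 1.0) -> float:
    """Mean contrastive loss over N sample pairs.

    distances[i] is the Euclidean distance d of pair i; labels[i] is y
    (1 = same class, 0 = different classes). `margin` is an assumed
    hyperparameter that does not appear in the claim text.
    """
    if len(distances) != len(labels) or not distances:
        raise ValueError("need one label per distance and at least one pair")
    total = 0.0
    for d, y in zip(distances, labels):
        same = y * d ** 2                           # small when same-class pairs are close
        diff = (1 - y) * max(margin - d, 0.0) ** 2  # small when different classes are far apart
        total += same + diff
    return total / (2 * len(distances))
```

For example, a same-class pair at distance 0 and a different-class pair at a distance beyond the margin both contribute zero loss, which is the behavior the claim describes.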

9. The machine learning model iteration method of the intelligent operation and maintenance system according to claim 1, wherein identifying bolt loosening in the train component images collected by the robot comprises two steps, registration and model inference;

similarity score:

where the terms are, respectively, a pixel of the image A to be identified, a pixel of the template image B, and the similarity score between these two pixels;

matchability score:

where the score is the matchability score corresponding to a pixel; the similarity score is combined with the matchability score to obtain a local matching matrix:

where the terms are, respectively, the similarity score of pixel k in A and pixel j in B, the similarity score of pixel k in B and pixel i in A, the matchability score corresponding to pixel i of A, the matchability score corresponding to pixel j of B, and the matching value of pixel k in A and pixel j in B; when the matching value is greater than the threshold t and greater than the other values in its row and column, that is, the similarity of the two points is higher than that of any other points in the two images, the two points are in a matching relationship; after the matching relationships of all points are determined, the transformation between the two images can be determined, and the image to be identified is then transformed into the frame of the template image to obtain an image Ix;
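The selection rule in the registration step (a matching value above the threshold t that is also the largest in its row and column) can be sketched as follows. The way the similarity and matchability scores are combined here, a product of the two matchability scores with a softmax over each row and each column of the similarity matrix, is an assumption of this sketch; the claim's own formulas are not reproduced in this text. The names `softmax` and `match_points` are hypothetical.

```python
import math
from typing import List, Tuple


def softmax(values: List[float]) -> List[float]:
    """Numerically stable softmax of a list of scores."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    s = sum(exps)
    return [e / s for e in exps]


def match_points(sim: List[List[float]], match_a: List[float],
                 match_b: List[float], t: float = 0.1) -> List[Tuple[int, int]]:
    """Return (i, j) pairs whose matching value exceeds t and is the
    maximum of both its row and its column.

    sim[i][j]: similarity score between point i of A and point j of B.
    match_a[i], match_b[j]: matchability scores in [0, 1].
    """
    n, m = len(sim), len(sim[0])
    row_sm = [softmax(row) for row in sim]  # normalize over j for each i
    col_sm = [softmax([sim[i][j] for i in range(n)]) for j in range(m)]  # over i for each j
    # Assumed combination: matchability of both points times both softmax terms.
    p = [[match_a[i] * match_b[j] * row_sm[i][j] * col_sm[j][i]
          for j in range(m)] for i in range(n)]
    matches = []
    for i in range(n):
        for j in range(m):
            v = p[i][j]
            if (v > t and v == max(p[i])
                    and v == max(p[k][j] for k in range(n))):
                matches.append((i, j))
    return matches
```

The matched pairs would then feed an estimate of the transformation between the two images, for instance a homography fit; that step is outside this sketch.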

10. A computer system comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the steps of the method of any of claims 1 to 9.