CN115510302A - Intelligent factory data classification method based on big data statistics - Google Patents
Intelligent factory data classification method based on big data statistics Download PDFInfo
- Publication number
- CN115510302A CN115510302A CN202211432355.9A CN202211432355A CN115510302A CN 115510302 A CN115510302 A CN 115510302A CN 202211432355 A CN202211432355 A CN 202211432355A CN 115510302 A CN115510302 A CN 115510302A
- Authority
- CN
- China
- Prior art keywords
- data
- intelligent
- obtaining
- time
- intelligent factory
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 33
- 230000002159 abnormal effect Effects 0.000 claims description 98
- 238000012545 processing Methods 0.000 abstract description 3
- 230000005856 abnormality Effects 0.000 description 22
- 238000012544 monitoring process Methods 0.000 description 11
- 238000004364 calculation method Methods 0.000 description 8
- 230000008859 change Effects 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 5
- 238000007405 data analysis Methods 0.000 description 3
- 238000013523 data management Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000003064 k means clustering Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/906—Clustering; Classification
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/30—Computing systems specially adapted for manufacturing
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
The invention relates to the field of data processing, in particular to an intelligent factory data classification method based on big data statistics.
Description
Technical Field
The application relates to the field of data processing, in particular to an intelligent factory data classification method based on big data statistics.
Background
With the continuous development of intelligent technology, intelligent monitoring and intelligent management are vigorously developed for various industries, for example, in various large-scale plants, digital intelligent monitoring is realized for the operation monitoring of the plants, that is, the operation abnormality of the plants is reflected by the abnormality of monitoring data. However, due to the fact that the operation data of the plant is increasing due to long-term operation of the plant, a large amount of data analysis is needed in the data analysis of the operation monitoring, and therefore in order to facilitate quick acquisition of abnormal data in the plant operation monitoring, data classification needs to be performed according to the abnormality of the original data, namely, the abnormal data and the normal data are classified and stored, so that the data are needed to be analyzed in an abnormal manner.
In the abnormal analysis of the data, the difference between the data and the distribution density of the data are mainly utilized, for example, in the existing clustering algorithm, but the clustering only aims at the size difference of the data, and the abnormality of the data with the change trend cannot be reflected well, so that the abnormality of the plant operation data cannot be judged accurately. The abnormal degree of the data is determined by respectively determining the overall data distribution relation and the difference of the data on the time sequence, wherein the overall abnormal score of the data is analyzed by using a CBLOF algorithm, but the conventional CBLOF algorithm clustering excessively depends on the distinguishing of the size clusters, the characteristic of the clustering is neglected, the abnormal score of the data is single and the reliability is not high, so the final abnormal score is determined by combining the size of the clustering and the time span of the data in the clustering.
Disclosure of Invention
In order to solve the technical problem, the invention provides an intelligent factory data classification method based on big data statistics, which comprises the following steps:
acquiring an intelligent factory data sequence formed by intelligent factory data, and obtaining a plurality of clusters according to the intelligent factory data sequence;
obtaining the time span of the cluster to which each intelligent factory data belongs according to each cluster, and obtaining the abnormal score of each intelligent factory data according to a plurality of clusters, the time span and the number of data contained in the clusters; obtaining a plurality of time windows according to the intelligent factory data sequence; obtaining the difference of each smart factory data relative to the time window according to the difference of the adjacent data in the time window and the abnormal score; obtaining a first abnormal degree of each smart factory data according to the difference of each smart factory data relative to each time window; obtaining a second abnormal degree of each intelligent factory data according to the first abnormal degree and the abnormal score of the intelligent factory data;
and obtaining an abnormal data set and a normal data set according to the intelligent factory data sequence and the second abnormal degree of each intelligent factory data, and performing distributed storage on the abnormal data set and the normal data set.
Preferably, the method for obtaining the abnormal score of each smart plant data according to the plurality of clusters, the time span and the number of data included in the cluster includes:
acquiring the number of data contained in the cluster to which the intelligent factory data belongs, and recording the number of the data as the first number of the clusters to which the intelligent factory data belongs; acquiring the number of data contained in each cluster and recording the number as a second number, and acquiring the maximum value of the second number of all clusters and recording the maximum number; acquiring the distance between the data of each intelligent factory in each cluster and the center of the cluster to which the data belongs, and recording the distance as a first distance;
and obtaining the abnormal score of each intelligent factory data according to the time span, the first number, the maximum number and the first distance of the cluster to which each intelligent factory data belongs.
Preferably, the formula for obtaining the abnormal score of each smart factory data according to the time span, the first number, the maximum number and the first distance of the cluster to which each smart factory data belongs is as follows:
wherein,the maximum number is represented by the number of the cells,denotes the firstA first number of clusters to which the individual smart factory data belongs,is shown asThe time span of the cluster to which the individual smart factory data belongs,is shown asA first distance of the intelligent factory data,is shown asAbnormal scores of intelligent plant data.
Preferably, the method for obtaining the difference of each smart plant data relative to the time window according to the difference of the neighboring data in the time window and the abnormal score includes:
the method comprises the steps of obtaining a plurality of time windows of each intelligent factory data, calculating a time difference value between each intelligent factory data and each data in each time window, obtaining a plurality of adjacent data of each intelligent factory data in each time window, obtaining a standard deviation of each intelligent factory data in each time window according to the time difference value, and obtaining a difference of each intelligent factory data relative to each time window according to the adjacent data of each time window, an abnormal score of each adjacent data and the standard deviation of each time window of each intelligent factory data, namely the difference of each intelligent factory data relative to each time window.
Preferably, the formula for obtaining the difference of each smart plant data relative to each belonging time window according to the respective neighboring data of each belonging time window of each smart plant data, the anomaly score of each neighboring data and the standard deviation of each belonging time window of each smart plant data is as follows:
wherein,is shown asThe data of the intelligent factory is stored in the database,is shown asThe first of the smart factory dataTo the first of the time windowsThe number of the adjacent data is one,is shown asThe first of the smart factory dataTo the first of the time windowsA normalized value of the anomaly score for each of the neighboring data,is shown asThe first of the smart factory dataThe number of adjacent data is contained in each time window,is shown asThe first of the intelligent factory dataThe standard deviation of the individual time windows to which it belongs,denotes the firstThe smart factory data relative toThe variability of the individual time windows.
Preferably, the method for obtaining the first abnormal degree of each smart factory data according to the difference of each smart factory data relative to each time window includes:
the method comprises the steps of obtaining a plurality of time affiliated time windows of each smart factory data, obtaining a standard deviation of each affiliated time window of each smart factory data by using data in each affiliated time window of each smart factory data, and obtaining a first abnormal degree of each smart factory data according to the difference of each smart factory data relative to each time window and the standard deviation of each affiliated time window of each smart factory data.
Preferably, the formula for obtaining the first abnormal degree of each smart plant data according to the difference of each smart plant data relative to each time window and the standard deviation of each smart plant data in each time window is as follows:
wherein,is shown asThe intelligent factory data relative to the firstThe difference of the time windows to which it belongs,is shown asThe number of the time windows of the intelligent factory data,is shown asStandard deviation of the variability of the individual smart factory data over all time windows,denotes the firstA first degree of anomaly of the intelligent plant data.
Preferably, the method for obtaining the time span of the cluster to which each smart plant data belongs according to each cluster includes:
and forming a data pair by any two intelligent factory data in the cluster to which the intelligent factory data belong, calculating the time difference of the two intelligent factory data in each data pair, and obtaining the time span of the cluster to which each intelligent factory data belongs according to the time difference of all the data pairs in the cluster to which the intelligent factory data belong.
The embodiment of the invention at least has the following beneficial effects: firstly, reflecting the possibility of the cluster itself having abnormality according to the size of the cluster, and highlighting the influence of the cluster size on the data abnormality; and then, judging the influence relationship among the data of the same cluster according to the time sequence span of the data contained in the cluster, namely, considering the influence of the data time sequence relationship on data abnormity judgment, and performing more accurate data abnormity judgment.
Then, the relative difference of the data is determined according to the difference of the time sequence data and the relative relation of the window data in the calculation window on the time sequence, the data difference abnormity caused by the larger difference between the trend change data is avoided, the influence of the abnormal score of other data on the window calculation is considered in the window calculation, and the influence of abnormal data in the window on the abnormal judgment of other data is avoided.
Moreover, in the abnormal degree of the data time sequence, the influence of the data abnormal score on the judgment of the data time sequence abnormality is introduced, the common influence of the data abnormal score and the judgment of the data time sequence abnormality is strengthened, the final abnormal degree of the data is obtained and is used as the basis for data abnormality classification, namely, the data is classified more accurately.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions and advantages of the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a flowchart of an intelligent plant data classification method based on big data statistics according to the present invention.
Detailed Description
To further illustrate the technical means and effects of the present invention for achieving the predetermined objects, the following detailed description, structures, features and effects of the method for classifying intelligent plant data based on big data statistics according to the present invention are provided with the accompanying drawings and preferred embodiments. In the following description, different "one embodiment" or "another embodiment" refers to not necessarily the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
The following describes a specific scheme of the intelligent factory data classification method based on big data statistics in detail with reference to the accompanying drawings.
Referring to fig. 1, a flowchart illustrating steps of a big data statistics-based intelligent plant data classification method according to an embodiment of the present invention is shown, where the method includes the following steps:
and S001, acquiring data to obtain an intelligent factory data sequence, and obtaining a plurality of clusters according to the intelligent factory data sequence.
1. Collecting data:
in order to control the running state of the factory in time in the intelligent factory, real-time running monitoring needs to be carried out, and then in order to facilitate data analysis, the generated data needs to be transmitted to a unified data management platform for analysis management. The data that this scheme was gathered are the operation monitoring data of wisdom mill for the data that the unified data management platform of mill related.
Arranging the collected intelligent factory operation monitoring in time sequence to obtain an intelligent factory data sequence, and calling each data in the intelligent factory data sequence as intelligent factory data, such as vibration data of equipment engine, temperature data of equipment, etc
2. Obtaining a plurality of clusters according to the intelligent factory data sequence:
clustering each data in the intelligent factory data sequence by using a K-Means clustering algorithm to obtainCluster, in the schemeAnd taking 10, and taking the average value of all data in each cluster as the central data of each cluster.
And step S002, obtaining the abnormal score of each intelligent factory data according to the clusters.
The intelligent factory operation data is mainly used for monitoring and analyzing the intelligent factory operation abnormity, the intelligent factory operation abnormity is mainly reflected in data abnormity, and abnormal data needs to be frequently called in the factory abnormity monitoring analysis, so that the abnormal data is called for convenience in abnormity analysis.
First, the amount of the intelligent factory data contained in all clusters is obtainedAnd K represents the number of clusters, then the current cluster number is arranged from small to large, the first W clusters in the cluster sequence are selected as small clusters, W =3 is set in the invention, and the rest clusters are large clusters.
The obtained small clusters are respectively represented asThe large clusters are respectively represented asWhere n represents the number of small clusters,indicating the number of large clusters.
Will find the smart factory data in the sequenceForming a data pair by any two intelligent factory data in a cluster to which the intelligent factory data belongs, calculating the time difference of the two intelligent factory data in each data pair, and calculating the time difference of the two intelligent factory data in the data pairThe maximum value of the time difference of all data pairs in the cluster to which the intelligent factory data belongs is taken as the firstTime span of clustering to which intelligent factory data belongs。
Get the firstThe number of data contained in the cluster to which the smart factory data belongs is recorded asFirst number of clusters to which smart factory data belongs(ii) a Acquiring the number of data contained in each cluster and recording the number as a second number, and acquiring the maximum value of the second numbers of all clusters and recording the maximum number(ii) a When it comes toWhen the cluster of the intelligent factory data is a big cluster, the first cluster isThe Euclidean distance between the data of the intelligent factory and the data of the cluster center is recorded as the firstFirst distance of smart factory data(ii) a When it comes toWhen the cluster to which the data of the smart factory belongs is a small cluster, the first cluster is obtainedThe Euclidean distance between the data of the intelligent factory and the data of each large cluster center is recorded as the firstFirst distance of smart factory data。
According to the firstTime span of cluster to which smart factory data belongsFirst number ofMaximum number of cellsAnd a first step ofFirst distance of intelligent factory dataTo obtain the firstNumber of intelligent plantsAccording to the abnormal score:
wherein,the maximum number is indicated by the number of bits,denotes the firstA first number of clusters to which the smart factory data belongs,is shown asThe relative size of the number of the cluster data to which the smart factory data belongs indicates the number of the cluster dataThe smaller the number of the cluster data to which the intelligent factory data belongs, the higher the possibility of reflecting the abnormality of the cluster itself to which the data belongs, soThe larger the anomaly score of the individual intelligent plant data,denotes the firstThe larger the value of the time span of the cluster to which the intelligent factory data belongs, the larger the time sequence span of the data in the cluster to which the data belongs, the smaller the influence relationship among the data, and the larger the possibility of the data in the cluster to which the data belongs to have abnormity, namely the abnormal score of the dataThe larger the number is,denotes the firstA first distance of the intelligent factory data, the larger the value is, the more the first distance isThe greater the distance of the individual smart factory data from the cluster center, i.e., the firstThe data of an intelligent factory is different from most of the data, therebyThe larger the anomaly score of the individual intelligent plant data,is shown asAbnormal scores of individual smart factory data.
Obtaining the abnormal score of each intelligent factory data, and when analyzing the abnormal score of each intelligent factory data, firstly reflecting the possibility of the cluster itself having abnormality according to the size of the cluster aiming at the abnormal score, and highlighting the influence of the cluster size on the data abnormality; and then, judging the influence relationship among the same cluster data according to the time sequence span of the data contained in the cluster, namely, considering the influence of the data time sequence relationship on data abnormity judgment, and performing more accurate data abnormity judgment, namely obtaining more accurate data abnormity scores, thereby performing more accurate classification on the factory data.
Step S003, a second abnormal degree of each intelligent factory data is obtained according to the plurality of clusters and the abnormal score of each intelligent factory data.
1. Calculate the variance of each smart plant data against each time window:
the intelligent factory data may have stable and unchangeable data and may also have data with a certain trend change, so that the abnormal degree of the final data needs to be determined according to the change relation of the data on the time sequence at the moment, and the abnormal degree is used as a classification basis for the final abnormal data.
Is set to a size ofTime window of (2), the schemeAnd taking 40, sliding in the intelligent factory data sequence by using the time window with the step length of 1, wherein each sliding corresponds to one time window, and a plurality of time windows are obtained in the sliding process.
Obtaining includes the firstPersonal wisdom factory dataAll time windows of (2) are denoted asA plurality of time windows of the intelligent factory data;
calculate the firstPersonal intelligent factory dataAnd a firstThe time difference between each data in the belonged time window isEach data in each time window is according to time differenceThe values are arranged from small to large, and the time difference value is obtained beforeData as the firstThe first of the smart factory dataA plurality of adjacent data of the time window, in the schemeTaking 10;
by using the firstPersonal intelligent factory dataAnd a firstCalculating standard deviation of all data in the time window as the secondThe smart factory data andstandard deviation of the associated time window。
According to the firstA plurality of time windows, the first of the intelligent factory dataNumber of intelligent plantsAccording to a plurality of adjacent data and standard deviation of each time windowVariability of individual smart factory data versus time window:
wherein,denotes the firstThe data of the intelligent factory is stored in the database,is shown asThe first of the smart factory dataTo the first of the time windowsThe number of the adjacent data is one,denotes the firstThe smart factory data andthe first of the smart factory dataTo the first of the time windowsA difference between adjacent data, the larger the value is, the more the secondThe difference between the value of the smart factory data and the value of the data adjacent to the time sequence is large,is shown asThe first of the smart factory dataTo which the second time window belongsA normalized value of the anomaly score of the neighboring data, a larger value indicating a larger anomaly score of the neighboring data, and using the data as a referenceWhen the intelligent factory data is analyzed, the smaller the reference value of the data,is shown asThe first of the intelligent factory dataThe number of adjacent data is contained in each time window,is shown asAn intelligent factoryFirst of dataThe standard deviation of each time window is larger, the larger the standard deviation value is, the larger the data difference in the window isThe less the variance of the smart factory data with respect to the window,denotes the firstThe intelligent factory data relative to the firstThe variability of the individual time windows.
2. Calculating a first degree of anomaly of each smart plant data:
data in time series, each smart factory data may exist in a plurality of associated time windows, i.e. dataWith multiple relative disparities with respect to the window data, now forRelative differences in a plurality of associated time windows are determinedFirst degree of abnormality in time series:
wherein,denotes the firstThe intelligent factory data relative to the firstThe difference of the time windows, the larger the value, the second indicationThe greater the degree of abnormality of the individual intelligent plant data,denotes the firstThe number of the time windows of the intelligent factory data,to representIn thatRelative difference mean of each belonged time window is integrally reflectedThe relative difference in the time series of the samples,the larger the size of the tube is,the greater the degree of abnormality in the time series,is shown asStandard deviation of the variance of the individual smart factory data for all time windows,is shown asA first degree of anomaly of the smart factory data.
When the first abnormal degree is determined, the relative difference of the data is determined according to the difference of time sequence data and the relative relation of window data in a time window on a time sequence, the abnormal data difference caused by the large difference of trend change data is avoided, the influence of abnormal scores of other data on window calculation is considered during window operation, and the influence of abnormal score data in the window on the abnormal judgment of other data is avoided; and finally, obtaining the abnormal degree of the final data on the time sequence through the relative difference of a plurality of calculation windows where the data are positioned, wherein the local abnormality of the data is further reflected by considering the difference of the relative difference of the plurality of windows.
3. Calculating a second abnormal degree of each intelligent factory data:
is combined withAbnormal score of individual smart factory data andthe first abnormal degree of the intelligent factory data is judgedThe second degree of anomaly of the smart factory data is:
denotes the firstThe abnormal score of the data of the intelligent factory,is shown asThe greater the first degree of abnormality of the intelligent factory data,second degree of abnormality ofThe larger.
And obtaining a second abnormal degree of the data of each intelligent factory, obtaining a data abnormal score and the abnormal degree of the data on a time sequence by combining the CBLOF algorithm and the data time sequence change during the abnormal degree analysis, and judging the data abnormality from the overall data distribution and the data time sequence. And in the abnormal degree of the data time sequence, the influence of the data abnormal score on the abnormal judgment of the data time sequence is introduced, the common influence of the data abnormal score and the abnormal judgment of the data time sequence is strengthened, the final abnormal degree of the data is obtained and is used as the basis for data abnormal classification, namely, the data is classified more accurately.
And step S004, obtaining an abnormal data set and a normal data set according to the second abnormal degree of each intelligent factory data, and performing distributed storage on the abnormal data set and the normal data set.
Arranging the intelligent factory data according to the second abnormal degree from large to smallThe set formed by the intelligent factory data is used as an abnormal data set, and the intelligent factory data sequence is processedAnd the set formed by the remaining intelligent factory data in the column is used as a normal data set.
And the abnormal data set and the normal data set are stored in a distributed manner, so that the abnormal data can be inquired and called quickly.
In summary, the embodiment of the present invention provides an intelligent factory data classification method based on big data statistics, which reflects the possibility that a cluster itself has an abnormality according to the size of the cluster, and highlights the influence of the cluster size on data abnormality; and then, judging the influence relationship among the data of the same cluster according to the time sequence span of the data contained in the cluster, namely considering the influence of the data time sequence relationship on the data abnormity judgment, and performing more accurate data abnormity judgment. Then, the relative difference of the data is determined according to the difference of the time sequence data and the relative relation of the window data in the calculation window on the time sequence, the data difference abnormity caused by the larger difference between the trend change data is avoided, the influence of the abnormal score of other data on the window calculation is considered in the window calculation, and the influence of abnormal data in the window on the abnormal judgment of other data is avoided. And in the abnormal degree of the data time sequence, the influence of the data abnormal score on the abnormal judgment of the data time sequence is introduced, the common influence of the data abnormal score and the abnormal judgment of the data time sequence is strengthened, the final abnormal degree of the data is obtained and is used as the basis for data abnormal classification, namely, the data is classified more accurately.
It should be noted that: the sequence of the above embodiments of the present invention is only for description, and does not represent the advantages or disadvantages of the embodiments. And specific embodiments thereof have been described above. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
All the embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from other embodiments.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that are within the spirit of the present invention are intended to be included therein.
Claims (8)
1. The intelligent factory data classification method based on big data statistics is characterized by comprising the following steps:
acquiring an intelligent factory data sequence formed by intelligent factory data, and obtaining a plurality of clusters according to the intelligent factory data sequence;
obtaining the time span of the cluster to which each intelligent factory data belongs according to each cluster, and obtaining the abnormal score of each intelligent factory data according to a plurality of clusters, the time span and the number of data contained in the clusters; obtaining a plurality of time windows according to the intelligent factory data sequence; obtaining the difference of each smart factory data relative to the time window according to the difference of the adjacent data in the time window and the abnormal score; obtaining a first abnormal degree of each smart factory data according to the difference of each smart factory data relative to each time window; obtaining a second abnormal degree of each intelligent factory data according to the first abnormal degree and the abnormal score of the intelligent factory data;
and obtaining an abnormal data set and a normal data set according to the intelligent factory data sequence and the second abnormal degree of each intelligent factory data, and performing distributed storage on the abnormal data set and the normal data set.
2. The intelligent plant data classification method based on big data statistics as claimed in claim 1, wherein the method for obtaining the abnormal score of each intelligent plant data according to the plurality of clusters, the time span and the number of data included in the cluster comprises:
acquiring the number of data contained in the cluster to which the intelligent factory data belongs, and recording the number of the data as the first number of the clusters to which the intelligent factory data belongs; acquiring the number of data contained in each cluster and recording the number as a second number, and acquiring the maximum value of the second number of all clusters and recording the maximum number; acquiring the distance between each intelligent factory data in each cluster and the center of the cluster to which the intelligent factory data belongs, and recording the distance as a first distance;
and obtaining the abnormal score of each intelligent factory data according to the time span, the first number, the maximum number and the first distance of the cluster to which each intelligent factory data belongs.
3. The intelligent plant data classification method based on big data statistics as claimed in claim 2, wherein the formula for obtaining the abnormal score of each intelligent plant data according to the time span, the first number, the maximum number and the first distance of the cluster to which each intelligent plant data belongs is as follows:
wherein,the maximum number is indicated by the number of bits,denotes the firstA first number of clusters to which the smart factory data belongs,is shown asThe time span of the cluster to which the intelligent factory data belongs,is shown asA first distance of the smart factory data,is shown asAbnormal scores of individual smart factory data.
4. The intelligent big data statistics-based plant data classification method of claim 1, wherein the method for obtaining the variance of each intelligent plant data with respect to a time window according to the variance of neighboring data in the time window and the anomaly score comprises:
the method comprises the steps of obtaining a plurality of time windows of each intelligent factory data, calculating a time difference value between each intelligent factory data and each data in each time window, obtaining a plurality of adjacent data of each intelligent factory data in each time window, obtaining a standard deviation of each intelligent factory data in each time window according to the time difference value, and obtaining a difference of each intelligent factory data relative to each time window according to the adjacent data of each time window, an abnormal score of each adjacent data and the standard deviation of each time window of each intelligent factory data, namely the difference of each intelligent factory data relative to each time window.
5. The intelligent big data statistics-based plant data classification method according to claim 4, wherein the formula for obtaining the difference of each smart plant data relative to each belonging time window according to the respective neighboring data of each belonging time window of each smart plant data, the abnormal score of the respective neighboring data and the standard deviation of each belonging time window of each smart plant data is as follows:
wherein,is shown asThe data of the intelligent factory is stored in the database,is shown asThe first of the intelligent factory dataTo the first of the time windowsThe number of the adjacent data is one,is shown asThe first of the smart factory dataTo the first of the time windowsA normalized value of the anomaly score for each of the neighboring data,denotes the firstThe first of the smart factory dataThe number of adjacent data is contained in each time window,is shown asThe first of the smart factory dataThe standard deviation of the individual time windows to which it belongs,is shown asThe smart factory data relative toThe variability of the individual time windows.
6. The method of claim 1, wherein the method for obtaining the first abnormal degree of each smart plant data according to the difference of each smart plant data with respect to each time window comprises:
the method comprises the steps of obtaining a plurality of time affiliated time windows of each intelligent factory data, obtaining the standard deviation of each affiliated time window of each intelligent factory data by utilizing data in each affiliated time window of each intelligent factory data, and obtaining the first abnormal degree of each intelligent factory data according to the difference of each intelligent factory data relative to each time window and the standard deviation of each affiliated time window of each intelligent factory data.
7. The intelligent plant data classification method based on big data statistics as claimed in claim 6, wherein the formula for obtaining the first anomaly degree of each smart plant data according to the variance of each smart plant data with respect to each time window and the standard deviation of each smart plant data belonging to each time window is:
wherein,denotes the firstThe smart factory data relative toThe difference of the time windows to which each belongs,is shown asThe number of the time windows of the intelligent factory data,denotes the firstStandard deviation of the variance of the individual smart factory data for all time windows,is shown asA first degree of anomaly of the smart factory data.
8. The intelligent big data statistics-based plant data classification method of claim 1, wherein the method for obtaining the time span of the cluster to which each intelligent plant data belongs according to each cluster comprises:
and forming a data pair by any two intelligent factory data in the cluster to which the intelligent factory data belong, calculating the time difference of the two intelligent factory data in each data pair, and obtaining the time span of the cluster to which each intelligent factory data belongs according to the time difference of all the data pairs in the cluster to which the intelligent factory data belong.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211432355.9A CN115510302B (en) | 2022-11-16 | 2022-11-16 | Intelligent factory data classification method based on big data statistics |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211432355.9A CN115510302B (en) | 2022-11-16 | 2022-11-16 | Intelligent factory data classification method based on big data statistics |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115510302A true CN115510302A (en) | 2022-12-23 |
CN115510302B CN115510302B (en) | 2023-04-07 |
Family
ID=84513736
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211432355.9A Expired - Fee Related CN115510302B (en) | 2022-11-16 | 2022-11-16 | Intelligent factory data classification method based on big data statistics |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115510302B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115858894A (en) * | 2023-02-14 | 2023-03-28 | 温州众成科技有限公司 | Visual big data analysis method |
CN117054902A (en) * | 2023-09-06 | 2023-11-14 | 斯润天朗(合肥)科技有限公司 | Lithium battery voltage sequencing abnormality detection method and device, electronic equipment and medium |
CN117131530A (en) * | 2023-10-20 | 2023-11-28 | 合肥亚明汽车部件有限公司 | Intelligent factory sensitive data encryption protection method |
CN117370917A (en) * | 2023-12-07 | 2024-01-09 | 城光(湖南)节能环保服务股份有限公司 | Urban intelligent street lamp service life prediction method and system |
Citations (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008191754A (en) * | 2007-02-01 | 2008-08-21 | National Institute Of Advanced Industrial & Technology | Abnormality detection apparatus and abnormality detection method |
JP2012018623A (en) * | 2010-07-09 | 2012-01-26 | Fuji Electric Co Ltd | Abnormality data analysis system |
US20140039834A1 (en) * | 2012-08-01 | 2014-02-06 | Hitachi Power Solutions Co., Ltd. | Method and apparatus for monitoring equipment conditions |
CN110084326A (en) * | 2019-05-13 | 2019-08-02 | 东北大学 | A kind of industrial equipment method for detecting abnormality based on fuzzy set |
US20190302713A1 (en) * | 2018-03-27 | 2019-10-03 | Terminus (Beijing) Technology Co., Ltd. | Method and device for automatically diagnosing and controlling apparatus in intelligent building |
CN110794800A (en) * | 2019-12-11 | 2020-02-14 | 河南中烟工业有限责任公司 | Monitoring system for wisdom mill information management |
CN112783938A (en) * | 2020-12-30 | 2021-05-11 | 河海大学 | Hydrological telemetering real-time data anomaly detection method |
CN113608968A (en) * | 2021-08-23 | 2021-11-05 | 北京邮电大学 | Power dispatching monitoring data anomaly detection method based on density and distance comprehensive decision |
CN113626502A (en) * | 2021-08-13 | 2021-11-09 | 南方电网深圳数字电网研究院有限公司 | Power grid data anomaly detection method and device based on ensemble learning |
CN113726911A (en) * | 2021-11-01 | 2021-11-30 | 南京绛门信息科技股份有限公司 | Factory data acquisition and processing system based on Internet of things technology |
CN113868342A (en) * | 2021-09-06 | 2021-12-31 | 江苏荣辉信息科技有限公司 | Production equipment data acquisition method based on intelligent factory |
US20220043860A1 (en) * | 2020-08-10 | 2022-02-10 | International Business Machines Corporation | Abnormal data detection |
CN114241294A (en) * | 2021-11-09 | 2022-03-25 | 国网浙江省电力有限公司信息通信分公司 | Method, system, equipment and storage medium for equipment abnormity detection |
US20220113938A1 (en) * | 2020-10-12 | 2022-04-14 | Vmware, Inc. | Methods and systems that rank and display log/event messages and transactions |
US20220138504A1 (en) * | 2020-10-29 | 2022-05-05 | Oracle International Corporation | Separation maximization technique for anomaly scores to compare anomaly detection models |
CN114528934A (en) * | 2022-02-18 | 2022-05-24 | 中国平安人寿保险股份有限公司 | Time series data abnormity detection method, device, equipment and medium |
CN114676883A (en) * | 2022-03-02 | 2022-06-28 | 深圳江行联加智能科技有限公司 | Power grid operation management method, device and equipment based on big data and storage medium |
US20220334574A1 (en) * | 2019-09-27 | 2022-10-20 | Tata Consultancy Services Limited | Method and system for diagnosing anomaly in a manufacturing plant |
-
2022
- 2022-11-16 CN CN202211432355.9A patent/CN115510302B/en not_active Expired - Fee Related
Patent Citations (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008191754A (en) * | 2007-02-01 | 2008-08-21 | National Institute Of Advanced Industrial & Technology | Abnormality detection apparatus and abnormality detection method |
JP2012018623A (en) * | 2010-07-09 | 2012-01-26 | Fuji Electric Co Ltd | Abnormality data analysis system |
US20140039834A1 (en) * | 2012-08-01 | 2014-02-06 | Hitachi Power Solutions Co., Ltd. | Method and apparatus for monitoring equipment conditions |
US20190302713A1 (en) * | 2018-03-27 | 2019-10-03 | Terminus (Beijing) Technology Co., Ltd. | Method and device for automatically diagnosing and controlling apparatus in intelligent building |
CN110084326A (en) * | 2019-05-13 | 2019-08-02 | 东北大学 | A kind of industrial equipment method for detecting abnormality based on fuzzy set |
US20220334574A1 (en) * | 2019-09-27 | 2022-10-20 | Tata Consultancy Services Limited | Method and system for diagnosing anomaly in a manufacturing plant |
CN110794800A (en) * | 2019-12-11 | 2020-02-14 | 河南中烟工业有限责任公司 | Monitoring system for wisdom mill information management |
US20220043860A1 (en) * | 2020-08-10 | 2022-02-10 | International Business Machines Corporation | Abnormal data detection |
US20220113938A1 (en) * | 2020-10-12 | 2022-04-14 | Vmware, Inc. | Methods and systems that rank and display log/event messages and transactions |
US20220138504A1 (en) * | 2020-10-29 | 2022-05-05 | Oracle International Corporation | Separation maximization technique for anomaly scores to compare anomaly detection models |
CN112783938A (en) * | 2020-12-30 | 2021-05-11 | 河海大学 | Hydrological telemetering real-time data anomaly detection method |
CN113626502A (en) * | 2021-08-13 | 2021-11-09 | 南方电网深圳数字电网研究院有限公司 | Power grid data anomaly detection method and device based on ensemble learning |
CN113608968A (en) * | 2021-08-23 | 2021-11-05 | 北京邮电大学 | Power dispatching monitoring data anomaly detection method based on density and distance comprehensive decision |
CN113868342A (en) * | 2021-09-06 | 2021-12-31 | 江苏荣辉信息科技有限公司 | Production equipment data acquisition method based on intelligent factory |
CN113726911A (en) * | 2021-11-01 | 2021-11-30 | 南京绛门信息科技股份有限公司 | Factory data acquisition and processing system based on Internet of things technology |
CN114241294A (en) * | 2021-11-09 | 2022-03-25 | 国网浙江省电力有限公司信息通信分公司 | Method, system, equipment and storage medium for equipment abnormity detection |
CN114528934A (en) * | 2022-02-18 | 2022-05-24 | 中国平安人寿保险股份有限公司 | Time series data abnormity detection method, device, equipment and medium |
CN114676883A (en) * | 2022-03-02 | 2022-06-28 | 深圳江行联加智能科技有限公司 | Power grid operation management method, device and equipment based on big data and storage medium |
Non-Patent Citations (5)
Title |
---|
DEEPALI ARORA,ET AL.: "Big Data Analytics for Classification of Network Enabled Devices", 《2016 30TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA)》 * |
MOURAD BOUACHE,ET AL.: "Smart Anomaly Detection: Deep Learning modeling Approach and System Utilization Analysis", 《2022 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC)》 * |
孙宇.: "电力设备监测数据处理和数据库设计", 《中国优秀硕士学位论文全文数据库(电子期刊)》 * |
肖端翔.: "基于机器学习混合模型的用电数据异常检测研究", 《中国优秀硕士学位论文全文数据库(电子期刊)》 * |
闫伟等: "一种设备实时监控新方法的研究与应用", 《计算机集成制造系统》 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115858894A (en) * | 2023-02-14 | 2023-03-28 | 温州众成科技有限公司 | Visual big data analysis method |
CN117054902A (en) * | 2023-09-06 | 2023-11-14 | 斯润天朗(合肥)科技有限公司 | Lithium battery voltage sequencing abnormality detection method and device, electronic equipment and medium |
CN117054902B (en) * | 2023-09-06 | 2024-03-19 | 斯润天朗(合肥)科技有限公司 | Lithium battery voltage sequencing abnormality detection method and device, electronic equipment and medium |
CN117131530A (en) * | 2023-10-20 | 2023-11-28 | 合肥亚明汽车部件有限公司 | Intelligent factory sensitive data encryption protection method |
CN117131530B (en) * | 2023-10-20 | 2024-01-30 | 合肥亚明汽车部件有限公司 | Intelligent factory sensitive data encryption protection method |
CN117370917A (en) * | 2023-12-07 | 2024-01-09 | 城光(湖南)节能环保服务股份有限公司 | Urban intelligent street lamp service life prediction method and system |
CN117370917B (en) * | 2023-12-07 | 2024-02-23 | 城光(湖南)节能环保服务股份有限公司 | Urban intelligent street lamp service life prediction method and system |
Also Published As
Publication number | Publication date |
---|---|
CN115510302B (en) | 2023-04-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115510302A (en) | Intelligent factory data classification method based on big data statistics | |
CN101484910B (en) | Clustering system, and defect kind judging device | |
CN101516099B (en) | Test method for sensor network anomaly | |
KR102215690B1 (en) | Method and apparatus for time series data monitoring | |
CN108762228A (en) | A kind of multi-state fault monitoring method based on distributed PCA | |
CN114114039B (en) | Method and device for evaluating consistency of single battery cells of battery system | |
CN116992322B (en) | Smart city data center management system | |
CN117113235B (en) | Cloud computing data center energy consumption optimization method and system | |
CN111401785A (en) | Power system equipment fault early warning method based on fuzzy association rule | |
CN110795690A (en) | Wind power plant operation abnormal data detection method | |
CN112905583A (en) | High-dimensional big data outlier detection method | |
CN113408548A (en) | Transformer abnormal data detection method and device, computer equipment and storage medium | |
CN114861788A (en) | Load abnormity detection method and system based on DBSCAN clustering | |
CN117171157B (en) | Clearing data acquisition and cleaning method based on data analysis | |
CN116720090A (en) | Self-adaptive clustering method based on hierarchy | |
CN113762377A (en) | Network traffic identification method, device, equipment and storage medium | |
CN118378199A (en) | Real-time anomaly detection method in big data analysis platform | |
CN105306252A (en) | Method for automatically judging server failures | |
CN117575745B (en) | Course teaching resource individual recommendation method based on AI big data | |
CN116595102B (en) | Big data management method and system for improving clustering algorithm | |
CN112422546A (en) | Network anomaly detection method based on variable neighborhood algorithm and fuzzy clustering | |
CN114708919B (en) | Rapid low-loss population single cell big data simplification method | |
CN116757337A (en) | House construction progress prediction system based on artificial intelligence | |
CN116415714A (en) | Wind power prediction method and device, electronic equipment and readable storage medium | |
CN115718691A (en) | Fault root cause positioning method, device and equipment of micro-service system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20230407 |