CN117527611B - Gaussian distribution-based fault dynamic prediction method, system, electronic equipment and storage medium - Google Patents
Gaussian distribution-based fault dynamic prediction method, system, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN117527611B CN117527611B CN202311475288.3A CN202311475288A CN117527611B CN 117527611 B CN117527611 B CN 117527611B CN 202311475288 A CN202311475288 A CN 202311475288A CN 117527611 B CN117527611 B CN 117527611B
- Authority
- CN
- China
- Prior art keywords
- data
- gaussian distribution
- characteristic
- distribution
- fault
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 230000002159 abnormal effect Effects 0.000 claims abstract description 28
- 238000004590 computer program Methods 0.000 claims description 20
- 238000012544 monitoring process Methods 0.000 claims description 16
- 238000013507 mapping Methods 0.000 claims description 14
- 238000012360 testing method Methods 0.000 claims description 11
- 238000012549 training Methods 0.000 claims description 9
- 230000006870 function Effects 0.000 claims description 5
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims description 3
- 230000004044 response Effects 0.000 claims description 3
- 238000006467 substitution reaction Methods 0.000 claims description 3
- 230000003068 static effect Effects 0.000 abstract description 4
- 230000009286 beneficial effect Effects 0.000 abstract description 2
- 238000011156 evaluation Methods 0.000 abstract 1
- 238000001514 detection method Methods 0.000 description 7
- 238000004422 calculation algorithm Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 1
- 230000002547 anomalous effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/147—Network analysis or design for predicting network behaviour
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/145—Network analysis or design involving simulating, designing, planning or modelling of a network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/16—Threshold monitoring
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The application discloses a fault dynamic prediction method and a related device based on Gaussian distribution, wherein the method comprises the following steps: collecting parameter index historical data related to faults of equipment; constructing time windows of equipment fault abnormal index data, and constructing relatively accurate Gaussian distribution of each time window according to historical data; parameter indexes of equipment are collected and monitored regularly, and warning is given out to possible equipment faults according to abnormal data in time. Compared with the traditional static method, the method has higher adaptability, can automatically adjust the threshold according to the resource type and the scene, and reduces the false alarm rate. In addition, the technology is applicable to various resource types and has wide applicability. The method also adds automatic evaluation and adjustment to the model, improves the accuracy and efficiency of fault prediction, is beneficial to taking measures in advance, reduces the influence of equipment faults on the service, and improves the reliability and performance of the resource management system.
Description
Technical Field
The application belongs to the technical field of equipment fault prediction, and relates to a fault dynamic prediction method based on Gaussian distribution and a related device.
Background
The intelligent resource management system manages a wide variety of resources. The set fault thresholds are different for different kinds of resources, and the fault thresholds are also different for the same kind of resources under different task scenes. The index monitoring and early warning system in the prior art generally applies a static fault threshold method to assist in analyzing abnormal conditions of resources. The static fault threshold method is to set a fixed fault threshold for the monitoring index of the resource, compare the current monitoring index data with the set fault threshold, generate an alarm if the fault threshold is exceeded, and treat the fault threshold as normal if the fault threshold is not exceeded. However, existing fault prediction techniques are typically implemented by monitoring a single resource of the device, and their thresholds are typically set empirically by the practitioner. However, when resources are single, the prediction error may be large, and when the kinds of resources are too many, how the threshold of some resources should be set may be unfamiliar to the professional. In addition, due to the characteristics of different resources, performance indexes of the resources may be in a fluctuation state, and at the moment, the abnormal state of the resources cannot be accurately reflected by adopting a static threshold method.
Disclosure of Invention
The application aims to solve the problems in the prior art and provides a fault dynamic prediction method and a related device based on Gaussian distribution.
In order to achieve the above purpose, the application is realized by adopting the following technical scheme:
In a first aspect, the present application provides a method for dynamic fault prediction based on gaussian distribution, comprising the steps of:
continuously monitoring and collecting the characteristics related to the equipment state and the corresponding characteristic data thereof;
dividing the characteristic data to obtain a plurality of data time windows;
Constructing a mapping relation model of the equipment state and the characteristic data by using a Gaussian distribution method;
and calculating probability distribution and a threshold value of each data time window according to the mapping relation model, and comparing the probability distribution with the default threshold value to obtain a fault prediction result.
In a second aspect, the present application provides a dynamic fault prediction system based on gaussian distribution, comprising:
The data acquisition module is used for continuously monitoring and acquiring the characteristics related to the equipment state and the corresponding characteristic data thereof;
The data dividing module is used for dividing the characteristic data to obtain a plurality of data time windows;
the model construction module is used for constructing a mapping relation model of the equipment state and the characteristic data by using a Gaussian distribution method;
and the fault prediction module is used for calculating probability distribution and a threshold value of each data time window according to the mapping relation model, and comparing the probability distribution with the default threshold value to obtain a fault prediction result.
In a third aspect, the application provides an electronic device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, the processor implementing the steps of the method as described above when executing the computer program.
In a fourth aspect, the present application provides a computer readable storage medium storing a computer program which when executed by a processor performs the steps of a method as described above.
Compared with the prior art, the application has the following beneficial effects:
The application adopts a dynamic fault threshold setting method to search the characteristics of service mutation so as to break through the monitoring blind area of the traditional absolute index before ensuring personnel to perceive and discover the abnormal resources. The hidden danger of the resource index is captured early through the idea of active monitoring, the method is realized through Gaussian distribution according to the normal distribution characteristic of the resource index, the threshold value is obtained more reasonably and accurately, the early warning level is automatically generated, the accurate monitoring of the resource is achieved, the invisible faults in the resource are mined, and the active monitoring of the index before the problem analysis and the steering problem are generated in the time of the resource faults in the prior art is finally realized.
The application divides the characteristic data into different data time windows based on the periodicity and distribution characteristics of the characteristic data and the service understanding of the data, and the characteristic data in the same data time window has similar numerical distribution characteristics.
Based on Gaussian distribution, the application automatically adjusts the threshold according to historical data of a time window, monitors the equipment characteristic parameter index with fluctuation through the dynamic threshold, and predicts the possibility of fault occurrence more timely and accurately through abnormal performance of the index.
Drawings
For a clearer description of the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present application and should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of the method of the present application.
Fig. 2 is a schematic diagram of the system of the present application.
FIG. 3 is a flow chart of a method and system for dynamic prediction of failure based on Gaussian distribution.
FIG. 4 is a diagram of 2000 historical data points received during the 1:00-2:00 period in the historical time window of example 2 of the present application.
Fig. 5 is a diagram showing the data amount transmitted by the device in the "normal" state according to embodiment 2 of the present application.
Fig. 6 is a diagram showing the data amount transmitted by the device in the "fail-one" state according to embodiment 2 of the present application.
Fig. 7 shows the data amount transmitted by the device in the "fail-two" state according to embodiment 2 of the present application.
FIG. 8 is a diagram showing the classification of the data of comparative test set of example 2 into "abnormal state" and "normal state" according to the present application.
Fig. 9 is a result of calculating epsilon when P (beta) =0.997 is set in example 2 of the present application.
Fig. 10 is a result of calculating epsilon when P (beta) =0.96 is set in example 2 of the present application.
Fig. 11 is a result of calculating epsilon when P (beta) =0.97 is set in example 2 of the present application.
Fig. 12 is a result of calculating epsilon when P (beta) =0.98 is set in example 2 of the present application.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present application more apparent, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are some embodiments of the present application, but not all embodiments of the present application. The components of the embodiments of the present application generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the application, as presented in the figures, is not intended to limit the scope of the application, as claimed, but is merely representative of selected embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further definition or explanation thereof is necessary in the following figures.
In the description of the embodiments of the present application, it should be noted that, if the terms "upper," "lower," "horizontal," "inner," and the like indicate an azimuth or a positional relationship based on the azimuth or the positional relationship shown in the drawings, or the azimuth or the positional relationship in which the inventive product is conventionally put in use, it is merely for convenience of describing the present application and simplifying the description, and does not indicate or imply that the apparatus or element to be referred to must have a specific azimuth, be configured and operated in a specific azimuth, and thus should not be construed as limiting the present application. Furthermore, the terms "first," "second," and the like, are used merely to distinguish between descriptions and should not be construed as indicating or implying relative importance.
Furthermore, the term "horizontal" if present does not mean that the component is required to be absolutely horizontal, but may be slightly inclined. As "horizontal" merely means that its direction is more horizontal than "vertical", and does not mean that the structure must be perfectly horizontal, but may be slightly inclined.
In the description of the embodiments of the present application, it should also be noted that, unless explicitly specified and limited otherwise, the terms "disposed," "mounted," "connected," and "connected" should be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art according to the specific circumstances.
The application is described in further detail below with reference to the attached drawing figures:
referring to fig. 1, the embodiment of the application discloses a fault dynamic prediction method based on gaussian distribution, which comprises the following steps:
S1, continuously monitoring and collecting the characteristics related to the equipment state and the corresponding characteristic data thereof.
In practical applications, the feature related to the device state may be one or a combination of multiple features capable of reflecting the device state, such as a CPU temperature, a packet loss rate, or a response delay of the device.
S2, dividing the characteristic data to obtain a plurality of data time windows.
In some embodiments, the feature data is partitioned into a plurality of different data time windows according to the periodicity of the feature data, the distribution characteristics, and the business understanding of the feature data.
In practical applications, the feature data in the same data time window have similar numerical distribution features.
S3, constructing a mapping relation model of the equipment state and the characteristic data by using a Gaussian distribution method.
In some embodiments, the mean and variance of the features are calculated to obtain a gaussian distribution of the features, and the probability distribution of the gaussian distribution and a default threshold are determined.
In some embodiments, the two-dimensional data samples D of the characteristic parameters of the data time window and its device state are as follows:
d= { (x i,yi)|xi∈R,yi. Epsilon. [ normal, abnormal ], I. Epsilon. I)
Wherein x i is a device characteristic value, y i is a device state, and I is a data index;
after the two-dimensional data sample D is divided into a training set D train and a test set D test, taking a characteristic parameter X (n) with a normal corresponding y i in the training set D train:
X(n)=x1,x2,...,xn
the mean mu and variance sigma 2 of the characteristic parameter X (n) are calculated:
The degree of deviation of a certain characteristic value X from the mean value mu is represented by delta, and the probability P (delta) of random distribution of X i in the range [ mu-delta, mu+delta ] in the characteristic parameter X (n) is calculated as follows:
Where ρ (X) is a probability density function of the eigenvalue X in the gaussian distribution of X (n):
Substitution of (4) into (3) gives:
considering a value β such that P (β) =0.95, i.e., the probability that the eigenvalues in X (n) are randomly distributed over the range [ μ - β, μ+β ] is 95%; inverse solving the integral equation to calculate a value beta;
Then, a default threshold ε is calculated using β:
ε=ρ(μ-β;μ,σ2) (6)。
and S4, calculating probability distribution and a threshold value of each data time window according to the mapping relation model, and comparing the probability distribution with the default threshold value to obtain a fault prediction result.
In practical application, in the data time window, if the gaussian distribution density ρ of the characteristic value is smaller than the threshold epsilon, the equipment state is judged to be abnormal.
As shown in fig. 2, an embodiment of the present application discloses a system for dynamic fault prediction based on gaussian distribution, which includes:
The data acquisition module is used for continuously monitoring and acquiring the characteristics related to the equipment state and the corresponding characteristic data thereof;
The data dividing module is used for dividing the characteristic data to obtain a plurality of data time windows;
the model construction module is used for constructing a mapping relation model of the equipment state and the characteristic data by using a Gaussian distribution method;
and the fault prediction module is used for calculating probability distribution and a threshold value of each data time window according to the mapping relation model, and comparing the probability distribution with the default threshold value to obtain a fault prediction result.
The principle of the application is as follows:
The gaussian distribution is used for determining a reference fault threshold of the index and provides support for final threshold prediction of various resources in various scenes. The gaussian distribution is used to describe the probability distribution of the occurrence of the data samples, and the probability distribution of the population is estimated by calculating the mean and variance of the sample values. The basic idea of the anomaly detection algorithm based on Gaussian distribution is to calculate the Gaussian distribution probability of a certain resource characteristic value sample, wherein the probability is larger than or equal to a threshold value and is normal, and the probability is smaller than the threshold value and is abnormal. The mathematical connotation of anomaly detection is derived from a rule of mathematical statistics that small probability events will not generally occur and are considered to be anomalous once they occur.
Constructing a resource abnormality detection algorithm, and firstly selecting a characteristic value related to a resource state, such as the CPU utilization rate of a server. And then calculating the mean value and the variance of each feature, further obtaining the Gaussian distribution of each feature, and determining the abnormal probability threshold value of the Gaussian distribution. Based on the basic theory of the Gaussian distribution detection algorithm, the probability distribution and the threshold value of the resource characteristic value data sample are calculated, and the abnormal state of the resource is rapidly mined and detected by comparing the Gaussian distribution value of the detection sample with the threshold value. The Gaussian normal distribution model constructed by the Gaussian distribution algorithm can rapidly and accurately analyze the state of the resource index and effectively identify the reference threshold value, so that support is provided for dynamic correction of the threshold value. The gaussian distribution can form corresponding distribution for different types of indexes and indexes under different scenes, so that corresponding reference thresholds are obtained.
Example 1:
the embodiment discloses a fault dynamic prediction method based on Gaussian distribution, which comprises the following steps:
Step 1: data preparation
And selecting the relevant characteristics of the equipment state and corresponding characteristic values, such as CPU temperature, packet loss rate, response delay and the like of the server. The feature data comes from the continuous monitoring and collection of features.
Step 2: data periodic decomposition
The data is divided into different time windows based on periodicity of the feature data, distribution of the features, and business understanding of the data. The feature data has similar numerical distribution features in the same time window. Taking the CPU temperature of a certain server as an example, the temperature of the server is lowest between 2 and 5 am, which corresponds to the lowest access amount in the time period, so that between 2 and 5 am can be used as a time window of the CPU temperature data. In the present application, it is assumed that the eigenvalues conform to a gaussian distribution, and that there is a significant difference in the mean and variance of the eigenvalues across different data time windows. In addition, since updating of feature data may cause the periodicity and distribution characteristics of the data to change, it is necessary to dynamically adjust the data time window and perform a gaussian distribution-based failure prediction for each window.
Step 3: modeling method selection
And constructing a mapping relation between the equipment state and the characteristic value by using a Gaussian distribution method. And calculating the mean and variance of the features, and further obtaining the Gaussian distribution of the features. An anomaly probability threshold for the gaussian distribution is determined. Based on the basic theory of the Gaussian distribution detection algorithm, the probability distribution and the threshold value of the equipment characteristic value sample are calculated, and the Gaussian distribution value of the detection sample is compared with the threshold value to quickly mine and detect the abnormal state of the equipment.
Step 4: threshold calculation
The two-dimensional data samples of the characteristic parameters in a certain data time window of the device and the state of the device are expressed as:
d= { (x i,yi)|xi∈R,yi. Epsilon. [ normal, abnormal ], I. Epsilon. I })
Where x i is the device characteristic value, y i is the device state, and I is the data index.
After the sample D is divided into the training set D train and the test set D test, the feature parameters of the training set D train, which correspond to y i and are normal, are expressed as follows:
X(n)=x1,x2,...,xn
the mean μ and variance σ 2 of X (n) are calculated as follows:
The degree of deviation of a certain characteristic value X from the mean value mu is represented by delta, and the probability P (delta) that X i is randomly distributed in the range [ mu-delta, mu+delta ] in X (n) can be calculated as follows:
Where ρ (X) is a probability density function of the eigenvalue X in the gaussian distribution of X (n):
Substitution of (4) into (3) gives:
the value β is considered such that P (β) =0.95, i.e. the probability that the eigenvalues are randomly distributed in the range [ μ - β, μ+β ] in X (n) is 95%. The integral equation is solved inversely to calculate the value β.
Then, a default threshold ε is calculated using β:
ε=ρ(μ-β;μ,σ2) (6)
that is, in this time window, if the gaussian distribution density ρ of the feature value is smaller than the threshold value ε, the device state is determined to be abnormal.
Step 5: classification and assessment
Classifying the test data set of the state parameter of the equipment, calculating the precision p, recall r and F1 score, and further checking the classification accuracy. The calculation method comprises the following steps:
Wherein: TP is the number of correctly predicted normal samples (both detected as abnormal and actually abnormal); FP is the number of abnormal samples that are mispredicted as normal (detected as abnormal actually normal); FN is the number of normal samples that are mispredicted as abnormal (detected as normal and actually abnormal).
When the F1 score is not ideal, different values of P (δ) are attempted to be used to calculate different thresholds epsilon. After attempting to use different thresholds epsilon, the epsilon value with the highest F1 score is used to improve the classification accuracy.
Step 6: time advance
When the time advances to the next time window, the steps are repeated using the data in the time window to continue monitoring the device status and detecting anomalies.
Example 2:
There is some equipment, and when the equipment is normally operated, the data volume that it sent has normal distribution characteristics according to the time of use. The faults of the equipment mainly comprise:
(a) The data amount transmitted increases due to abnormal operation of the device;
(b) The CPU cooling system does not normally operate, so that the working progress of the equipment is slow.
The main working time of the equipment is 1:00-6:00 pm. The size of the data volume sent out in a 30 second time interval is recorded every 30 seconds, with one time window per hour, according to the usage characteristics.
As shown in fig. 4, square dots represent history data with a label of "failure one", x "dots represent history data with a label of" normal ", and dots represent history data with a label of" failure two ". The dashed lines represent the gaussian distribution of the corresponding data.
In terms of splitting, in the historical data, the data volume sent by the device in the "normal" state is as shown in fig. 5, and accords with the gaussian distribution characteristics:
in the history data, the data amount transmitted by the device in the "failure one" state is as shown in fig. 6. When the fault occurs, the data volume sent by the equipment has the characteristics of normal distribution and lower temperature than the normal operation.
In the history data, the data amount transmitted by the device in the "failure two" state is as shown in fig. 7. When the fault occurs, the data volume sent by the equipment has the characteristics of normal distribution and higher temperature than that in normal operation.
After the historical data are divided into a training data set and a test data set according to the proportion of 7:3 (namely 1400 training data points and 600 test data points), the CPU temperature mean value rho and standard deviation sigma in the normal state of the equipment in the training data set are calculated by using formulas (1) - (2), and the result is obtained
μ≈2.0035,σ≈0.25002
When P (β) =0.95 is set, ε is calculated using equations (5) - (6), namely:
ε=ρ(μ-β;μ,,σ2)
Obtaining
ε≈0.233762
By comparing the Gaussian distribution ρ of the data of the test set, the data of the test set is classified into an "abnormal state" and a "normal state", and then the labels thereof are compared, so that the data shown in fig. 8 is obtained.
Among them, 248 data points of TP (predicted as abnormal and actually abnormal); there are 335 data points of TN (predicted as normal, as well as actual as normal); 13 data points of FP (predicted to be abnormal and actually normal); FN (predicted normal, actually abnormal) has 4 data points.
The F1 score is calculated as follows:
Try other P (β):
as shown in fig. 9, when P (β) =0.997, f1≡96.08%;
as shown in fig. 10, when P (β) =0.96, f1≡97.25%;
as shown in fig. 11, when P (β) =0.97, f1≡ 97.43%;
as shown in fig. 12, when P (β) =0.98, f1≡97.80%.
Based on current data, the threshold calculated using P (β) =0.98 can provide a more accurate prediction. At this time, the threshold epsilon=0.106. That is, if the device is operating properly, the size of the data to be transmitted should be in interval 1.42,2.59.
Upon entering a new time window, the system will repeat the above steps to calculate a new threshold epsilon.
The embodiment of the application provides computer equipment. The computer device of this embodiment includes: a processor, a memory, and a computer program stored in the memory and executable on the processor. The steps of the various method embodiments described above are implemented when the processor executes the computer program. Or the processor, when executing the computer program, performs the functions of the modules/units in the above-described device embodiments.
The computer program may be divided into one or more modules/units, which are stored in the memory and executed by the processor to accomplish the present application.
The computer equipment can be a desktop computer, a notebook computer, a palm computer, a cloud server and other computing equipment. The computer device may include, but is not limited to, a processor, a memory.
The Processor may be a central processing unit (Central Processing Unit, CPU), but may also be other general purpose processors, digital signal processors (DIGITAL SIGNAL Processor, DSP), application SPECIFIC INTEGRATED Circuit (ASIC), off-the-shelf Programmable gate array (Field-Programmable GATE ARRAY, FPGA) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like.
The memory may be used to store the computer program and/or modules, and the processor may implement various functions of the computer device by running or executing the computer program and/or modules stored in the memory, and invoking data stored in the memory.
The modules/units integrated with the computer device may be stored in a computer readable storage medium if implemented in the form of software functional units and sold or used as stand alone products. Based on such understanding, the present application may implement all or part of the flow of the method of the above embodiment, or may be implemented by a computer program to instruct related hardware, where the computer program may be stored in a computer readable storage medium, and when the computer program is executed by a processor, the computer program may implement the steps of each of the method embodiments described above. Wherein the computer program comprises computer program code which may be in source code form, object code form, executable file or some intermediate form etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth. It should be noted that the computer readable medium contains content that can be appropriately scaled according to the requirements of jurisdictions in which such content is subject to legislation and patent practice, such as in certain jurisdictions in which such content is subject to legislation and patent practice, the computer readable medium does not include electrical carrier signals and telecommunication signals.
The above is only a preferred embodiment of the present application, and is not intended to limit the present application, but various modifications and variations can be made to the present application by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the protection scope of the present application.
Claims (8)
1. The dynamic fault prediction method based on Gaussian distribution is characterized by comprising the following steps of:
continuously monitoring and collecting the characteristics related to the equipment state and the corresponding characteristic data thereof;
dividing the characteristic data to obtain a plurality of data time windows;
Constructing a mapping relation model of the equipment state and the characteristic data by using a Gaussian distribution method;
calculating probability distribution and a default threshold value of each data time window according to the mapping relation model, and comparing the probability distribution with the default threshold value to obtain a fault prediction result;
Calculating probability distribution and a default threshold value of each data time window according to the mapping relation model, wherein the calculation comprises calculating the mean value and variance of the features, further obtaining Gaussian distribution of the features, and determining the probability distribution and the default threshold value of the Gaussian distribution;
The determining the probability distribution of the Gaussian distribution and the default threshold value comprises the following steps:
characteristic parameters of a data time window and two-dimensional data samples of its device state The following are provided:
Wherein, For the device feature value,/>For device status,/>Data index;
sample two-dimensional data Divided into training sets/>And test set/>Later, get training set/>Corresponds to/>Is normal characteristic parameter/>:
Calculating characteristic parametersMean/>Variance/>:
To be used forRepresenting a characteristic value/>For mean/>Calculating the degree of deviation of the characteristic parameter/>Middle/>Randomly distributed in a rangeProbability/>The following are provided:
Wherein, Is characteristic value/>At/>Probability density function in gaussian distribution of (c):
Substitution of (4) into (3) gives:
consider a value of Make/>I.e./>The characteristic values of the medium are randomly distributed in the range/>The probability of (2) is 95%; inverse solving the integral equation to calculate the value/>;
Subsequently, use is made ofCalculating a default threshold:
Wherein, Is a default threshold.
2. The gaussian distribution based fault dynamic prediction method according to claim 1, wherein said characteristics related to the state of the device include CPU temperature, packet loss rate and response delay of the device.
3. The method for dynamically predicting failure based on gaussian distribution according to claim 1, wherein said dividing the feature data into a plurality of data time windows comprises:
And dividing the characteristic data according to the periodicity, the distribution characteristics and the service understanding of the characteristic data to obtain a plurality of different data time windows.
4. A gaussian distribution based fault dynamic prediction method according to claim 1 or 3, characterized in that the feature data in the same data time window have similar numerical distribution features.
5. The gaussian distribution-based fault dynamic prediction method according to claim 1, wherein comparing the probability distribution with a default threshold value yields a fault prediction result, comprising:
If the Gaussian distribution density of the characteristic values is within the data time window Less than a default threshold/>And judging the equipment state as abnormal.
6. A gaussian distribution-based fault dynamic prediction system for implementing the method of claim 1, comprising:
The data acquisition module is used for continuously monitoring and acquiring the characteristics related to the equipment state and the corresponding characteristic data thereof;
The data dividing module is used for dividing the characteristic data to obtain a plurality of data time windows;
the model construction module is used for constructing a mapping relation model of the equipment state and the characteristic data by using a Gaussian distribution method;
And the fault prediction module is used for calculating probability distribution and a default threshold value of each data time window according to the mapping relation model, and comparing the probability distribution with the default threshold value to obtain a fault prediction result.
7. An electronic device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the method according to any of claims 1-5 when the computer program is executed.
8. A computer readable storage medium storing a computer program, characterized in that the computer program when executed by a processor implements the steps of the method according to any one of claims 1-5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311475288.3A CN117527611B (en) | 2023-11-07 | 2023-11-07 | Gaussian distribution-based fault dynamic prediction method, system, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311475288.3A CN117527611B (en) | 2023-11-07 | 2023-11-07 | Gaussian distribution-based fault dynamic prediction method, system, electronic equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117527611A CN117527611A (en) | 2024-02-06 |
CN117527611B true CN117527611B (en) | 2024-06-07 |
Family
ID=89759993
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311475288.3A Active CN117527611B (en) | 2023-11-07 | 2023-11-07 | Gaussian distribution-based fault dynamic prediction method, system, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117527611B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110830450A (en) * | 2019-10-18 | 2020-02-21 | 平安科技(深圳)有限公司 | Abnormal flow monitoring method, device and equipment based on statistics and storage medium |
CN111913850A (en) * | 2020-07-31 | 2020-11-10 | 北京嘀嘀无限科技发展有限公司 | Data anomaly detection method, device, equipment and storage medium |
CN113297790A (en) * | 2021-05-19 | 2021-08-24 | 哈尔滨工业大学 | High-speed rail response prediction method based on sparse Bayesian width learning |
CN114816901A (en) * | 2022-02-28 | 2022-07-29 | 南开大学 | Method for evaluating health state of software after change in AI (artificial intelligence) mode |
CN116304909A (en) * | 2023-03-13 | 2023-06-23 | 天翼云科技有限公司 | Abnormality detection model training method, fault scene positioning method and device |
CN116821141A (en) * | 2022-03-21 | 2023-09-29 | 中兴通讯股份有限公司 | Data updating method, fault diagnosis method, electronic device, and storage medium |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10984154B2 (en) * | 2018-12-27 | 2021-04-20 | Utopus Insights, Inc. | System and method for evaluating models for predictive failure of renewable energy assets |
-
2023
- 2023-11-07 CN CN202311475288.3A patent/CN117527611B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110830450A (en) * | 2019-10-18 | 2020-02-21 | 平安科技(深圳)有限公司 | Abnormal flow monitoring method, device and equipment based on statistics and storage medium |
CN111913850A (en) * | 2020-07-31 | 2020-11-10 | 北京嘀嘀无限科技发展有限公司 | Data anomaly detection method, device, equipment and storage medium |
CN113297790A (en) * | 2021-05-19 | 2021-08-24 | 哈尔滨工业大学 | High-speed rail response prediction method based on sparse Bayesian width learning |
CN114816901A (en) * | 2022-02-28 | 2022-07-29 | 南开大学 | Method for evaluating health state of software after change in AI (artificial intelligence) mode |
CN116821141A (en) * | 2022-03-21 | 2023-09-29 | 中兴通讯股份有限公司 | Data updating method, fault diagnosis method, electronic device, and storage medium |
CN116304909A (en) * | 2023-03-13 | 2023-06-23 | 天翼云科技有限公司 | Abnormality detection model training method, fault scene positioning method and device |
Non-Patent Citations (1)
Title |
---|
基于多元高斯分布的数据质量波动监测模型研究;王捷;马红艳;;电信工程技术与标准化;20190515(第05期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN117527611A (en) | 2024-02-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111045894B (en) | Database abnormality detection method, database abnormality detection device, computer device and storage medium | |
CN112712113B (en) | Alarm method, device and computer system based on index | |
CN114978568B (en) | Data center management using machine learning | |
CN109656793A (en) | A kind of information system performance stereoscopic monitoring method based on multi-source heterogeneous data fusion | |
CN112188531B (en) | Abnormality detection method, abnormality detection device, electronic apparatus, and computer storage medium | |
EP2487860A1 (en) | Method and system for improving security threats detection in communication networks | |
US20110161048A1 (en) | Method to Optimize Prediction of Threshold Violations Using Baselines | |
EP3927000B1 (en) | Network element health status detection method and device | |
CN109088775B (en) | Abnormity monitoring method and device and server | |
EP2951654B1 (en) | Methods and systems for online monitoring using a variable data sampling rate | |
EP2918976A1 (en) | Smart meter Privacy Analyzer | |
CN112671767A (en) | Security event early warning method and device based on alarm data analysis | |
KR101281460B1 (en) | Method for anomaly detection using statistical process control | |
CN115311829A (en) | Accurate alarm method and system based on mass data | |
CN117527611B (en) | Gaussian distribution-based fault dynamic prediction method, system, electronic equipment and storage medium | |
CN112380073B (en) | Fault position detection method and device and readable storage medium | |
CN113992496B (en) | Abnormal alarm method and device based on quartile algorithm and computing equipment | |
CN114297034B (en) | Cloud platform monitoring method and cloud platform | |
CN113835961B (en) | Alarm information monitoring method, device, server and storage medium | |
CN112398706B (en) | Data evaluation standard determining method and device, storage medium and electronic equipment | |
JP7095443B2 (en) | Anomaly detection program, anomaly detection method and anomaly detection device | |
US20200112577A1 (en) | Graph-based sensor ranking | |
JP2018191217A (en) | Data monitoring apparatus, data monitoring method, and data monitoring program | |
EP3457609B1 (en) | System and method for computing of anomalies based on frequency driven transformation and computing of new features based on point anomaly density | |
CN118759245B (en) | Power adapter-based current overload monitoring method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |