Disclosure of Invention
The invention provides a method, a system, a device and a medium for positioning root causes of equipment parameters. The invention not only makes up the defect that the existing method cannot effectively analyze suspicious parameters of single equipment, but also greatly reduces the time cost of the traditional analysis method, realizes quick root cause searching, reduces the burden of manually processing a large amount of resume parameter data, improves the positioning efficiency of bad root causes and reduces the production cost.
In order to achieve the above object, in one aspect, the present invention provides a method for positioning a root cause of a parameter of a device, a process device for a product comprising a device A 1 To apparatus A n N is an integer greater than or equal to 2, the method comprising:
for device a 1 To apparatus A n Obtaining the corresponding association strength value and correction of each deviceThe positive predictive label is used for sorting all the obtained association strength values in a descending order, obtaining correction predictive label data corresponding to a plurality of bits of equipment before sorting, and marking the correction predictive label data as fifth analysis data;
aiming at fifth analysis data, sequentially analyzing each device, extracting second analysis data attribute column data and correction prediction label data of the corresponding device as independent variables and classification dependent variables respectively, analyzing independent variable linear combination of optimal classification dependent variables to obtain combination coefficients, marking the combination coefficients as weight coefficients of device parameters, and sequencing according to the descending order of absolute values of the weight coefficients;
and integrating the combined coefficients by taking the equipment parameters as indexes to form an equipment parameter root cause positioning table, taking the product of the association strength value corresponding to each equipment in the equipment parameter root cause positioning table and the absolute value of the weight coefficient as a sorting field, sorting according to descending order to obtain a sorting result, and obtaining the equipment parameter root cause positioning result based on the sorting result.
For device a 1 To apparatus A n The following treatments were respectively carried out:
for device a i I is more than or equal to 1 and less than or equal to n, and the equipment A is provided with i The recorded actual values of the parameters are arranged to form first analysis data by taking the product names as indexes and the parameter names as attribute names;
normalizing the attribute data in the first analysis data to obtain second analysis data;
the data of the attribute column in the second analysis data are taken out, the principal component in the attribute column data is obtained, the principal component interpretation variance is sequenced according to the principal component interpretation variance, and candidate principal components are selected from the principal components based on the principal component sequencing result and the principal component accumulation interpretation variance duty ratio;
performing linear transformation on the input second analysis data by using the coefficient matrix of the candidate principal component to obtain analysis matrix data, wherein the analysis matrix data is marked as third analysis data;
for the third analysis data, clustering to form 2 categories by using a clustering algorithm, and outputting and recording a clustering label of each product sample to obtain fourth analysis data;
taking out the label data of the product, which are finally detected to be good and bad, and carrying out association analysis on fourth analysis data by taking the product name as an index, grouping according to the clustering labels of the product samples, respectively counting bad occupation ratios, and outputting the maximum bad occupation ratio in the grouping result to obtain an association strength value;
and (3) adjusting the cluster label with the largest reject ratio and the actual reject ratio to be 1 and the rest to be 0 to obtain the correction prediction label.
Preferably, the present invention is directed to device a i I is more than or equal to 1 and less than or equal to n, and the equipment A is provided with i The recorded actual values of the parameters are used as indexes, the names of the parameters are used as attribute names to form a wide table, attribute columns with the deletion rate of the wide table exceeding a preset percentage and attribute columns with the value of the attribute columns being constant and the standard deviation being 0 are removed, the reserved attribute columns are subjected to interpolation of the deletion value by the median of the attribute to obtain first analysis data, wherein the method requires that the data does not have the deletion value, if the deletion needs to be interpolated, if the deletion rate is too high, the interpolation deletion value distorts the data misleading analysis result, and the data needs to be removed; in addition, the constant value attribute does not provide any information, and the complexity of increasing the data dimension by the aid of the evidence is eliminated.
Preferably, the method centers the first analysis data according to the attribute mean and divides the first analysis data by the standard deviation to obtain second analysis data. The method firstly uses principal component analysis, requires data standardization and prevents the influence of individual attribute dimension on analysis results.
Preferably, the method obtains principal components in the attribute column data using principal component analysis. The original data has high parameter attribute dimension, high noise and collinearity, and can interfere subsequent cluster analysis, so that the main components are required to be extracted first to obtain main useful information.
The specific steps or modes for acquiring the principal components in the attribute column data by using principal component analysis are as follows:
1. calculating a covariance matrix of the second analysis data;
2. performing feature decomposition of the covariance matrix, wherein the feature vector is a principal component coefficient, and the feature value is recorded as a principal component interpretation variance;
3. and taking principal components with the cumulative sum of the orders exceeding 80% according to the descending order of interpretation variance.
Preferably, the method uses a clustering algorithm to cluster the third analysis data to form 2 categories, and in this embodiment, a Kmeans clustering algorithm is used, but the method is not limited to a Kmeans clustering algorithm, such as spectral clustering, systematic clustering, and the like.
Preferably, the method uses linear discriminant analysis to give linear combinations of independent variables of the optimal partition classification dependent variables to obtain the combination coefficients. The method is expected to find independent variables capable of identifying the classified dependent variables and contribution effects of the independent variables, and the discriminant analysis and integration meet requirements.
The method for obtaining the combination coefficient by using the independent variable linear combination of the optimal classification dependent variable given by the linear discriminant analysis specifically comprises the following steps: and setting the undetermined coefficient as weight, carrying out weighted summation on independent variables, counting the intra-group variance and the inter-group variance according to two classification groups, and dividing the intra-group variance by the inter-group variance to take the value to minimize as an optimization target, so as to obtain an optimization problem solution, namely a solution of the undetermined coefficient.
The invention also provides a system for positioning the root cause of the equipment parameters, and the processing equipment of the product comprises equipment A 1 To apparatus A n N is an integer greater than or equal to 2, the system comprising:
a first analysis data obtaining unit to be processed for the device A 1 To apparatus A n Obtaining an association strength value and a correction prediction label corresponding to each device, sorting all the obtained association strength values in a descending order, obtaining correction prediction label data corresponding to a plurality of devices before sorting, and marking the correction prediction label data as first to-be-processed analysis data;
a second analysis data obtaining unit for obtaining analysis data for the device A i I is more than or equal to 1 and less than or equal to n, and the equipment A is provided with i Taking the recorded actual value of the parameter as an index, taking the parameter name as an attribute name, and carrying out standardization processing on the attribute data to obtain second analysis data to be processed;
a weight coefficient obtaining and sorting unit for device parameters, which is used for sequentially analyzing each device for the first to-be-processed analysis data, extracting attribute column data and correction prediction label data in the second to-be-processed analysis data corresponding to the device as independent variables and classification dependent variables respectively, analyzing independent variable linear combination of optimal partition classification dependent variables to obtain combination coefficients, wherein the combination coefficients are recorded as weight coefficients of equipment parameters, and are sorted according to descending order of absolute values of the weight coefficients;
the device parameter root cause positioning result obtaining unit is used for integrating the combination coefficients by taking the device parameters as indexes to form a device parameter root cause positioning table, taking the product of the association strength value corresponding to each device in the device parameter root cause positioning table and the absolute value of the weight coefficient as a sequencing field, sequencing the device parameter root cause positioning table according to descending order to obtain a sequencing result, and obtaining the device parameter root cause positioning result based on the sequencing result.
The invention also provides a device for positioning the root cause of the equipment parameter, which comprises a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein the processor realizes the steps of the method for positioning the root cause of the equipment parameter when executing the computer program.
The invention also provides a computer readable storage medium storing a computer program which when executed by a processor implements the steps of the device parameter root cause positioning method.
The one or more technical schemes provided by the invention have at least the following technical effects or advantages:
generally, the production resume equipment parameter data has the characteristics of large data volume, dispersed effective information, complex flow and the like, and the traditional manual search analysis and investigation mode has low efficiency. The invention can form an automatic searching and matching step, all the most suspicious root causes are listed, and the most suspicious root causes are sorted in descending order according to the degree of suspicion of equipment parameters, so that technicians are assisted to directly verify from the most suspicious root causes, and the most suspicious root causes are positioned as much as possible at the highest speed.
The invention overcomes the defects of slow speed and unsatisfactory effect of the traditional automatic analysis method on the analysis of the equipment parameter data, not only eliminates the interference of the follow-up equipment parameters on the analysis of poor correlation, but also can easily expand and consider the combined action of different equipment parameters by combining with the path analysis, and has the advantages of practicability and reliability.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description. In addition, the embodiments of the present invention and the features in the embodiments may be combined with each other without collision.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than within the scope of the description, and the scope of the invention is therefore not limited to the specific embodiments disclosed below.
It will be understood that the terms "a" and "an" should be interpreted as referring to "at least one" or "one or more," i.e., in one embodiment, the number of elements may be one, while in another embodiment, the number of elements may be plural, and the term "a" should not be interpreted as limiting the number.
Example 1
The invention will be specifically described below by taking a product as glass as an example, and in practical application, the product can be other types of products, and the invention is not limited to the specific types of products.
Referring to fig. 1, the invention provides a method for positioning root causes of equipment parameters by comprehensively applying a principal component clustering method and a discriminant analysis method.
The first embodiment of the invention provides a method for positioning the root cause of equipment parameters, and the manufacturing equipment of the product comprises equipment A 1 To apparatus A n N is an integer greater than or equal to 2, the method comprising:
for device a 1 To apparatus A n Obtaining an association strength value and a correction prediction label corresponding to each device, sorting all the obtained association strength values in a descending order, obtaining correction prediction label data corresponding to a plurality of devices before sorting, and marking the correction prediction label data as first to-be-processed analysis data;
for device a i I is more than or equal to 1 and less than or equal to n, and the equipment A is provided with i Taking the recorded actual value of the parameter as an index, taking the parameter name as an attribute name, and carrying out standardization processing on the attribute data to obtain second analysis data to be processed;
for the first to-be-processed analysis data, sequentially analyzing each device, extracting attribute column data and correction prediction tag data in the second to-be-processed analysis data corresponding to the device as independent variables and classification dependent variables respectively, analyzing independent variable linear combination of optimal division classification dependent variables to obtain combination coefficients, recording the combination coefficients as weight coefficients of device parameters, and sequencing according to the descending order of absolute values of the weight coefficients;
and integrating the combined coefficients by taking the equipment parameters as indexes to form an equipment parameter root cause positioning table, taking the product of the association strength value corresponding to each equipment in the equipment parameter root cause positioning table and the absolute value of the weight coefficient as a sorting field, sorting according to descending order to obtain a sorting result, and obtaining the equipment parameter root cause positioning result based on the sorting result.
The method comprises the following specific implementation steps:
Step1:
assuming analysis processing equipment A, the actual values of parameters recorded by the equipment are indexed by glass names, the parameter names are arranged by attribute names to form a wide table, attribute columns with the deletion rate exceeding 30% and constant attribute columns are removed, the reserved attribute columns are interpolated by the deletion value of the median of the attribute, the result table is shown as an analysis table 1, wherein the percentage of the removed attribute columns can be flexibly adjusted according to actual needs, and the embodiment of the invention is not particularly limited.
Step2:
The analysis table 1 is normalized by the attribute mean value and divided by the standard deviation, and the result is expressed as analysis 2.
Step3:
The data of the attribute column in the analysis table 2 is taken out, principal components are found by using principal component analysis and are sorted in descending order of interpretation variance, and candidate principal components which are the principal components in front and whose accumulated variance interpretation ratio exceeds 80% are selected. And (3) performing linear transformation on the input analysis data by using the screened principal component coefficient matrix to obtain a scoring matrix, wherein matrix table data are recorded as analysis data 3. The selection percentage of the candidate principal components can be flexibly adjusted according to actual needs, and the embodiment of the invention is not particularly limited. The principal component coefficient matrix is a matrix formed by obtaining a feature vector column corresponding to a principal component according to a pre-description screening method, the screening is a post-screening result which is formed by sorting the principal component according to an interpretation variance descending order and accumulating the condition that the interpretation variance ratio exceeds 80% after the principal component analysis, and the screening is performed to extract main information in an original high-dimensional variable and avoid noise and co-linearity interference.
Step4:
Taking out the analysis data 3, clustering the analysis data into 2 categories by using a Kmeans clustering algorithm, outputting and recording a clustering label of each glass sample, and recording the clustering label as analysis data 4; other clustering algorithms can be selected for clustering in practical application, and the embodiment does not limit specific clustering modes and algorithms.
Step5:
And (3) taking out the label data of the final detected good and bad glass, taking the glass name as index association analysis data 4, grouping according to the clustering labels of the glass samples, respectively counting the bad occupation ratio, outputting the maximum bad occupation ratio in the grouping result, and marking the maximum bad occupation ratio as an association strength value.
Step6:
The cluster label with the largest bad proportion and the actual bad is adjusted to be 1, the rest is adjusted to be 0, and the cluster label is recorded as a correction prediction label
Step7:
And (3) executing the operations of the steps 1-6 on each device, outputting corresponding association strength values, sorting the association strength values in descending order, enabling the device with the earlier sorting to be more suspicious, and acquiring correction prediction label data corresponding to the device with the earlier sorting, and recording the correction prediction label data as analysis data 5.
Step8:
And (5) extracting analysis data 5, sequentially analyzing each device, extracting attribute column data and correction prediction label data of an analysis table 2 of the corresponding device, respectively serving as independent variables and classification dependent variables, using linear discriminant analysis to give independent variable linear combination of optimal classification dependent variables, marking the combination coefficients as weight coefficients of device parameters, and sorting according to an absolute value descending order, wherein the more previous parameters are suspicious.
Step9:
And integrating the combined coefficients into a table by taking the equipment parameters as indexes, calculating the product of the correlation strength value and the absolute value of the weight coefficient as a sorting field, sorting in descending order, and assisting in positioning the root cause of the equipment parameters with bad glass when the former equipment parameters are more suspicious.
The method can form an automatic searching and matching step, all the most suspicious root causes are listed, and the most suspicious root causes are sorted in descending order according to the degree of suspicion of equipment parameters, so that technicians are assisted to directly verify from the most suspicious root causes, and the most suspicious root causes are positioned as much as possible at the highest speed.
The method overcomes the defects of slow speed and unsatisfactory effect of the traditional automatic analysis method on the analysis of the equipment parameter data, eliminates the interference of the follow-up equipment parameters on the analysis of poor correlation, can be combined with the path analysis to easily expand and consider the combined action of different equipment parameters, and has the advantages of practicability and reliability.
Example two
Referring to fig. 2, a second embodiment of the present invention provides an equipment parameter root cause positioning system, and a product manufacturing apparatus includes an equipment a 1 To apparatus A n N is an integer greater than or equal to 2, the system comprising:
a first analysis data obtaining unit to be processed for the device A 1 To apparatus A n Obtaining the corresponding association strength value and correction prediction label of each device, and obtainingAll the obtained association strength values are sorted in descending order, correction prediction tag data corresponding to a plurality of bits of equipment before sorting is obtained, and the correction prediction tag data is recorded as first analysis data to be processed;
a second analysis data obtaining unit for obtaining analysis data for the device A i I is more than or equal to 1 and less than or equal to n, and the equipment A is provided with i Taking the recorded actual value of the parameter as an index, taking the parameter name as an attribute name, and carrying out standardization processing on the attribute data to obtain second analysis data to be processed;
a weight coefficient obtaining and sorting unit for device parameters, which is used for sequentially analyzing each device for the first to-be-processed analysis data, extracting attribute column data and correction prediction label data in the second to-be-processed analysis data corresponding to the device as independent variables and classification dependent variables respectively, analyzing independent variable linear combination of optimal partition classification dependent variables to obtain combination coefficients, wherein the combination coefficients are recorded as weight coefficients of equipment parameters, and are sorted according to descending order of absolute values of the weight coefficients;
the device parameter root cause positioning result obtaining unit is used for integrating the combination coefficients by taking the device parameters as indexes to form a device parameter root cause positioning table, taking the product of the association strength value corresponding to each device in the device parameter root cause positioning table and the absolute value of the weight coefficient as a sequencing field, sequencing the device parameter root cause positioning table according to descending order to obtain a sequencing result, and obtaining the device parameter root cause positioning result based on the sequencing result.
Example III
The third embodiment of the invention provides a device parameter root cause positioning device, which comprises a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein the processor realizes the steps of the device parameter root cause positioning method when executing the computer program.
The processor may be a central processing unit (CPU, central Processing Unit), other general purpose processors, digital signal processors (digital signal processor), application specific integrated circuits (Application Specific Integrated Circuit), off-the-shelf programmable gate arrays (Fieldprogrammable gate array) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The memory may be used to store the computer program and/or the module, and the processor may implement various functions of the inventive device parameter root cause locating apparatus by running or executing the data stored in the memory. The memory may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, application programs required for at least one function (such as a sound playing function, an image playing function, etc.), and the like. In addition, the memory may include high-speed random access memory, and may also include non-volatile memory, such as a hard disk, memory, plug-in hard disk, smart memory card, secure digital card, flash memory card, at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
Example IV
A fourth embodiment of the present invention provides a computer readable storage medium storing a computer program, where the computer program when executed by a processor implements the steps of the device parameter root cause positioning method.
The device parameter root cause location means may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a stand alone product. Based on such understanding that the present invention implements all or part of the flow of the method of the above-described embodiments, the steps of each method embodiment described above may also be implemented by a computer program stored in a computer readable storage medium, where the computer program when executed by a processor. Wherein the computer program comprises computer program code, object code forms, executable files, or some intermediate forms, etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash disk, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory, a random access memory, a point carrier signal, a telecommunication signal, a software distribution medium, and the like. It should be noted that the content of the computer readable medium can be appropriately increased or decreased according to the requirements of the legislation and the patent practice in the jurisdiction.
While preferred embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. It is therefore intended that the following claims be interpreted as including the preferred embodiments and all such alterations and modifications as fall within the scope of the invention.
It will be apparent to those skilled in the art that various modifications and variations can be made to the present invention without departing from the spirit or scope of the invention. Thus, it is intended that the present invention also include such modifications and alterations insofar as they come within the scope of the appended claims or the equivalents thereof.