[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN118037134B - Characteristic information extraction method and system for donkey-hide gelatin cake production process - Google Patents

Characteristic information extraction method and system for donkey-hide gelatin cake production process Download PDF

Info

Publication number
CN118037134B
CN118037134B CN202410430365.1A CN202410430365A CN118037134B CN 118037134 B CN118037134 B CN 118037134B CN 202410430365 A CN202410430365 A CN 202410430365A CN 118037134 B CN118037134 B CN 118037134B
Authority
CN
China
Prior art keywords
temperature
donkey
production data
vector
hide gelatin
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202410430365.1A
Other languages
Chinese (zh)
Other versions
CN118037134A (en
Inventor
东后坤
杨丽娜
王丽
王惠
卓立敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Dong'a Runkang Donkey Hide Gelatin Products Co ltd
Original Assignee
Shandong Dong'a Runkang Donkey Hide Gelatin Products Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Dong'a Runkang Donkey Hide Gelatin Products Co ltd filed Critical Shandong Dong'a Runkang Donkey Hide Gelatin Products Co ltd
Priority to CN202410430365.1A priority Critical patent/CN118037134B/en
Publication of CN118037134A publication Critical patent/CN118037134A/en
Application granted granted Critical
Publication of CN118037134B publication Critical patent/CN118037134B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2135Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/04Manufacturing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Human Resources & Organizations (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Artificial Intelligence (AREA)
  • Strategic Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Economics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Development Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • General Business, Economics & Management (AREA)
  • Manufacturing & Machinery (AREA)
  • Primary Health Care (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application relates to the technical field of data processing, in particular to a method and a system for extracting characteristic information in a donkey-hide gelatin cake production process, wherein the method comprises the following steps: obtaining a production data vector of donkey-hide gelatin cakes in each batch; constructing corresponding instability coefficients according to the data values of the production data vectors of each quality of each batch; obtaining the corresponding dissimilarity coefficient according to the sequence constructed by the production data vector; acquiring local weights of each production data under different qualities based on the local weights; obtaining a difference significance index according to the normalized vector of the production data, and obtaining the global weight of the production data; and extracting characteristic information according to the global weight and the local weight. Therefore, the main factors are strictly controlled, and the quality of the donkey-hide gelatin cake is higher and more controllable.

Description

Characteristic information extraction method and system for donkey-hide gelatin cake production process
Technical Field
The application relates to the technical field of data processing, in particular to a characteristic information extraction method and system in a donkey-hide gelatin cake production process.
Background
Donkey-hide gelatin cake is used as a traditional Chinese medicine health care product, and has a long history and a deep cultural background in China. The donkey-hide gelatin is used as a main raw material, is prepared by combining a plurality of auxiliary materials according to a specific formula and process, and is widely considered to have a plurality of health care effects of enriching blood, nourishing skin, regulating body functions and the like. Along with the improvement of the living standard and the enhancement of health consciousness of people, the market demand of the donkey-hide gelatin cake is increased year by year, and the production process and the quality control of the donkey-hide gelatin cake are promoted to be hot spots for research. However, the production process of donkey-hide gelatin cake is complex, and involves various raw materials and links, so how to ensure the stability and high quality of each batch of products becomes an important challenge for industry.
In the production process of the donkey-hide gelatin cake, factors such as raw material selection, proportion, processing conditions and the like have decisive influence on the quality of the final product. The characteristic information extraction technology provides a new view angle to solve the problem, and key factors affecting the quality of the donkey-hide gelatin cake are more accurately identified by analyzing production data, so that powerful support is provided for optimizing the production process and improving the quality of products. However, the existing feature extraction method has some limitations in the donkey-hide gelatin cake production process, and many methods only pay attention to a certain link of the production process, neglect other factors possibly important as well, and the one-sided analysis mode is difficult to comprehensively reflect the real situation of the production process, and is more likely to cause misjudgment of product quality. Therefore, there is an urgent need for a more comprehensive and efficient feature information extraction method, which can consider multiple factors in the production process and accurately evaluate their influence on the quality of donkey-hide gelatin cake. The method is not only beneficial to improving the intelligentized level of donkey-hide gelatin cake production, but also provides powerful technical support for ensuring the product quality of the donkey-hide gelatin cake and meeting the market demand.
Disclosure of Invention
In order to solve the technical problems, the application aims to provide a characteristic information extraction method and a characteristic information extraction system for a donkey-hide gelatin cake production process, and the adopted technical scheme is as follows:
In a first aspect, an embodiment of the present application provides a method for extracting feature information in a donkey-hide gelatin cake production process, where the method includes the following steps:
obtaining a production data vector of donkey-hide gelatin cakes in each batch;
dividing the donkey-hide gelatin cake into three qualities, and forming a temperature sequence for all temperature vectors in any one quality; acquiring a temperature variation set and a temperature stability set according to data in the temperature vector; acquiring a temperature dissimilarity coefficient according to the cosine similarity and the number of temperature vectors in the temperature sequence; acquiring an instability coefficient of temperature according to elements in the temperature variation set and the temperature stability set; obtaining local weight of the temperature under any quality according to the instability coefficient and the dissimilarity coefficient of the temperature; obtaining local weights of each production data under different qualities according to a method for obtaining the significance coefficient of the temperature;
Normalizing the temperature sequences with different qualities, and acquiring a difference significance index of the temperature according to the normalized temperature sequences with different qualities; acquiring the difference significance index of the rest production data according to the mode of acquiring the difference significance index of the temperature; acquiring global weights of each production data according to the difference significance indexes of all the production data;
and extracting characteristic information according to the global weight of all production data and the local weight under different qualities.
Preferably, the production data vector of the donkey-hide gelatin cake comprises a temperature vector, a humidity vector, a stirring vector and a pressure vector.
Preferably, the acquiring the temperature variation set and the temperature stability set according to the data in the temperature vector includes:
For the temperature vector, if the data values of the continuous moments exist in the temperature vector, the time corresponding to the continuous moments is the temperature duration time; the set of all the temperature duration is recorded as a temperature variation set;
Counting the temperature duration time corresponding to each temperature value, and adding the temperature duration time corresponding to each temperature value to be used as the total temperature duration time corresponding to the temperature value; the set of total duration of temperature is denoted as the temperature stability set.
Preferably, the obtaining the dissimilarity coefficient of the temperature according to the cosine similarity and the number of the temperature vectors in the temperature sequence includes:
Calculating cosine similarity of any two temperature vectors in the temperature sequence, calculating the same temperature vector, and accumulating all the cosine similarity as a first cosine similarity value;
the square of the number of temperature vectors in the temperature sequence is recorded as a first number;
The first number is differenced from the first cosine similarity value, and the exponential function value of the difference is recorded as the dissimilarity coefficient of the temperature.
Preferably, the obtaining the instability coefficient of the temperature according to the elements in the temperature variation set and the temperature stability set includes:
In the method, in the process of the invention, Indicates the number of elements in the temperature variation set of the a-th batch donkey-hide gelatin cake,/>Represents the temperature stability set of the donkey-hide gelatin cake of the a batch,/>Representing the variance of the temperature stability set of batch a of donkey-hide gelatin cake,/>Indicates the number of donkey-hide gelatin cake batches,/>Representing an exponential function based on a natural constant,/>Representing the coefficient of instability of the temperature,/>Is an adjustment parameter.
Preferably, the obtaining the local weight of the temperature under any quality according to the instability coefficient and the dissimilarity coefficient of the temperature includes:
Making the dissimilarity coefficient of the temperature and the value 1 different, and recording the sum of the difference and the instability coefficient of the temperature as the significance coefficient of the temperature;
The sum of the saliency coefficients of all production data is marked as a saliency total value, and the ratio of the saliency coefficient of each type of production data to the saliency total value is marked as a first ratio; the difference between the value 1 and the first ratio is taken as the local weight of the production data.
Preferably, normalizing the temperature sequences with different qualities, and obtaining a difference significance index of the temperature according to the normalized temperature sequences with different qualities includes:
the method for normalizing the temperature sequence comprises the following steps: normalizing each temperature vector in the temperature sequence;
The temperature vector normalization method comprises the following steps: accumulating all the element values in the temperature vector to obtain an element total value, calculating the ratio of each element value in the temperature vector to the element total value, marking the ratio as a normalized value of each element in the temperature vector, and replacing each element value of the temperature vector with the normalized value of the element to complete the normalization of the temperature vector;
the expression of the difference saliency coefficient is:
In the method, in the process of the invention, Represents the mth normalized vector,/>Represents the h normalized vector,/>Normalized temperature sequence representing the nth quality,/>Normalized temperature sequence representing quality z-Representation/>And/>The JS divergence of (c),Representing an exponential function based on a natural constant,/>Indicating a temperature difference significance index.
Preferably, the obtaining the global weight of each production data according to the difference significance index of all the production data specifically includes:
In the method, in the process of the invention, Representing the difference significance coefficient of the k-th class of production data,/>Representing the difference significance coefficient of the v-th class of production data,/>As a sign function,/>As a round-down function,/>A weight duty ratio coefficient representing the kth class of production data;
The accumulated value of the weight ratio coefficients of all types of production data is recorded as a weight total coefficient;
And taking the ratio of the weight duty ratio coefficient to the weight total coefficient of each type of production data as the global weight of each type of production data.
Preferably, the extracting feature information according to the global weight of all production data and the local weight under different qualities includes:
calculating the average value of each type of production data of the donkey-hide gelatin cake of each batch, and taking the product of the sum of the local weight and the global weight and the average value as the weighted value of each type of production data of each batch; constructing all weighted values into a production matrix;
And (3) reducing the dimension of the production matrix by using a PCA algorithm to obtain a load vector of the first main component, sequentially accumulating the load factors in the load vector from large to small, and taking the load factors sequentially accumulated from large to small as characteristic information when the accumulated value exceeds the preset proportion of the total value of all the load factors.
In a second aspect, an embodiment of the present application further provides a system for extracting feature information of a donkey-hide gelatin cake production process, where the system includes a memory, a processor, and a computer program stored in the memory and running on the processor, where the processor implements the steps of the feature information extraction method of any one of the above donkey-hide gelatin cake production processes when executing the computer program.
The application has at least the following beneficial effects:
According to the method, through analyzing the production characteristics of the donkey-hide gelatin cake, the quality similarity significance index of each type of production data in the same quality donkey-hide gelatin cake is calculated by utilizing the characteristic that the production conditions of the same quality donkey-hide gelatin cake have higher similarity in the same type of data, and then the local weight of each type of production data in the same quality donkey-hide gelatin cake is calculated; the production data with larger influence on the quality of the donkey-hide gelatin cake is utilized, the characteristic of larger difference among the donkey-hide gelatin cakes with different quality is utilized, the difference significance index of each type of production data in the donkey-hide gelatin cakes with different quality is calculated, and then the global weight of each type of production data is calculated; and finally, weighting each production data according to the local weight and the global weight, and then reducing the dimension through PCA to determine factors which have main influence on the quality of the donkey-hide gelatin cake. In this way, the change of various factors in the production process between the same quality and different quality is considered, so that the influence of the factors on the quality of the donkey-hide gelatin cake is accurately evaluated, main influencing factors are determined, and in the subsequent donkey-hide gelatin cake production process, the main factors are strictly controlled, so that the quality of the donkey-hide gelatin cake is higher and more controllable.
Drawings
In order to more clearly illustrate the embodiments of the application or the technical solutions and advantages of the prior art, the following description will briefly explain the drawings used in the embodiments or the description of the prior art, and it is obvious that the drawings in the following description are only some embodiments of the application, and other drawings can be obtained according to the drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flowchart illustrating a method for extracting feature information in a donkey-hide gelatin cake production process according to an embodiment of the present application;
fig. 2 is a flowchart of an implementation of a method for extracting feature information in a donkey-hide gelatin cake production process according to an embodiment of the application.
Detailed Description
In order to further describe the technical means and effects adopted by the present application to achieve the preset purpose, the following detailed description refers to the specific implementation, structure, features and effects of a method and a system for extracting feature information in a donkey-hide gelatin cake production process according to the present application, which are described in detail below with reference to the accompanying drawings and preferred embodiments. In the following description, different "one embodiment" or "another embodiment" means that the embodiments are not necessarily the same. Furthermore, the particular features, structures, or characteristics of one or more embodiments may be combined in any suitable manner.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
The following specifically describes a specific scheme of a method and a system for extracting characteristic information in a donkey-hide gelatin cake production process provided by the application with reference to the accompanying drawings.
Referring to fig. 1, a flowchart of a method for extracting feature information of a donkey-hide gelatin cake production process according to an embodiment of the application is shown, the method includes the following steps:
step S001, obtaining production data vectors of donkey-hide gelatin cakes in each batch.
In the donkey-hide gelatin cake production process, temperature data during donkey-hide gelatin heating and melting are collected through a temperature sensor in heating equipment, humidity data of a production environment are collected through a humidity sensor in a workshop environment, stirring speed data of a step of fusing auxiliary materials with donkey-hide gelatin stock solution are collected through a rotating speed sensor on stirring equipment, pressure data of a pressing step are collected through a pressure sensor on pressing equipment, and the four data are collectively called production data.
In this embodiment, the above production data are collected once per second, and parameters may be different in the production process of each batch of donkey-hide gelatin cake, so that the production time of each step is changed, and if the temperature, humidity, stirring speed and pressure are collected at fixed time intervals, the same type of production data of different batches of donkey-hide gelatin cakes will have different states at corresponding moments. Taking temperature data as an example, when the donkey-hide gelatin is melted by heating at a high temperature, the donkey-hide gelatin may be melted in 40 th minute and then the next step is performed, but when the temperature is lower, the donkey-hide gelatin is melted by heating, the donkey-hide gelatin may not be melted in 40 th minute, and at this time, the temperatures of two batches cannot be compared. So in order to eliminate this, instead of collecting data at fixed time intervals, data is collected every second until each step is finished, and then 1% of the time used in the current step is taken as a time interval, for example, in the donkey-hide gelatin heating and melting step, if the step is finished for 100 minutes in total, the time interval is 1 minute, and the temperature value is collected every one minute.
For each batch of donkey-hide gelatin cake, a vector is obtained in each step, which is a temperature vector, a humidity vector, a stirring speed vector and a pressure vector.
So far, the production data vector of the donkey-hide gelatin cake in the same batch is obtained.
Step S002, constructing the corresponding instability coefficient according to the data value of the production data vector of each quality of each batch; obtaining the corresponding dissimilarity coefficient according to the sequence constructed by the production data vector; based on this, local weights for each production data at different qualities are obtained.
According to industry evaluation standards, the quality of the produced donkey-hide gelatin cake is measured and divided into three quality grades of excellent, good and unqualified, and the donkey-hide gelatin cake with excellent quality is taken as an example:
Counting a plurality of batches of donkey-hide gelatin cakes with excellent quality, sequencing temperature vectors of all batches according to a time sequence to obtain a temperature sequence, sequencing humidity vectors of all batches according to a time sequence to obtain a humidity sequence, sequencing stirring speed vectors of all batches according to a time sequence to obtain a stirring speed sequence, and sequencing pressure vectors of all batches according to a time sequence to obtain a pressure sequence.
The corresponding production data of the donkey-hide gelatin cake with the same quality should have similarity, and in the production of different batches of donkey-hide gelatin cake, the higher the similarity is, the larger the influence on the quality of the donkey-hide gelatin cake is; the lower the similarity of the production data, the less the effect on the quality of the donkey-hide gelatin cake. Each type of production data has stability, which is helpful for controlling the data in the production process, if the data always fluctuates, the data is difficult to accurately control for production, so that the more stable the data is, the higher the controllability of the production process is, and the higher the consistency and reliability of the final finished product are.
The similarity of the production data is obtained by comparing the similarity of the same type of production data of donkey-hide gelatin cakes in different batches, and the production data is represented by vectors, so that the similarity of the production data is obtained by cosine similarity. The stability of the production data requires judgment of the time series time of the production data and the overall fluctuation frequency. If the data values of any one of the production data are the same, the time of the continuous moment is recorded as the data duration, all the data durations of each vector are counted to form a data fluctuation set, the sum of the data durations corresponding to the same data value in the same data fluctuation set is recorded as the total data duration, and the total data duration forms a data stability set.
In this embodiment, for example, a temperature vector is {1,1,2,1,1,1,3,4,6,6}, where {1,1} corresponds to a temperature duration of 2, {1,1} corresponds to a temperature duration of 3, {6,6} corresponds to a temperature duration of 2, where the temperature variation set is {2,3,2}, the total duration of the temperatures with a temperature value of 2 is 5, the total duration of the temperatures with a temperature value of 6 is 2, and the temperature stability set is {5,2}.
According to the cosine similarity of the vectors in the temperature sequence and the number of the vectors in the temperature sequence, the dissimilarity coefficient of the temperature is obtained, and the formula is as follows:
In the method, in the process of the invention, Representing the i-th vector in the temperature sequence,/>Represents the j-th vector in the temperature sequence,/>Representing cosine similarity function,/>Representing the number of vectors in the temperature sequence,/>Represents an exponential function with a base of a natural constant,The dissimilarity coefficient indicating temperature, t indicates that the production data is temperature data.
If the influence of temperature on the quality of the donkey-hide gelatin cake is larger, the similarity of temperature in the production data of different batches of donkey-hide gelatin cakes is higher, so that the cosine similarity is higherShould be closer to 1, thus/>Should be closer to 0, i.e. the dissimilarity coefficient with temperature/>And/>The closer to 1, the better, i.e., the smaller the dissimilarity factor, the better.
Obtaining a temperature instability coefficient according to the number of elements in the temperature fluctuation set and the variance of element values in the temperature stability set, wherein the formula is as follows:
In the method, in the process of the invention, Indicates the number of elements in the temperature variation set of the a-th batch donkey-hide gelatin cake,/>Represents the temperature stability set of the donkey-hide gelatin cake of the a batch,/>Representing the variance of the temperature stability set of batch a of donkey-hide gelatin cake,/>Indicates the number of donkey-hide gelatin cake batches,/>Representing an exponential function based on a natural constant,/>Representing the coefficient of instability of the temperature,/>Is an adjustment parameter to prevent the denominator from being 0.
The temperature is stable, so that the duration of a certain temperature is longer, that is, the proportion of a certain temperature in the temperature stability set is enough outstanding, so that the uniformity of data in the temperature stability set is broken, the variance is larger, and the frequency of overall temperature change is smaller, that is, the quantity of data in the temperature change set is smaller, so thatSmaller, i.e. coefficient of instability of temperature/>Smaller, and/>The smaller the better.
Obtaining a significance coefficient of the temperature according to the instability coefficient and the dissimilarity coefficient of the temperature, wherein the formula is as follows:
In the method, in the process of the invention, Representing the dissimilarity coefficient of temperature,/>Representing the coefficient of instability of the temperature,/>Representing the coefficient of significance of the temperature. Due to/>The range of values of (2) is/>And/>Smaller and better,/>And/>The closer to 1, the better, and thus the similarity index/>, of temperatureThe smaller the size, the closer to 0, the greater the effect of temperature on the quality of the donkey-hide gelatin cake.
The similarity significance index of humidity, stirring speed and pressure is obtained by calculating the similarity significance index of temperature. Since the calculation of the similarity significance index is not a calculation for one type of data value, the change trend, stability and similarity of the data are measured. Obtaining local weights of various production data according to the significance coefficients of different production data, wherein the formula is as follows:
In the method, in the process of the invention, Representing the significance coefficient of the nth class of production data,/>A significance coefficient representing production data of class v,Expressed at/>The local weight of the n-th class of production data under the quality is larger when the significance coefficient of each class of production data is smaller, so that the quality influence on the donkey-hide gelatin cake is larger, and the weight occupied by the class of production data is larger, thus/>The larger.
According to the local weights of the different production data of the donkey-hide gelatin cake with excellent quality, the local weights of the different production data of the donkey-hide gelatin cake with other quality are obtained
So far, the local weight of different production data under different qualities is obtained.
Step S003, the difference significance index is obtained according to the normalized vector of the production data, and the global weight of the production data is obtained.
According to the steps, the similarity of the donkey-hide gelatin cake with the same quality on the same type of production data is obtained, but if the similarity of the production data of one type of production data in the donkey-hide gelatin cake with all the quality is higher, the production data is not decisive for the quality of the donkey-hide gelatin cake. Therefore, it is necessary to continuously judge the difference of the donkey-hide gelatin cake with different quality on the same production data, and the larger the difference is, the more possibly the decisive influence on the quality of the donkey-hide gelatin cake is explained. Therefore, the difference significance coefficients of donkey-hide gelatin cakes with different qualities on the same production data are calculated.
Taking temperature data as an example, the normalized temperature vector of the donkey-hide gelatin cake with excellent quality forms a normalized temperature sequence with excellent quality, the normalized temperature vector of the donkey-hide gelatin cake with good quality forms a normalized temperature sequence with good quality, and the normalized temperature vector of the donkey-hide gelatin cake with unqualified quality forms a normalized temperature sequence with unqualified quality. The normalization is as follows:
In the method, in the process of the invention, Element value representing the r-th element in the vector,/>Representing the number of elements in a vector,/>Representing vectorsIs a function of the normalization of (a).
Obtaining a temperature difference significance index according to normalized temperature sequences of temperature data under different qualities, wherein the formula is as follows:
In the method, in the process of the invention, Represents the mth normalized vector,/>Represents the h normalized vector,/>Normalized temperature sequence representing the nth quality,/>Normalized temperature sequence representing quality z-Representation/>And/>The JS divergence of (c),Representing an exponential function based on a natural constant,/>Indicating a temperature difference significance index.
If the difference of the donkey-hide gelatin cake with different quality on the temperature data is larger, the difference between the two temperature vectors with different quality is also relatively more obvious, the value of JS divergence is larger, and thus the difference of the temperature is significant indexThe value of (2) will be smaller. If the difference of the donkey-hide gelatin cakes with different qualities on the temperature data is smaller, namely the similarity is larger, the value of JS divergence is closer to 0, so that the difference of the temperature is significant index/>The value of (2) will be greater. And because the index function reflects the index more obviously, the index/>, according to the difference of the temperature, can be obviously obtainedThe range of values of (2) to determine the temperature difference is concentrated between several qualities, i.e. if the difference significance index/>And is closer to 3, the difference between the three qualities of the temperature can be judged to be smaller; if the difference significance index/>And closer to 2, the large probability temperature is less differential between the at least two qualities; if the difference significance index/>The large probability temperature is large in the variability between the three qualities.
Obtaining the difference significance index of humidity, stirring speed and pressure according to the difference significance index mode of obtaining temperature、/>、/>. The more remarkable the difference is, the more likely the donkey-hide gelatin cake quality is affected mainly; the more likely the difference between more qualities is significant, the more production data having a major effect on the quality of donkey-hide gelatin cake should be given a higher weight, so that the global weight of each type of production data is obtained according to the difference significance index, the formula is as follows:
In the method, in the process of the invention, Representing the difference significance coefficient of the k-th class of production data,/>Representing the difference significance coefficient of the v-th class of production data,/>As a sign function,/>As a round-down function,/>A weight duty cycle representing the k-th class of production data,Weight ratio coefficient representing nth class production data,/>Representing global weights for the n-th class of production data.
Thus, global weights for each type of production data are obtained.
Step S004, extracting characteristic information according to the global weight of all production data and the local weight under different qualities.
And respectively calculating the average value of vectors of the production data of each class of donkey-hide gelatin cakes in each batch, wherein the average value of the vectors is the average value of all element values in the vectors, taking the product of the sum of local weight and global weight and the average value as the weighted value of each class of production data in each batch, constructing all weighted values as a production matrix, and listing the production matrix as a production data type.
Taking a production matrix as input, using a PCA algorithm to reduce the dimension to obtain a first principal component load vector, sorting the load factors in the load vector from large to small according to the absolute values of the load factors, accumulating from large to small, and when the accumulated value exceeds 70% of the sum of the absolute values of all the load factors, obtaining the load factors as characteristic information in the production process. The characteristic information includes data from most manufacturing processes. The PCA algorithm is a well-known technique and will not be described in detail herein.
Based on the same inventive concept as the above method, the embodiment of the application also provides a feature information extraction system of a donkey-hide gelatin cake production process, which comprises a memory, a processor and a computer program stored in the memory and running on the processor, wherein the processor executes the computer program to realize any one of the steps of the feature information extraction method of the donkey-hide gelatin cake production process.
It should be noted that: the sequence of the embodiments of the present application is only for description, and does not represent the advantages and disadvantages of the embodiments. And the foregoing description has been directed to specific embodiments of this specification. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
In this specification, each embodiment is described in a progressive manner, and identical and similar parts of each embodiment are all referred to each other, and each embodiment mainly describes differences from other embodiments.
The foregoing description of the preferred embodiments of the present application is not intended to be limiting, but rather, any modifications, equivalents, improvements, etc. that fall within the principles of the present application are intended to be included within the scope of the present application.

Claims (3)

1. The characteristic information extraction method of the donkey-hide gelatin cake production process is characterized by comprising the following steps of:
obtaining a production data vector of donkey-hide gelatin cakes in each batch;
Dividing the donkey-hide gelatin cake into three qualities, and forming a temperature sequence for all temperature vectors in any one quality; acquiring a temperature variation set and a temperature stability set according to data in the temperature vector; acquiring a temperature dissimilarity coefficient according to the cosine similarity and the number of temperature vectors in the temperature sequence; acquiring an instability coefficient of temperature according to elements in the temperature variation set and the temperature stability set; obtaining local weight of the temperature under any quality according to the instability coefficient and the dissimilarity coefficient of the temperature; acquiring local weights of each production data under different qualities according to a method for acquiring the local weights of the temperatures;
Normalizing the temperature sequences with different qualities, and acquiring a difference significance index of the temperature according to the normalized temperature sequences with different qualities; acquiring the difference significance index of the rest production data according to the mode of acquiring the difference significance index of the temperature; acquiring global weights of each production data according to the difference significance indexes of all the production data;
extracting characteristic information according to the global weight of all production data and the local weights under different qualities;
the acquiring the temperature variation set and the temperature stability set according to the data in the temperature vector comprises the following steps:
For the temperature vector, if the data values of the continuous moments exist in the temperature vector, the time corresponding to the continuous moments is the temperature duration time; the set of all the temperature duration is recorded as a temperature variation set;
counting the temperature duration time corresponding to each temperature value, and adding the temperature duration time corresponding to each temperature value to be used as the total temperature duration time corresponding to the temperature value; the set formed by the total duration of the temperature is recorded as a temperature stability set;
the obtaining the dissimilarity coefficient of the temperature according to the cosine similarity and the number of the temperature vectors in the temperature sequence comprises the following steps:
Calculating cosine similarity of any two temperature vectors in the temperature sequence, calculating the same temperature vector, and accumulating all the cosine similarity as a first cosine similarity value;
the square of the number of temperature vectors in the temperature sequence is recorded as a first number;
Taking the first quantity and the first cosine similarity value as differences, and recording the exponential function value of the differences as a dissimilarity coefficient of the temperature;
the obtaining the instability coefficient of the temperature according to the elements in the temperature variation set and the temperature stability set comprises the following steps:
In the method, in the process of the invention, Indicates the number of elements in the temperature variation set of the a-th batch donkey-hide gelatin cake,/>Represents the temperature stability set of the donkey-hide gelatin cake of the a batch,/>Representing the variance of the temperature stability set of batch a of donkey-hide gelatin cake,/>Indicates the number of donkey-hide gelatin cake batches,/>Representing an exponential function based on a natural constant,/>Representing the coefficient of instability of the temperature,/>Is an adjustment parameter;
the obtaining the local weight of the temperature under any quality according to the instability coefficient and the dissimilarity coefficient of the temperature comprises the following steps:
Making the dissimilarity coefficient of the temperature and the value 1 different, and recording the sum of the difference and the instability coefficient of the temperature as the significance coefficient of the temperature;
The sum of the saliency coefficients of all production data is marked as a saliency total value, and the ratio of the saliency coefficient of each type of production data to the saliency total value is marked as a first ratio; taking the difference value of the value 1 and the first ratio as the local weight of the production data;
normalizing the temperature sequences with different qualities, and acquiring a difference significance index of the temperature according to the normalized temperature sequences with different qualities, wherein the method comprises the following steps:
the method for normalizing the temperature sequence comprises the following steps: normalizing each temperature vector in the temperature sequence;
The temperature vector normalization method comprises the following steps: accumulating all the element values in the temperature vector to obtain an element total value, calculating the ratio of each element value in the temperature vector to the element total value, marking the ratio as a normalized value of each element in the temperature vector, and replacing each element value of the temperature vector with the normalized value of the element to complete the normalization of the temperature vector;
the expression of the difference saliency coefficient is:
In the method, in the process of the invention, Represents the mth normalized vector,/>Represents the h normalized vector,/>Normalized temperature sequence representing the nth quality,/>Normalized temperature sequence representing quality z-Representation/>And/>JS divergence of/>Representing an exponential function based on a natural constant,/>A difference significance index representing temperature;
The obtaining the global weight of each production data according to the difference significance index of all the production data specifically comprises the following steps:
In the method, in the process of the invention, Representing the difference significance coefficient of the k-th class of production data,/>Representing the difference significance coefficient of the v-th class of production data,/>As a sign function,/>As a round-down function,/>A weight duty ratio coefficient representing the kth class of production data;
The accumulated value of the weight ratio coefficients of all types of production data is recorded as a weight total coefficient;
Taking the ratio of the weight ratio coefficient of each type of production data to the weight total coefficient as the global weight of each type of production data;
the extracting the characteristic information according to the global weight of all production data and the local weight under different qualities comprises the following steps:
calculating the average value of each type of production data of the donkey-hide gelatin cake of each batch, and taking the product of the sum of the local weight and the global weight and the average value as the weighted value of each type of production data of each batch; constructing all weighted values into a production matrix;
And (3) reducing the dimension of the production matrix by using a PCA algorithm to obtain a load vector of the first main component, sequentially accumulating the load factors in the load vector from large to small, and taking the load factors sequentially accumulated from large to small as characteristic information when the accumulated value exceeds the preset proportion of the total value of all the load factors.
2. The method for extracting characteristic information of donkey-hide gelatin cake production process according to claim 1, wherein the donkey-hide gelatin cake production data vector comprises a temperature vector, a humidity vector, a stirring vector and a pressure vector.
3. A characteristic information extraction system of a donkey-hide gelatin cake production process, comprising a memory, a processor and a computer program stored in the memory and running on the processor, wherein the processor, when executing the computer program, realizes the steps of a characteristic information extraction method of a donkey-hide gelatin cake production process according to any one of claims 1-2.
CN202410430365.1A 2024-04-11 2024-04-11 Characteristic information extraction method and system for donkey-hide gelatin cake production process Active CN118037134B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410430365.1A CN118037134B (en) 2024-04-11 2024-04-11 Characteristic information extraction method and system for donkey-hide gelatin cake production process

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410430365.1A CN118037134B (en) 2024-04-11 2024-04-11 Characteristic information extraction method and system for donkey-hide gelatin cake production process

Publications (2)

Publication Number Publication Date
CN118037134A CN118037134A (en) 2024-05-14
CN118037134B true CN118037134B (en) 2024-06-14

Family

ID=90989548

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410430365.1A Active CN118037134B (en) 2024-04-11 2024-04-11 Characteristic information extraction method and system for donkey-hide gelatin cake production process

Country Status (1)

Country Link
CN (1) CN118037134B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118427881B (en) * 2024-06-25 2024-10-01 大连爱柏斯化工股份有限公司 Cobalt propionate synthesis related data protection method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116659589B (en) * 2023-07-25 2023-10-27 澳润(山东)药业有限公司 Donkey-hide gelatin cake preservation environment monitoring method based on data analysis
CN117408497B (en) * 2023-12-14 2024-04-05 山东锦都食品有限公司 Food processing material management and control method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"阿胶类保健食品中阿胶成分和牛皮源成分检测及质量评价";王超 等;《食品与发酵工业》;20191231;第45卷(第1期);全文 *
Jinju Zhang等."Identification of donkey-hide gelatin and donkey-bone gelatin based on marker peptides".《LWT -FoodScienceandTechnology》.2023,全文. *

Also Published As

Publication number Publication date
CN118037134A (en) 2024-05-14

Similar Documents

Publication Publication Date Title
CN118037134B (en) Characteristic information extraction method and system for donkey-hide gelatin cake production process
CN109620152B (en) MutifacolLoss-densenert-based electrocardiosignal classification method
CN101484910B (en) Clustering system, and defect kind judging device
CN108959895B (en) Electroencephalogram EEG (electroencephalogram) identity recognition method based on convolutional neural network
Ollech et al. A random forest-based approach to identifying the most informative seasonality tests
Lauguico et al. Lettuce life stage classification from texture attributes using machine learning estimators and feature selection processes
Olesen et al. Deep residual networks for automatic sleep stage classification of raw polysomnographic waveforms
CN111048210A (en) Method and device for evaluating disease risk based on fundus image
Barhate et al. Analysis of classifiers for prediction of type ii diabetes mellitus
CN115969329B (en) Sleep stage method, system, device and medium
CN109325410B (en) Electroencephalogram EEG (electroencephalogram) feature extraction method based on convolutional neural network
CN111297327B (en) Sleep analysis method, system, electronic equipment and storage medium
CN113288157A (en) Arrhythmia classification method based on depth separable convolution and improved loss function
CN110766082B (en) Plant leaf disease and insect pest degree classification method based on transfer learning
CN113076347B (en) Emotion-based push program screening system and method on mobile terminal
CN111967616A (en) Automatic time series regression method and device
US20230200697A1 (en) Automatic evolution method for brainwave database and automatic evolving system for detecting brainwave
CN116763324A (en) Single-channel electroencephalogram signal sleep stage method based on multiple scales and multiple attentions
CN117000780A (en) Self-adaptive cold-rolled strip multichannel shape forecasting method
CN115670417A (en) Health assessment method and system for running monitoring
CN108182288A (en) A kind of recommendation method based on artificial immunity
CN114648270B (en) Method and system for calculating safety inventory of life parts
CN107016345A (en) A kind of demand model construction method applied to Product Conceptual Design
CN117877736B (en) Intelligent ring abnormal health data early warning method based on machine learning
CN115841367B (en) Product recommendation method based on sleeping behaviors of user

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant