CN113283514B - Unknown class classification method, device and medium based on deep learning - Google Patents
Unknown class classification method, device and medium based on deep learning Download PDFInfo
- Publication number
- CN113283514B CN113283514B CN202110603719.4A CN202110603719A CN113283514B CN 113283514 B CN113283514 B CN 113283514B CN 202110603719 A CN202110603719 A CN 202110603719A CN 113283514 B CN113283514 B CN 113283514B
- Authority
- CN
- China
- Prior art keywords
- matrix
- prediction result
- feature
- full
- category
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000013135 deep learning Methods 0.000 title claims abstract description 52
- 238000000034 method Methods 0.000 title claims abstract description 31
- 239000011159 matrix material Substances 0.000 claims abstract description 156
- 230000006870 function Effects 0.000 claims abstract description 46
- 230000004913 activation Effects 0.000 claims abstract description 24
- 238000013145 classification model Methods 0.000 claims abstract description 13
- 238000012549 training Methods 0.000 claims description 32
- 238000005070 sampling Methods 0.000 claims description 7
- 238000000605 extraction Methods 0.000 claims description 3
- 230000007246 mechanism Effects 0.000 abstract description 6
- 230000009286 beneficial effect Effects 0.000 abstract description 5
- 238000004590 computer program Methods 0.000 description 14
- 238000010586 diagram Methods 0.000 description 7
- 238000004891 communication Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses an unknown class classification method based on deep learning, which comprises the following steps: s1, acquiring a picture of a category to be identified; s2, inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer; s3, through voting resultsObtaining a prediction result; Comparison ofAnd (3) withOf (2), wherein. If it isOrder in principle; Otherwise; For a pair ofOrdering in descending order, at which time the maximum probability valueThe corresponding category is the prediction result, and the unknown category classification method based on the deep learning can distinguish the known category from the unknown category to obtain the prediction result; coding the known and unknown class labels, thereby being beneficial to improving the prediction result; through the voting mechanism, the model is more reliable and stable.
Description
Technical Field
The invention relates to the field of computer vision, in particular to an unknown class classification method based on deep learning.
Background
With the continuous development and breakthrough of deep learning, computer vision technology based on deep learning is receiving more and more attention. However, classification of unknown classes remains a difficulty in computer vision tasks. The known categories represent categories that appear in the training set; the unknown class represents a class that is not in the training set. For example, for a classification task, the training set contains multiple categories of "cat", "dog" and "bird", which belong to known categories; when a category, such as "car", does not exist in the category of the training set, then that category belongs to an unknown category.
In real life, it is necessary to classify known classes well, and also to exclude unknown classes and prevent them from being misclassified into known classes. By excluding unknown classes, the stability and security of the overall computer vision algorithm can be improved.
Currently, a Bayesian probabilistic neural network is typically used for classification of unknown classes. During prediction, the weights of the neurons are randomly sampled, so that a plurality of different prediction results can be obtained for the same input picture. Repeating the process for a plurality of times, collecting a plurality of results, and carrying out weighting operation to obtain a final prediction result. The method needs repeated operation for the same input picture, has low efficiency and cannot be used for high-efficiency engineering application.
The background description provided herein is for the purpose of generally presenting the context of the disclosure. Unless otherwise indicated herein, the materials described in this section are not prior art to the claims in this application and are not admitted to be prior art by inclusion in this section.
Disclosure of Invention
In order to overcome the defects, the invention provides an unknown class classification method based on deep learning. The unknown class classification method based on deep learning can distinguish the known class from the unknown class to obtain a prediction result; coding the known and unknown class labels, thereby being beneficial to improving the prediction result; through the voting mechanism, the model is more reliable and stable.
In a first aspect, the present embodiment provides a deep learning-based unknown class classification method, which includes the following steps:
S1, acquiring a picture of a category to be identified;
S2, inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer;
S3, by matrix of prediction results Voting to obtain prediction result/>The concrete mode of voting is that the prediction result matrix/>, is comparedAnd/>If/>Let/>; Otherwise/>; Pair/>Ordering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result, if/>The prediction result is of unknown class, otherwise is/>The corresponding known category; wherein/>And/>Representing the probability value of the single-heat coding and the inverse single-heat coding of the ith category obtained by the third feature matrix after the sigmoid function, wherein the dimension is 2N; /(I)The classified pictures to be identified are subjected to prediction result matrix/>A prediction result is obtained after voting;
Preferably, the weights of the backbone network in the pre-trained unknown class classification model based on deep learning use the weights of a training network; the training network model has the structure that: the feature map is divided into two branches, and a first branch passes through the full-connection layer to obtain a first feature matrix; the second branch passes through the full connection layer to obtain a first uncertain matrix; after the first uncertain matrix passes through an activation function, a second uncertain matrix is obtained; randomly sampling a number in Gaussian distribution, multiplying the number by the second uncertainty matrix by bits, and adding the number by bits to the first feature matrix to obtain a second feature matrix.
Preferably, the codes of the labels in one category consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N.
Preferably, the loss function of the training network is composed of sigmoid cross entropy and KL divergence, and the loss function is shown in the following formula.
;
;
;
Wherein,Representing the label after the coding mode; /(I)Representing the predicted outcome; /(I)Representing a second uncertainty matrix; /(I)Representing a first feature matrix; /(I)Representing the weight balance factor.
In a second aspect, the present embodiment provides an unknown class classification device based on deep learning, which includes the following units:
the image acquisition unit is used for acquiring images of the category to be identified;
The feature extraction unit is used for inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer;
category recognition unit for predicting result matrix Voting to obtain prediction result/>The concrete mode of voting is that the prediction result matrix/>, is comparedAnd/>If/>Order in principle; Otherwise/>; Pair/>Ordering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result, if/>The prediction result is of unknown class, otherwise is/>The corresponding known category; wherein the method comprises the steps ofAnd/>Representing the probability value of the single-heat coding and the inverse single-heat coding of the ith category obtained by the third feature matrix after the sigmoid function, wherein the dimension is 2N; /(I)The classified pictures to be identified are subjected to prediction result matrix/>A prediction result is obtained after voting;
Preferably, the weights of the backbone network in the pre-trained unknown class classification model based on deep learning use the weights of a training network; the training network model has the structure that: the feature map is divided into two branches, and a first branch passes through the full-connection layer to obtain a first feature matrix; the second branch passes through the full connection layer to obtain a first uncertain matrix; after the first uncertain matrix passes through an activation function, a second uncertain matrix is obtained; randomly sampling a number in Gaussian distribution, multiplying the number by the second uncertain matrix in a bit manner, adding the number by the first eigenvalue matrix in a bit manner to obtain a second eigenvalue matrix, and obtaining a third eigenvalue matrix after the second eigenvalue matrix passes through a full connection layer.
Preferably, the codes of the labels in one category consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N.
Preferably, the loss function of the training network is composed of sigmoid cross entropy and KL divergence, and the loss function is shown in the following formula.
;
;
;
Wherein,Representing the label after the coding mode; /(I)Representing the predicted outcome; /(I)Representing a second uncertainty matrix; /(I)Representing a first feature matrix; /(I)Representing the weight balance factor.
In a third aspect, the present embodiment is a non-volatile storage medium containing instructions that, when executed, are configured to implement a method as described above.
The invention has the following advantages:
1. The unknown class classification method based on deep learning can distinguish the known class from the unknown class to obtain a prediction result;
2. The invention encodes the known and unknown class labels, which is helpful for improving the prediction result;
3. The invention makes the model more reliable and stable through the voting mechanism.
Drawings
FIG. 1 is a flow chart of a deep learning-based unknown class classification method of the present invention;
FIG. 2 is a schematic diagram of a training model based on a deep learning unknown class classification method of the present invention;
FIG. 3 is a schematic diagram of a predictive model based on a deep learning unknown class classification method of the present invention;
FIG. 4 is a schematic diagram of tag coding based on a deep learning unknown class classification method of the present invention;
FIG. 5 is a flow chart of the method for classifying unknown classes based on deep learning according to the present invention;
FIG. 6 is a schematic diagram of a deep learning-based unknown class classification device in accordance with the present invention;
FIG. 7 is a schematic diagram of a deep learning-based unknown class classification device in accordance with the present invention.
Detailed Description
The invention is further described below with reference to the drawings and examples. But should not be used to limit the scope of the invention.
Example 1
Referring to fig. 1, the embodiment provides an unknown class classification method based on deep learning, which includes the following steps:
S1: confirming N categories required by a task, collecting N categories of data, selecting pictures under different scenes, illumination and angles, and classifying the pictures to form a training set of known categories;
S2: collecting pictures in a public data set as a data set of an unknown class to form a training set of the unknown class;
s3: coding the class labels by using the known class training set of S1 and the unknown class training set of S2, training an unknown class classification training model based on deep learning, and optimizing the training model by using a loss function;
s4: transforming the unknown class classification training model based on the deep learning obtained in the step S3 to obtain an unknown class classification prediction model based on the deep learning;
S5: and (3) inputting a picture by using the unknown class classification prediction model based on the deep learning obtained in the step (S4) to obtain a prediction result matrix of the picture, and obtaining a prediction result in a voting mode.
Preferably, the backbone network is constructed based on ResNet as shown in fig. 2. The size of the input image of the backbone network is 224×224×3, and the size of the feature map output by the backbone network is 1×1×512, where 1*1 is the height and width of the feature map, and 512 is the number of channels of the feature map.
In the training model of S3, the feature map is then split into two branches. And the first branch passes through the full-connection layer to obtain a first feature matrix. And the second branch passes through the full connection layer to obtain a first uncertain matrix. After the first uncertainty matrix is subjected to softplus activation functions, a second uncertainty matrix is obtained. Randomly sampling a number in a gaussian distributionAnd carrying out bit-wise multiplication (ELEMENT WISE multiple) with the uncertainty matrix 2, and carrying out bit-wise addition (ELEMENT WISE addition) with the first feature matrix to obtain a second feature matrix.
Preferably, the feature dimensions of the first feature matrix, the first uncertainty matrix, the second uncertainty matrix, and the second feature matrix are each 512.
Preferably, the third feature matrix is obtained after the second feature matrix passes through the full connection layer, and the feature dimension is 2N, where N is the number of known categories.
Preferably, the third feature matrix is subjected to a sigmoid activation function to obtain a prediction result matrix.
Preferably, the predictive model uses the weights of the training model in S4. As shown in fig. 3, when a picture with a size of 224×224×3 is input into the backbone network of the prediction model, a feature map with a size of 1×1×512 is obtained. And after the feature map passes through the full-connection layer, a first feature matrix is obtained. And the first feature matrix is subjected to a full connection layer to obtain a third feature matrix, and the feature dimension is 2N. And obtaining a prediction result matrix after the third feature matrix passes through a sigmoid activation function.
Preferably, the codes of the labels in one category consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N, as shown in fig. 4. Assuming that the number of known classes is 3, the one-hot code of a certain known class is (0, 1, 0), indicating the code that it belongs to that class, and the one-hot code is (1, 0, 1), which is the code that it does not belong to that class. Combining the two forms a dimension 6 tag code, (0,1,0,1,0,1). Similarly, an unknown class of codes is (0,0,0,1,1,1).
Preferably, the loss function used in the training model consists of sigmoid cross entropy and KL divergence together, as shown in the following formula:
;
;
;
Wherein, Representing the label after the coding mode; /(I)Representing the predicted outcome; /(I)Representing a second uncertainty matrix; /(I)Representing a first feature matrix; /(I)Representing the weight balance factor.
Preferably, when a prediction result matrix is obtainedIn this case, it is necessary to obtain the prediction result/>, by votingThe method comprises the following steps:
Step one, comparing And/>Of (3), wherein/>. If/>Let/>; Otherwise/>;
Step two, toOrdering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result.
According to the unknown class classification method based on deep learning, the known class and the unknown class can be distinguished, and a prediction result is obtained; coding the known and unknown class labels, thereby being beneficial to improving the prediction result; through the voting mechanism, the model is more reliable and stable.
Example two
Referring to fig. 5, the embodiment provides an unknown class classification method based on deep learning, which includes the following steps:
S1, acquiring a picture of a category to be identified;
S2, inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer;
S3, by matrix of prediction results Voting to obtain prediction result/>The concrete mode of voting is that the prediction result matrix/>, is comparedAnd/>If/>Let/>; Otherwise/>; Pair/>Ordering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result, if/>The prediction result is of unknown class, otherwise is/>The corresponding known category; wherein/>And/>Representing the probability value of the single-heat coding and the inverse single-heat coding of the ith category obtained by the third feature matrix after the sigmoid function, wherein the dimension is 2N; /(I)The classified pictures to be identified are subjected to prediction result matrix/>A prediction result is obtained after voting;
The codes of the labels in one category consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N, as shown in fig. 4. Assuming that the number of known classes is 3, the one-hot code of a certain known class is (0, 1, 0), indicating the code that it belongs to that class, and the one-hot code is (1, 0, 1), which is the code that it does not belong to that class. Combining the two forms a dimension 6 tag code, (0,1,0,1,0,1). Similarly, an unknown class of codes is (0,0,0,1,1,1).
Specifically, the weights of the backbone network in the pre-trained unknown class classification model based on deep learning use the weights of the training network.
The training network model has the structure that: and a backbone network, wherein the size of the feature map output by the backbone network is 1×1×512, 1*1 is the height and width of the feature map, and 512 is the channel number of the feature map.
In the training model, the feature map is then split into two branches. And the first branch passes through the full-connection layer to obtain a first feature matrix. And the second branch passes through the full connection layer to obtain a first uncertain matrix. After the first uncertainty matrix is subjected to softplus activation functions, a second uncertainty matrix is obtained. Randomly sampling a number in the Gaussian distribution, carrying out bit-wise multiplication (ELEMENT WISE multiple) with the second uncertain matrix, and carrying out bit-wise addition (ELEMENT WISE addition) with the first feature matrix to obtain a second feature matrix.
Preferably, the feature dimensions of the first feature matrix, the first uncertainty matrix, the second uncertainty matrix, and the second feature matrix are each 512.
Preferably, the third feature matrix is obtained after the second feature matrix passes through the full connection layer, and the feature dimension is 2N, where N is the number of known categories.
Preferably, the third feature matrix is subjected to a sigmoid activation function to obtain a prediction result matrix.
According to the unknown class classification method based on deep learning, the known class and the unknown class can be distinguished, and a prediction result is obtained; coding the known and unknown class labels, thereby being beneficial to improving the prediction result; through the voting mechanism, the model is more reliable and stable.
Example IV
Referring to fig. 6, the present embodiment provides an unknown class classification device based on deep learning, which includes the following units:
the image acquisition unit is used for acquiring images of the category to be identified;
The feature extraction unit is used for inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer;
category recognition unit for predicting result matrix Voting to obtain prediction result/>The concrete mode of voting is that the prediction result matrix/>, is comparedAnd/>If/>Order in principle; Otherwise/>; Pair/>Ordering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result, if/>The prediction result is of unknown class, otherwise is/>The corresponding known category; wherein the method comprises the steps ofAnd/>Representing the probability value of the single-heat coding and the inverse single-heat coding of the ith category obtained by the third feature matrix after the sigmoid function, wherein the dimension is 2N; /(I)The classified pictures to be identified are subjected to prediction result matrix/>A prediction result is obtained after voting;
Preferably, the weights of the backbone network in the pre-trained unknown class classification model based on deep learning use the weights of a training network; the training network model has the structure that: the feature map is divided into two branches, and a first branch passes through the full-connection layer to obtain a first feature matrix; the second branch passes through the full connection layer to obtain a first uncertain matrix; after the first uncertain matrix passes through an activation function, a second uncertain matrix is obtained; randomly sampling a number in Gaussian distribution, multiplying the number by the second uncertainty matrix by bits, and adding the number by bits to the first feature matrix to obtain a second feature matrix.
Preferably, the codes of the labels in one category consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N.
Preferably, the loss function of the training network is composed of sigmoid cross entropy and KL divergence, and the loss function is shown in the following formula.
;
;
;
Wherein,Representing the label after the coding mode; /(I)Representing the predicted outcome; /(I)Representing a second uncertainty matrix; /(I)Representing a first feature matrix; /(I)Representing the weight balance factor.
According to the unknown class classification method based on deep learning, the known class and the unknown class can be distinguished, and a prediction result is obtained; coding the known and unknown class labels, thereby being beneficial to improving the prediction result; through the voting mechanism, the model is more reliable and stable.
Example five
Referring to fig. 7, one embodiment provides a schematic structural diagram of an unknown class identification device 70 for deep learning. The deep learning unknown class identification device 70 of this embodiment comprises a processor 71, a memory 72 and a computer program stored in said memory 72 and executable on said processor 71. The processor 71, when executing the computer program, implements the steps in the unknown class identification method embodiment of deep learning described above, such as step S1 shown in fig. 2. Or the processor 71, when executing the computer program, performs the functions of the modules/units in the above-described device embodiments.
Illustratively, the computer program may be partitioned into one or more modules/units that are stored in the memory 72 and executed by the processor 71 to accomplish the present invention. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions for describing the execution of the computer program in the deep learning unknown class identification device 70.
The deep learning unknown class identification device 70 may include, but is not limited to, a processor 71, a memory 72. It will be appreciated by those skilled in the art that the schematic diagram is merely an example of a deep-learned unknown class identification device 70 and does not constitute a limitation of the deep-learned unknown class identification device 70, and may include more or fewer components than illustrated, or may combine certain components, or different components, e.g., the deep-learned unknown class identification device 70 may also include input-output devices, network access devices, buses, etc.
The Processor 71 may be a central processing unit (Central Processing Unit, CPU), but may also be other general purpose processors, digital signal processors (DIGITAL SIGNAL Processor, DSP), application SPECIFIC INTEGRATED Circuit (ASIC), off-the-shelf Programmable gate array (Field-Programmable GATE ARRAY, FPGA) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. The general purpose processor may be a microprocessor or the processor may be any conventional processor or the like, and the processor 71 is a control center of the deep-learned unknown class identification apparatus 70, and connects the respective parts of the entire deep-learned unknown class identification apparatus 70 using various interfaces and lines.
The memory 72 may be used to store the computer program and/or module, and the processor 71 implements the various functions of the deep learning unknown class identification device 70 by running or executing the computer program and/or module stored in the memory 72 and invoking data stored in the memory 72. The memory 72 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like; the storage data area may store data (such as audio data, phonebook, etc.) created according to the use of the handset, etc. In addition, memory 72 may include high-speed random access memory, and may also include non-volatile memory, such as a hard disk, memory, plug-in hard disk, smart memory card (SMART MEDIA CARD, SMC), secure Digital (SD) card, flash memory card (FLASH CARD), at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device.
Wherein the deeply learned unknown class identification device 70 integrated modules/units may be stored in a computer readable storage medium if implemented in the form of software functional units and sold or used as a stand alone product. Based on such understanding, the present invention may also be implemented by implementing all or part of the flow of the method of the above embodiment, or by instructing the relevant hardware by a computer program, where the computer program may be stored in a computer readable storage medium, and the computer program may implement the steps of each of the method embodiments described above when executed by the processor 71. Wherein the computer program comprises computer program code which may be in source code form, object code form, executable file or some intermediate form etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth. It should be noted that the computer readable medium contains content that can be appropriately scaled according to the requirements of jurisdictions in which such content is subject to legislation and patent practice, such as in certain jurisdictions in which such content is subject to legislation and patent practice, the computer readable medium does not include electrical carrier signals and telecommunication signals.
It should be noted that the above-described apparatus embodiments are merely illustrative, and the units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. In addition, in the drawings of the embodiment of the device provided by the invention, the connection relation between the modules represents that the modules have communication connection, and can be specifically implemented as one or more communication buses or signal lines. Those of ordinary skill in the art will understand and implement the present invention without undue burden.
The foregoing embodiments have been provided for the purpose of illustrating the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the foregoing embodiments are merely illustrative of the present invention and are not intended to limit the scope of the present invention. Any modification, equivalent replacement, improvement, etc. made without departing from the spirit and scope of the present invention also fall within the scope of the present invention.
Claims (5)
1. An unknown class classification method based on deep learning comprises the following steps:
S1, acquiring a picture of a category to be identified;
S2, inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer;
by matrix of prediction results Voting to obtain prediction result/>The concrete mode of voting is that the prediction result matrix/>, is comparedAnd/>If/>Let/>; Otherwise/>; Pair/>Ordering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result, if/>The prediction result is of unknown class, otherwise is/>The corresponding known category; wherein/>And/>Representing the probability value of the single-heat coding and the inverse single-heat coding of the ith category obtained by the third feature matrix after the sigmoid function, wherein the dimension is 2N; /(I)The classified pictures to be identified are subjected to prediction result matrix/>A prediction result is obtained after voting;
the codes of the labels of one class consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N;
The weight of the backbone network in the pre-trained unknown class classification model based on deep learning uses the weight of a training network; the structure of the model is as follows: the feature map is divided into two branches, and a first branch passes through the full-connection layer to obtain a first feature matrix; the second branch passes through the full connection layer to obtain a first uncertain matrix; after the first uncertain matrix passes through an activation function, a second uncertain matrix is obtained; randomly sampling a number in Gaussian distribution, multiplying the number by the second uncertainty matrix by bits, and adding the number by bits to the first feature matrix to obtain a second feature matrix;
and obtaining a third feature matrix after the second feature matrix passes through the full connection layer.
2. The unknown class classification method based on deep learning according to claim 1, wherein the loss function of the training network is composed of sigmoid cross entropy and KL divergence, and the formula is as follows:
;
;
;
Wherein, Representing the label after the coding mode; /(I)Representing the predicted outcome; /(I)Representing a second uncertainty matrix; representing a first feature matrix; /(I) Representing the weight balance factor.
3. An unknown class classification device based on deep learning, comprising the following units:
the image acquisition unit is used for acquiring images of the category to be identified;
The feature extraction unit is used for inputting the picture of the class to be identified into a pre-trained unknown class classification model based on deep learning to obtain a prediction result matrix of the picture; the model comprises a main network, a first full-connection layer, a second full-connection layer and an activation function layer, wherein the picture of the category to be identified obtains a feature map through the main network, the feature map obtains a first feature matrix through the first full-connection layer, the first feature matrix obtains a third feature matrix through the second full-connection layer, and the third feature matrix obtains a prediction result matrix through the activation function layer;
category recognition unit for predicting result matrix Voting to obtain prediction result/>The concrete mode of voting is that the prediction result matrix/>, is comparedAnd/>If/>Let/>; Otherwise/>; Pair/>Ordering in descending order, at which time the maximum probability value/>The corresponding category is the prediction result, if/>The prediction result is of unknown class, otherwise is/>The corresponding known category; wherein/>And/>Representing the probability value of the single-heat coding and the inverse single-heat coding of the ith category obtained by the third feature matrix after the sigmoid function, wherein the dimension is 2N; /(I)The classified pictures to be identified are subjected to prediction result matrix/>A prediction result is obtained after voting;
the codes of the labels of one class consist of the single thermal codes and the inverse single thermal codes, and the dimension of the labels is 2N;
The weight of the backbone network in the pre-trained unknown class classification model based on deep learning uses the weight of a training network; the structure of the model is as follows: the feature map is divided into two branches, and a first branch passes through the full-connection layer to obtain a first feature matrix; the second branch passes through the full connection layer to obtain a first uncertain matrix; after the first uncertain matrix passes through an activation function, a second uncertain matrix is obtained; randomly sampling a number in Gaussian distribution, multiplying the number by the second uncertainty matrix by bits, and adding the number by bits to the first feature matrix to obtain a second feature matrix;
and obtaining a third feature matrix after the second feature matrix passes through the full connection layer.
4. A deep learning based unknown class classification device according to claim 3, wherein the loss function of the training network is composed of sigmoid cross entropy and KL divergence together, as shown in the following formula:
;
;
;
Wherein, Representing the label after the coding mode; /(I)Representing the predicted outcome; /(I)Representing a second uncertainty matrix; representing a first feature matrix; /(I) Representing the weight balance factor.
5. A non-volatile storage medium containing instructions that, when executed, are to implement the method of any of claims 1-2.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110603719.4A CN113283514B (en) | 2021-05-31 | 2021-05-31 | Unknown class classification method, device and medium based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110603719.4A CN113283514B (en) | 2021-05-31 | 2021-05-31 | Unknown class classification method, device and medium based on deep learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113283514A CN113283514A (en) | 2021-08-20 |
CN113283514B true CN113283514B (en) | 2024-05-21 |
Family
ID=77282711
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110603719.4A Active CN113283514B (en) | 2021-05-31 | 2021-05-31 | Unknown class classification method, device and medium based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113283514B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114241526B (en) * | 2022-02-28 | 2022-05-17 | 南京甄视智能科技有限公司 | Classification model, training method, classification method, electronic device, and storage medium |
CN115690566B (en) * | 2022-10-24 | 2024-01-19 | 广东工业大学 | Deep sea animal new species identification method based on deep migration cluster learning |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109190646A (en) * | 2018-06-25 | 2019-01-11 | 北京达佳互联信息技术有限公司 | A kind of data predication method neural network based, device and nerve network system |
CN109815801A (en) * | 2018-12-18 | 2019-05-28 | 北京英索科技发展有限公司 | Face identification method and device based on deep learning |
CN109978041A (en) * | 2019-03-19 | 2019-07-05 | 上海理工大学 | A kind of hyperspectral image classification method based on alternately update convolutional neural networks |
CN110163177A (en) * | 2019-05-28 | 2019-08-23 | 李峥嵘 | A kind of wind power generation unit blade unmanned plane automatic sensing recognition methods |
CN110516718A (en) * | 2019-08-12 | 2019-11-29 | 西北工业大学 | The zero sample learning method based on depth embedded space |
CN110728224A (en) * | 2019-10-08 | 2020-01-24 | 西安电子科技大学 | Remote sensing image classification method based on attention mechanism depth Contourlet network |
CN110866565A (en) * | 2019-11-26 | 2020-03-06 | 重庆邮电大学 | Multi-branch image classification method based on convolutional neural network |
CN111461025A (en) * | 2020-04-02 | 2020-07-28 | 同济大学 | Signal identification method for self-evolving zero-sample learning |
CN111738357A (en) * | 2020-07-24 | 2020-10-02 | 完美世界(北京)软件科技发展有限公司 | Junk picture identification method, device and equipment |
CN112016616A (en) * | 2020-08-27 | 2020-12-01 | 西华大学 | High-frequency financial time sequence multi-class prediction method |
CN112101476A (en) * | 2020-09-23 | 2020-12-18 | 创新奇智(西安)科技有限公司 | Picture classification method and device, electronic equipment and storage medium |
CN112257808A (en) * | 2020-11-02 | 2021-01-22 | 郑州大学 | Integrated collaborative training method and device for zero sample classification and terminal equipment |
CN112364747A (en) * | 2020-11-04 | 2021-02-12 | 重庆高新区飞马创新研究院 | Target detection method under limited sample |
CN112418329A (en) * | 2020-11-25 | 2021-02-26 | 武汉大学 | Cervical OCT image classification method and system based on multi-scale textural feature fusion |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9916538B2 (en) * | 2012-09-15 | 2018-03-13 | Z Advanced Computing, Inc. | Method and system for feature detection |
-
2021
- 2021-05-31 CN CN202110603719.4A patent/CN113283514B/en active Active
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109190646A (en) * | 2018-06-25 | 2019-01-11 | 北京达佳互联信息技术有限公司 | A kind of data predication method neural network based, device and nerve network system |
CN109815801A (en) * | 2018-12-18 | 2019-05-28 | 北京英索科技发展有限公司 | Face identification method and device based on deep learning |
CN109978041A (en) * | 2019-03-19 | 2019-07-05 | 上海理工大学 | A kind of hyperspectral image classification method based on alternately update convolutional neural networks |
CN110163177A (en) * | 2019-05-28 | 2019-08-23 | 李峥嵘 | A kind of wind power generation unit blade unmanned plane automatic sensing recognition methods |
CN110516718A (en) * | 2019-08-12 | 2019-11-29 | 西北工业大学 | The zero sample learning method based on depth embedded space |
CN110728224A (en) * | 2019-10-08 | 2020-01-24 | 西安电子科技大学 | Remote sensing image classification method based on attention mechanism depth Contourlet network |
CN110866565A (en) * | 2019-11-26 | 2020-03-06 | 重庆邮电大学 | Multi-branch image classification method based on convolutional neural network |
CN111461025A (en) * | 2020-04-02 | 2020-07-28 | 同济大学 | Signal identification method for self-evolving zero-sample learning |
CN111738357A (en) * | 2020-07-24 | 2020-10-02 | 完美世界(北京)软件科技发展有限公司 | Junk picture identification method, device and equipment |
CN112016616A (en) * | 2020-08-27 | 2020-12-01 | 西华大学 | High-frequency financial time sequence multi-class prediction method |
CN112101476A (en) * | 2020-09-23 | 2020-12-18 | 创新奇智(西安)科技有限公司 | Picture classification method and device, electronic equipment and storage medium |
CN112257808A (en) * | 2020-11-02 | 2021-01-22 | 郑州大学 | Integrated collaborative training method and device for zero sample classification and terminal equipment |
CN112364747A (en) * | 2020-11-04 | 2021-02-12 | 重庆高新区飞马创新研究院 | Target detection method under limited sample |
CN112418329A (en) * | 2020-11-25 | 2021-02-26 | 武汉大学 | Cervical OCT image classification method and system based on multi-scale textural feature fusion |
Non-Patent Citations (1)
Title |
---|
基于深度优先随机森林分类器的目标检测;马娟娟;潘泉;梁彦;胡劲文;赵春晖;王华夏;;中国惯性技术学报(04);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN113283514A (en) | 2021-08-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11816570B2 (en) | Method for accelerated detection of object in videos, server, and non-transitory computer readable storage medium | |
CN109685110B (en) | Training method of image classification network, image classification method and device, and server | |
CN115937655B (en) | Multi-order feature interaction target detection model, construction method, device and application thereof | |
CN110188829B (en) | Neural network training method, target recognition method and related products | |
CN113283514B (en) | Unknown class classification method, device and medium based on deep learning | |
CN113850666A (en) | Service scheduling method, device, equipment and storage medium | |
CN112149754B (en) | Information classification method, device, equipment and storage medium | |
CN115731422A (en) | Training method, classification method and device of multi-label classification model | |
CN110751191A (en) | Image classification method and system | |
CN112016617B (en) | Fine granularity classification method, apparatus and computer readable storage medium | |
CN110163206B (en) | License plate recognition method, system, storage medium and device | |
CN116168394A (en) | Image text recognition method and device | |
CN108830302B (en) | Image classification method, training method, classification prediction method and related device | |
CN113255766B (en) | Image classification method, device, equipment and storage medium | |
US6052483A (en) | Methods and apparatus for classification of images using distribution maps | |
CN117523218A (en) | Label generation, training of image classification model and image classification method and device | |
CN117893839B (en) | Multi-label classification method and system based on graph attention mechanism | |
Ciuntu et al. | Real-time traffic sign detection and classification using machine learning and optical character recognition | |
CN112037174A (en) | Chromosome abnormality detection method, device, equipment and computer readable storage medium | |
CN114175053A (en) | Conversion device, conversion method, program, and information recording medium | |
CN115291813B (en) | Data storage method and device, data reading method and device, and equipment | |
CN117333035A (en) | Urban land utilization strategy generation method, device, equipment and medium based on satellite remote sensing data | |
CN112580505B (en) | Method and device for identifying network point switch door state, electronic equipment and storage medium | |
CN111709479B (en) | Image classification method and device | |
CN105844296B (en) | Remote sensing images scene classification method based on CDCP local description |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |