CN115099294A - Flower image classification algorithm based on feature enhancement and decision fusion - Google Patents
Flower image classification algorithm based on feature enhancement and decision fusion
- Publication number
- CN115099294A CN115099294A CN202210275922.8A CN202210275922A CN115099294A CN 115099294 A CN115099294 A CN 115099294A CN 202210275922 A CN202210275922 A CN 202210275922A CN 115099294 A CN115099294 A CN 115099294A
- Authority
- CN
- China
- Prior art keywords
- decision
- equal
- fusion
- dec
- stage
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000004927 fusion Effects 0.000 title claims abstract description 44
- 238000007635 classification algorithm Methods 0.000 title claims abstract description 15
- 238000012549 training Methods 0.000 claims abstract description 35
- 238000010606 normalization Methods 0.000 claims abstract description 14
- 238000000034 method Methods 0.000 claims abstract description 13
- 238000012360 testing method Methods 0.000 claims description 33
- 238000010586 diagram Methods 0.000 claims description 21
- 238000012795 verification Methods 0.000 claims description 21
- 230000004044 response Effects 0.000 claims description 13
- 230000002708 enhancing effect Effects 0.000 claims description 7
- 230000000694 effects Effects 0.000 claims description 6
- 238000011176 pooling Methods 0.000 claims description 5
- 230000004913 activation Effects 0.000 claims description 4
- 239000011159 matrix material Substances 0.000 claims description 3
- 230000001131 transforming effect Effects 0.000 claims description 3
- 238000000605 extraction Methods 0.000 abstract description 7
- 238000007781 pre-processing Methods 0.000 abstract description 7
- 230000006870 function Effects 0.000 description 5
- 230000003044 adaptive effect Effects 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 230000000873 masking effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012800 visualization Methods 0.000 description 2
- 230000003042 antagonistic effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000003709 image segmentation Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 230000003313 weakening effect Effects 0.000 description 1
Images
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Image Analysis (AREA)
Abstract
The invention provides a flower image classification algorithm based on feature enhancement and decision fusion, and belongs to the technical field of image classification. The method comprises a data preprocessing stage, a feature extraction and enhancement stage, a training stage and a decision fusion classification stage. In the data preprocessing stage, the data set is cropped, scaled and normalized; in the feature extraction and enhancement stage, the preprocessed data is fed into a VGG16 model pre-trained on ImageNet to extract multilayer deep image features, and a feature enhancement strategy is introduced to adaptively assign feature weights; in the training stage, several custom softmax classifiers are trained on the sets of features from the previous stage; in the decision fusion classification stage, information entropy is introduced to express each classifier's certainty, fusion weights are determined from the information entropy, and the decisions are fused to perform classification. The adaptively enhanced features have stronger expression capability, and the fused decision has stronger classification capability than a conventional softmax classifier.
Description
Technical Field
The invention relates to a flower image classification algorithm based on feature enhancement and decision fusion, and belongs to the technical field of image classification.
Background
Image classification is a fundamental problem in computer vision and the basis of technologies such as image localization, object detection and image segmentation. Many image classification algorithms exist, including algorithms based on hand-crafted features and algorithms based on neural networks such as convolutional neural networks, adversarial neural networks and attention mechanisms.
The quality of the extracted features is key to the classification effect. Previous research has usually focused on improving image preprocessing methods, network structures, activation functions and the like, aiming to extract better image features. However, such work rarely optimizes the extracted features themselves or studies joint decision-making with multiple softmax classifiers, so classification accuracy still has room for improvement.
Disclosure of Invention
The invention aims to solve the technical problem of providing a flower image classification algorithm based on feature enhancement and decision fusion, so as to address misclassification caused by insufficient feature expression and by the over-assertive decisions of a single softmax classifier.
The technical scheme of the invention is as follows: a flower image classification algorithm based on feature enhancement and decision fusion comprises a data preprocessing stage, a feature extraction and enhancement stage, a training stage and a decision fusion classification stage. In the data preprocessing stage, the data set is cropped, scaled and normalized. In the feature extraction and enhancement stage, the preprocessed data is fed into a VGG16 model pre-trained on ImageNet to extract multilayer deep image features, and a feature enhancement strategy is introduced to adaptively assign feature weights. The training stage trains several custom softmax classifiers on the sets of features from the previous stage. The decision fusion classification stage introduces information entropy to express each classifier's certainty, determines fusion weights from the information entropy, and fuses the decisions to perform classification.
The method comprises the following specific steps:
step 1: and carrying out corresponding operations of cutting, scaling and normalization on the data set.
The Step1 specifically comprises the following steps:
Step1.1: Divide the image data set into a training set, a validation set and a test set.
Step1.2: and (3) cutting each image in the training set into k square images by adopting a normal random cutting strategy (each image can be cut for the required times to expand the data of the training set), and cutting each image in the verification set and the test set into one square image by adopting a central cutting strategy.
Step1.3: and (3) scaling the images in the training set, the test set and the verification set after the cropping to be 224 × 224 of the size required by the VGG16 model by using a Lanczos interpolation mode.
Step1.4: and linearly transforming the pixel values of the images in the training set, the verification set and the test set after the scaling from the range of [0,255] to the range of [0,1] by adopting a maximum and minimum normalization mode.
Step 2: sending the preprocessed data into a VGG16 model pre-trained by ImageNet to extract multilayer high-level features, and introducing a feature enhancement strategy to adaptively distribute feature weights so as to realize the effect of adaptively enhancing the features.
The Step2 is specifically as follows:
Step2.1: Remove the three fully connected layers of the pre-trained VGG16 model, and feed the training, validation and test sets into the truncated network to extract high-level features.
Step2.2: the characteristics of the block i _ convj layer and the block5_ pool layer of the training set, the verification set and the test set are respectively saved.
Step2.3: and self-adaptive enhancing is carried out on the features of the blocki _ convj layer by using a feature enhancing masking strategy to obtain blocki _ convj _ en.
Let the blocki_convj output feature map be $M_{ij} = F_{ij}(x, y, p)$ and the blocki_convj_en output feature map be $M_{ij\_en} = F_{ij\_en}(x, y, p)$. A feature-enhancement mask $M_{en} = F_{en}(x, y, p)$ is introduced such that $M_{ij\_en} = M_{ij} \times M_{en}$, where $1 \le x, y \le W$ and $1 \le p \le P$, W being the side length of the feature map and P its number of channels.
Step2.4: and splicing the block i _ convj _ en layer and the block5_ pool layer into a new tensor Concat _ ij.
The Step2.3 specifically comprises the following steps:
Step2.3.1: Denote the blocki_convj layer feature map by $M_{ij}$, of dimension $W \times W \times P$, i.e. each of the P channels is a pixel matrix of size $W \times W$. Compress the P channels into one channel to obtain a superposition map $M_{stack}$ of dimension $W \times W$ and compute its average $M_{average}$: with $M_{ij} = F_{ij}(x, y, p)$ and the p-th channel feature denoted $m_p$, $M_{stack} = m_1 + m_2 + \cdots + m_P$ and $M_{average} = M_{stack}/P$.
Step2.3.2: Let thres, computed from the average $M_{average}$ of Step2.3.1, denote the threshold that distinguishes high-response regions from non-high-response regions; the high-response regions are more likely to be the regions where flowers appear.
Step2.3.3: If $M_{ij}(x, y, p) \ge thres$, then $M_{en} = 1 + (M_{ij}(x, y, p) - thres)/(a - thres)$; if $M_{ij}(x, y, p) < thres$, then $M_{en} = b$, where a and b are hyperparameters.
Step2.3.4: $M_{ij\_en} = M_{ij} \times M_{en}$.
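The enhancement mask of Steps 2.3.1-2.3.4 can be sketched as follows. Step2.3.2 derives thres from $M_{average}$ without giving the exact expression, so this sketch assumes the scalar threshold is the mean of $M_{average}$; the function name and the default values of a and b are likewise illustrative.

```python
import numpy as np

def enhance_feature_map(M_ij: np.ndarray, a: float = 1.5, b: float = 0.5) -> np.ndarray:
    """M_ij: (W, W, P) blocki_convj feature map; returns M_ij_en = M_ij * M_en."""
    P = M_ij.shape[-1]
    M_stack = M_ij.sum(axis=-1)      # compress the P channels -> (W, W)
    M_average = M_stack / P          # Step 2.3.1 average map
    # Assumption: scalar thres taken as the mean of M_average; the text
    # states only that thres is computed from M_average.
    thres = float(M_average.mean())
    M_en = np.where(M_ij >= thres,
                    1.0 + (M_ij - thres) / (a - thres),  # boost high responses
                    b)                                   # attenuate the rest
    return M_ij * M_en
```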
Step 3: multiple custom softmax classifiers are trained using the previous stage's sets of features.
The Step3 is specifically as follows:
Step3.1: Connect a global average pooling (GAP) layer to a fully connected layer containing 1024 nodes, and construct a softmax classifier using the softmax function as the activation function.
Step3.2: Feed each tensor Concat_ij into the classifier, and train and validate using the output features of the blocki_convj and block5_pool layers.
Step3.3: and storing the model trained by Concat _ ij to obtain a decision model dec _ brij.
Step 4: and introducing information entropy to express the certainty degree of each classifier, so that decision fusion weight is adaptively distributed, and classification is realized through decision fusion.
The Step4 is specifically as follows:
Step4.1: Test the test-set images with dec_brij to obtain the probability that each sample belongs to each class. For a sample $x_n$, the test result of decision branch dec_brij is $p_{ij}(x_n) = [p_{ij1}(x_n), p_{ij2}(x_n), \ldots, p_{ijL}(x_n)]$, where L is the total number of classes and $p_{ijl}(x_n)$ is the probability that dec_brij predicts sample $x_n$ to belong to class l.
Step4.2: Introduce the information entropy $H(x)$ to represent the certainty the decision model has about its decision: $H_{ij}(x) = -\sum_{l=1}^{L} p_{ijl}(x) \log_2 p_{ijl}(x)$, where $1 \le i \le 5$ and $1 \le j \le 3$.
$H_{ij}(x)$ indicates the certainty of decision branch dec_brij about the classification result, and $p_{ijl}(x)$ is the probability that dec_brij assigns sample x to class l.
Step4.3: Introduce a decision weight ω to determine the concrete decision-fusion rule. ω is defined as $\omega_{ij} = \exp(-H_{ij}(x)) / \sum_{i,j} \exp(-H_{ij}(x))$, where $1 \le i \le 5$ and $1 \le j \le 3$; $\omega_{ij}$ is the decision weight assigned to dec_brij.
Step4.5: For sample $x_n$, the decision-fusion probability over the classes is $P(x_n) = \sum_{i,j} \omega_{ij}\, p_{ij}(x_n)$; its l-th component is the fused probability that sample $x_n$ belongs to class l.
Step4.6: For a sample x, the final predicted class is $label(x) = \arg\max(P(x))$.
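By way of illustration, Steps 4.1-4.6 reduce to a few lines of NumPy. The sketch below uses the base-2 logarithm of Step4.2 and the weight rule of Step4.3; the eps guard against log(0) is an implementation detail not in the text.

```python
import numpy as np

def fuse_decisions(branch_probs, eps=1e-12):
    """branch_probs: one (L,) probability vector per decision branch dec_brij."""
    H = np.array([-(p * np.log2(p + eps)).sum() for p in branch_probs])
    w = np.exp(-H) / np.exp(-H).sum()   # more certain branches weigh more
    P = sum(wi * pi for wi, pi in zip(w, branch_probs))
    return int(np.argmax(P)), P         # predicted label and fused probabilities
```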
The invention has the following beneficial effects: it combines feature enhancement with feature fusion, and the softmax classifier with information entropy, giving the following advantages:
1. The adaptively enhanced features have stronger expression capability;
2. Compared with a conventional softmax classifier, the fused decision has stronger classification capability.
Drawings
FIG. 1 is a diagram of a feature extraction and enhancement network of the present invention;
FIG. 2 is a block diagram of a decision fusion of the present invention;
FIG. 3 is a dimension diagram of a network-layer output feature map, where W denotes the side length of the feature map and P the number of channels;
FIG. 4 is an overlay visualization of features of the present invention;
FIG. 5 is a feature overlay visualization after feature enhancement of the present invention;
FIG. 6 is a flow chart of the steps of the present invention.
Detailed Description
The invention is further described with reference to the following drawings and detailed description.
As shown in FIG. 6, a flower image classification algorithm based on feature enhancement and decision fusion includes a data preprocessing stage, a feature extraction and enhancement stage, a training stage, and a decision fusion classification stage. In the data preprocessing stage, the data set is cropped, scaled and normalized; in the feature extraction and enhancement stage, the preprocessed data is fed into a VGG16 model pre-trained on ImageNet to extract multilayer deep image features, and a feature enhancement strategy is introduced to adaptively assign feature weights; in the training stage, several custom softmax classifiers are trained on the sets of features from the previous stage; in the decision fusion classification stage, information entropy is introduced to express each classifier's certainty, fusion weights are determined from the information entropy, and fused decision classification is performed.
The method comprises the following specific steps:
step 1: and carrying out corresponding operations such as cutting, scaling, normalization and the like on the data set.
Step 2: and sending the preprocessed data into a VGG16 model pre-trained by ImageNet to extract multilayer depth image features, and introducing a feature enhancement strategy to adaptively distribute feature weights so as to realize adaptive enhancement features.
Step 3: multiple custom softmax classifiers are trained using the previous stage's sets of features.
Step 4: and introducing information entropy to express the certainty degree of each classifier, thereby adaptively allocating decision fusion weights and realizing classification through decision fusion.
The Step1 is specifically as follows:
Step1.1: Divide the image data set into a training set, a validation set and a test set.
In this example, the public Oxford 17 Flowers data set released by the Visual Geometry Group of the University of Oxford is selected; it covers 17 flower species common in the UK, with 80 images per species and 1360 images in the whole data set. Because the data set is small, 50 images of each flower are randomly selected as the training set, 15 as the validation set and 15 as the test set. Furthermore, since usable features cannot be extracted for four of the classes, most of the literature tests on 13 flower classes, and the same protocol is used here to evaluate the performance of the invention.
Step1.2: and (3) cutting each image in the training set into k square images by adopting a normal random cutting strategy (each image can be cut for the required times to expand the data of the training set), and cutting each image in the verification set and the test set into one square image by adopting a central cutting strategy.
In this embodiment, the invention randomly crops the training, validation and test set images 5 times each, expanding the data to 5 times the original: 3250 training images, 975 validation images and 975 test images.
Step1.3: and scaling the clipped training set, the test set and the verification set to be 224 multiplied by 224 required by the VGG16 model by using a Lanczos interpolation mode.
Step1.4: and linearly transforming the pixel values of the training set, the verification set and the test set after the scaling from the range of [0,255] to the range of [0,1] by adopting a maximum and minimum normalization mode.
In this embodiment, take sample $x_1$ as an example, as shown in the normalization portion on the left of FIG. 1: before normalization the sample resolution is 540 × 500 with pixel values in the range [0, 255]; after normalization the resolution is the 224 × 224 required by the model with pixel values in [0, 1]. The pixels before normalization, of shape 540 × 500 × 3 where "3" denotes the three RGB channels, are [[[56, 56, 56, ..., 26, 26, 26], [57, 58, 58, ..., 26, 26, 26], [59, 60, 61, ..., 26, 26, 26], ..., [6, 5, 5, ..., 14, 14, 14], [6, 5, 5, ..., 14, 14, 14], [6, 5, 5, ..., 13, 13, 13]], [[78, 78, 78, ..., 29, 29, 29], [79, 80, 80, ..., 29, 29, 29], [82, 83, 84, ..., 29, 29, 29], ..., [6, 5, 5, ..., 16, 16, 16], [6, 5, 5, ..., 16, 16, 16], [6, 5, 5, ..., 15, 15, 15]]]. The pixels after normalization, of shape 224 × 224 × 3, are [[[0.21960784, 0.21960784, 0.21960784, ..., 0.10196079, 0.10196079, 0.10196079], [0.22352941, 0.22745098, 0.22745098, ..., 0.10196079, 0.10196079, 0.10196079], ..., [0.02352941, 0.01960784, 0.01960784, ..., 0.05098039, 0.05098039, 0.05098039]], [[0.11372549, 0.11372549, 0.11372549, ..., 0.03921569, 0.03921569, 0.03921569], [0.11764706, 0.12156863, 0.12156863, ..., 0.03921569, 0.03921569, 0.03921569], ..., [0.02352941, 0.01960784, 0.02745098, ..., 0.04705882, 0.04705882, 0.04705882]]].
Step2.1: and removing three full connection layers of the pre-trained VGG16 model, and sending the training set, the verification set and the test set into the model to extract high-level features.
Step2.2: the characteristics of the block i _ convj layer and the block5_ pool layer of the training set, the verification set and the test set are respectively saved.
In this embodiment, i in blocki_convj denotes the i-th network block and j the j-th convolutional layer; for example, block5_conv2 is the 2nd convolutional layer of the 5th network block of the VGG16 model. Taking block5_conv2 as an example, the layer's feature-map dimensions are (14, 14, 512), i.e. the feature map is 14 × 14 with 512 channels.
Step2.3: and carrying out self-adaptive enhancement on the feature graph of the blocki _ convj layer by using a feature enhancement masking strategy to obtain blocki _ convj _ en.
Let the blocki_convj output feature map be $M_{ij} = F_{ij}(x, y, p)$ and the blocki_convj_en output feature map be $M_{ij\_en} = F_{ij\_en}(x, y, p)$. A feature-enhancement mask $M_{en} = F_{en}(x, y, p)$ is introduced such that $M_{ij\_en} = M_{ij} \times M_{en}$, where $1 \le x, y \le W$ and $1 \le p \le P$, W being the side length of the feature map and P its number of channels.
In this embodiment, taking block5_conv2 and block5_conv2_en as examples, their output feature maps $M_{52}$ and $M_{52\_en}$ both have dimensions 14 × 14 × 512, i.e. W = 14 and P = 512.
Step2.4: and splicing the block i _ convj _ en layer and the block5_ pool layer into a new tensor Concat _ ij.
In this embodiment, taking the block5_conv2_en and block5_pool layers as examples: the block5_conv2_en feature map has dimensions 14 × 14 × 512 and yields a one-dimensional tensor A of 512 parameters through a global average pooling layer; the block5_pool feature map has dimensions 7 × 7 × 512 and likewise yields a one-dimensional tensor B of 512 parameters. Merging the tensors A and B generates a one-dimensional tensor Concat_52 containing 1024 parameters.
Step2.3.1: Denote the blocki_convj layer feature map by $M_{ij}$ (as shown in FIG. 1), of dimension $W \times W \times P$ (each of the P channels is a pixel matrix of size $W \times W$). Compress the P channels into one channel to obtain a superposition map $M_{stack}$ of dimension $W \times W$ and compute its average $M_{average}$: with $M_{ij} = F_{ij}(x, y, p)$ and the p-th channel feature map denoted $m_p$, $M_{stack} = m_1 + m_2 + \cdots + m_P$ and $M_{average} = M_{stack}/P$.
In this embodiment, the 512-channel block5_conv2 feature maps are compressed into one channel, giving a 14 × 14 superposition map $M_{stack}$; the superposition of the block5_conv2 feature map is shown in FIG. 4.
Step2.3.2: Let thres, computed from the average $M_{average}$ of Step2.3.1, denote the threshold that distinguishes high-response regions from non-high-response regions; the high-response regions are more likely to be the regions where flowers appear.
In this embodiment, the sample's threshold thres for distinguishing the high-response region from the non-high-response region is 0.24598.
Step2.3.3: If $M_{ij}(x, y, p) \ge thres$, then $M_{en} = 1 + (M_{ij}(x, y, p) - thres)/(a - thres)$; if $M_{ij}(x, y, p) < thres$, then $M_{en} = b$.
In this embodiment, a and b are hyperparameters: a is the factor that amplifies the features of the high-response region and adjusts the degree of feature enhancement, and b is the factor that attenuates the non-high-response region, with $1 \le a \le 2$ and $0 \le b \le 1$.
Step2.3.4: $M_{ij\_en} = M_{ij} \times M_{en}$.
In this embodiment, after adaptive feature enhancement, the superposition of the block5_conv2 and block5_pool feature maps is shown in FIG. 5.
Step3.1: and connecting a global average pooling layer (GAP) with a full connection layer containing 1024 nodes, and constructing a softmax classifier by using a softmax function as an activation function.
Step3.2: and (3) respectively accessing the tensor Concat _ ij into the classifier, and training and verifying by adopting the output characteristics of the block i _ convj and the block5_ pool layers.
In this embodiment, training uses the Adam optimizer with a batch size of 32 for 50 epochs per run; during training, the model is saved whenever its average accuracy exceeds that of the previous best epoch.
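A sketch of this training setup, assuming Keras and the build_dec_branch sketch above. The loss function is not stated in the text, so categorical cross-entropy is an assumption; the random arrays stand in for the saved layer features and labels, and validation_split replaces the separate validation set for brevity.

```python
import numpy as np
from tensorflow.keras.callbacks import ModelCheckpoint

model = build_dec_branch()                       # from the Step 3 sketch above
model.compile(optimizer="adam",                  # Adam optimizer, as stated
              loss="categorical_crossentropy",   # assumed; not given in the text
              metrics=["accuracy"])

# Dummy stand-ins for the saved block5_conv2_en / block5_pool features and labels.
conv_train = np.random.rand(32, 14, 14, 512).astype("float32")
pool_train = np.random.rand(32, 7, 7, 512).astype("float32")
y_train = np.eye(13)[np.random.randint(0, 13, 32)]

checkpoint = ModelCheckpoint("dec_brij_best.h5", monitor="val_accuracy",
                             save_best_only=True, mode="max")
model.fit([conv_train, pool_train], y_train, validation_split=0.2,
          batch_size=32, epochs=50, callbacks=[checkpoint])
```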
Step3.3: and storing the model trained by Concat _ ij to obtain a decision model dec _ brij.
In this embodiment, taking dec_br52 as an example, dec_br52 denotes the decision model obtained by training the classifier on the splice of block5_conv2_en and block5_pool.
Step4.1: Test the test-set images with dec_brij to obtain the probability that each sample belongs to each class. For a sample $x_n$, the test result of decision branch dec_brij is $p_{ij}(x_n) = [p_{ij1}(x_n), p_{ij2}(x_n), \ldots, p_{ijL}(x_n)]$, where L is the total number of classes and $p_{ijl}(x_n)$ is the probability that dec_brij predicts sample $x_n$ to belong to class l.
In this embodiment, taking dec_br51, dec_br52 and dec_br53 as examples, the test result of dec_br51 is $p_{51}(x_1)$ = [1.0000000e+00, 4.1446593e-17, 1.9549345e-15, 7.9526921e-20, 2.0109854e-20, 3.1513676e-19, 3.7332855e-20, 2.1863510e-23, 1.3979170e-16, 2.2339542e-20, 9.2374559e-15, 5.8055036e-11, 3.0085874e-23].
The test result of dec_br52 is $p_{52}(x_1)$ = [9.9999547e-01, 7.5567144e-09, 7.8939694e-10, 5.6319975e-17, 7.2288855e-13, 3.3209550e-14, 4.4501590e-12, 7.1541192e-16, 5.3460147e-10, 4.9500820e-12, 6.0213007e-10, 4.4727053e-06, 1.1179825e-15].
The test result of dec_br53 is $p_{53}(x_1)$ = [9.9999964e-01, 8.5260909e-11, 6.1670391e-08, 6.0080515e-14, 2.3101764e-11, 6.7670508e-11, 1.8991200e-09, 4.3540345e-12, 2.2171498e-07, 7.6678863e-10, 3.7856211e-09, 3.0988421e-08, 1.2395125e-10].
Step4.2: Introduce the information entropy $H(x)$ to represent the certainty the decision model has about its decision: $H_{ij}(x) = -\sum_{l=1}^{L} p_{ijl}(x) \log_2 p_{ijl}(x)$, where $1 \le i \le 5$ and $1 \le j \le 3$. $H_{ij}(x)$ indicates the certainty of decision branch dec_brij about the classification result, and $p_{ijl}(x)$ is the probability that dec_brij assigns sample x to class l.
In this embodiment, the information entropy of dec_br51 is $H_{51}(x_1)$ = 3.986313764776386e-08, that of dec_br52 is $H_{52}(x_1)$ = 8.630309145106366e-05, and that of dec_br53 is $H_{53}(x_1)$ = 7.871773456765137e-06.
Step4.3: Introduce a decision weight ω to determine the concrete decision-fusion rule. ω is defined as $\omega_{ij} = \exp(-H_{ij}(x)) / \sum_{i,j} \exp(-H_{ij}(x))$, where $1 \le i \le 5$ and $1 \le j \le 3$; $\omega_{ij}$ is the decision weight assigned to dec_brij.
In this embodiment, the decision weight of dec_br51 is $\omega_{51}$ = 0.33334378825983924, that of dec_br52 is $\omega_{52}$ = 0.3333150341887531, and that of dec_br53 is $\omega_{53}$ = 0.33334117755140763.
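These weights follow from the entropies via the Step4.3 formula, as the following check (a sketch assuming NumPy) shows:

```python
import numpy as np

H = np.array([3.986313764776386e-08,    # H_51(x_1)
              8.630309145106366e-05,    # H_52(x_1)
              7.871773456765137e-06])   # H_53(x_1)
w = np.exp(-H) / np.exp(-H).sum()
print(w)   # -> [0.33334379 0.33331503 0.33334118], matching the text
```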
In this embodiment, the outputs of dec_br51, dec_br52 and dec_br53 are each multiplied by their respective decision weights before fusion.
Step4.5: For sample $x_n$, the decision-fusion probability over the classes is $P(x_n) = \sum_{i,j} \omega_{ij}\, p_{ij}(x_n)$; its l-th component is the fused probability that sample $x_n$ belongs to class l.
In this embodiment, the decision-fusion probabilities of sample $x_1$ over the classes are $P(x_1)$ = [9.99998371e-01, 2.58543500e-09, 2.08537332e-08, 1.00000001e-10, 1.00000001e-10, 1.00000001e-10, 6.99720771e-10, 1.00000001e-10, 7.41182590e-08, 3.22268109e-10, 1.49593680e-09, 1.50118298e-06, 1.07983939e-10].
Step4.6: For a sample x, the final predicted class is $label(x) = \arg\max(P(x))$.
In this embodiment, it can be seen from $P(x_1)$ that the first probability value is the largest, so sample $x_1$ belongs to class 0.
The present invention can be further illustrated by the following experimental results.
The experimental environment: the CPU is an Intel(R) Core(TM) i7-7700 @ 3.60 GHz, the memory is 32 GB, the operating system is Windows 10, and the development environment is Jupyter Notebook.
Experimental data: the invention uses the Oxford 17 Flowers public data set for image classification.
Analysis of experimental results: the invention introduces an enhancement mask to adaptively enhance the features, improving their expression capability, and introduces information entropy to fuse multiple decisions for classification. The classification accuracy of Yang's method on this data set reaches 95.41%. Table 1 is a statistical summary of the classification accuracy of the method of the present invention under feature enhancement on this data set; the data in the table show that the feature-enhancement method of the invention outperforms Yang's method in most cases, and the classification accuracy of the method of the invention on this data set reaches 97.03%.
TABLE 1
Table 2 compares the classification accuracy of single decisions and of decision fusion on this data set. The experimental data show that when the individual decisions are close to each other, decision fusion achieves a better result, but when they differ greatly, a decision-neutralization effect may occur.
TABLE 2
While the present invention has been described in detail with reference to the embodiments shown in the drawings, the invention is not limited to those embodiments, and various changes may be made within the knowledge of those skilled in the art without departing from its spirit.
Claims (6)
1. A flower image classification algorithm based on feature enhancement and decision fusion, characterized by comprising the following steps:
step 1: performing cropping, scaling and normalization operations on the data set;
step 2: feeding the preprocessed data into a VGG16 model pre-trained on ImageNet to extract multilayer high-level features, and introducing a feature enhancement strategy to adaptively assign feature weights, thereby adaptively enhancing the features;
step 3: training a plurality of custom softmax classifiers using the sets of features from the previous stage;
step 4: introducing information entropy to express each classifier's certainty, thereby adaptively assigning decision-fusion weights, and performing classification through decision fusion.
2. A flower image classification algorithm based on feature enhancement and decision fusion as claimed in claim 1, wherein Step1 is specifically:
step1.1: dividing the image data set into a training set, a validation set and a test set;
step1.2: cropping each image in the training set into k square images using a normal random cropping strategy, and cropping each image in the validation and test sets into one square image using a center cropping strategy;
step1.3: scaling the cropped images in the training, validation and test sets to 224 × 224, the input size required by the VGG16 model, using Lanczos interpolation;
step1.4: linearly transforming the pixel values of the scaled images in the training, validation and test sets from the range [0, 255] to the range [0, 1] using max-min normalization.
3. A flower image classification algorithm based on feature enhancement and decision fusion as claimed in claim 1, wherein Step2 is specifically:
step2.1: removing the three fully connected layers of the pre-trained VGG16 model, and feeding the training set, validation set and test set into the truncated network to extract high-level features;
step2.2: saving the features of the blocki_convj layers and the block5_pool layer of the training, validation and test sets separately;
step2.3: adaptively enhancing the features of the blocki_convj layer using a feature-enhancement mask strategy to obtain blocki_convj_en;
if the blocki_convj output feature map is denoted $M_{ij} = F_{ij}(x, y, p)$ and the blocki_convj_en output feature map $M_{ij\_en} = F_{ij\_en}(x, y, p)$, a feature-enhancement mask $M_{en} = F_{en}(x, y, p)$ is introduced such that $M_{ij\_en} = M_{ij} \times M_{en}$, where $1 \le x, y \le W$ and $1 \le p \le P$, W being the side length of the feature map and P its number of channels;
step2.4: splicing the blocki_convj_en layer and the block5_pool layer into a new tensor Concat_ij.
4. A flower image classification algorithm based on feature enhancement and decision fusion as claimed in claim 3, wherein said Step2.3 is specifically:
step2.3.1: denoting the blocki_convj layer feature map by $M_{ij}$, of dimension $W \times W \times P$, i.e. each of the P channels is a pixel matrix of size $W \times W$; compressing the P channels into one channel to obtain a superposition map $M_{stack}$ of dimension $W \times W$ and computing its average $M_{average}$, where $M_{ij} = F_{ij}(x, y, p)$, the p-th channel feature is denoted $m_p$, $M_{stack} = m_1 + m_2 + \cdots + m_P$, and $M_{average} = M_{stack}/P$;
Step2.3.2: letting thres, computed from the average $M_{average}$ of Step2.3.1, denote the threshold that distinguishes high-response regions from non-high-response regions, the high-response regions being more likely to be the regions in which flowers appear;
step2.3.3: if $M_{ij}(x, y, p) \ge thres$, then $M_{en} = 1 + (M_{ij}(x, y, p) - thres)/(a - thres)$; if $M_{ij}(x, y, p) < thres$, then $M_{en} = b$, where a and b are hyperparameters;
Step2.3.4: $M_{ij\_en} = M_{ij} \times M_{en}$.
5. A flower image classification algorithm based on feature enhancement and decision fusion as claimed in claim 1, wherein Step3 is specifically:
step3.1: connecting a global average pooling layer to a fully connected layer containing 1024 nodes, and constructing a softmax classifier with the softmax function as the activation function;
step3.2: feeding each tensor Concat_ij into the classifier, and training and validating using the output features of the blocki_convj and block5_pool layers;
step3.3: saving the model trained on Concat_ij to obtain the decision model dec_brij.
6. A flower image classification algorithm based on feature enhancement and decision fusion as claimed in claim 1, wherein Step4 is specifically:
step4.1: testing the test-set images with dec_brij to obtain the probability that each sample belongs to each class; for a sample $x_n$, the test result of decision branch dec_brij is $p_{ij}(x_n) = [p_{ij1}(x_n), p_{ij2}(x_n), \ldots, p_{ijL}(x_n)]$, where L is the total number of classes and $p_{ijl}(x_n)$ is the probability that dec_brij predicts sample $x_n$ to belong to class l;
step4.2: introducing the information entropy $H(x)$ to represent the certainty the decision model has about its decision: $H_{ij}(x) = -\sum_{l=1}^{L} p_{ijl}(x) \log_2 p_{ijl}(x)$, where $1 \le i \le 5$ and $1 \le j \le 3$;
$H_{ij}(x)$ indicates the certainty of decision branch dec_brij about the classification result, and $p_{ijl}(x)$ is the probability that dec_brij assigns sample x to class l;
step4.3: introducing a decision weight ω to determine the concrete decision-fusion rule, defined as $\omega_{ij} = \exp(-H_{ij}(x)) / \sum_{i,j} \exp(-H_{ij}(x))$, where $1 \le i \le 5$ and $1 \le j \le 3$, $\omega_{ij}$ being the decision weight assigned to dec_brij;
Step4.5: for sample $x_n$, computing the decision-fusion probability over the classes as $P(x_n) = \sum_{i,j} \omega_{ij}\, p_{ij}(x_n)$, whose l-th component is the fused probability that sample $x_n$ belongs to class l;
step4.6: for a sample x, the final predicted class is $label(x) = \arg\max(P(x))$.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210275922.8A CN115099294B (en) | 2022-03-21 | 2022-03-21 | Flower image classification algorithm based on feature enhancement and decision fusion |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210275922.8A CN115099294B (en) | 2022-03-21 | 2022-03-21 | Flower image classification algorithm based on feature enhancement and decision fusion |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115099294A true CN115099294A (en) | 2022-09-23 |
CN115099294B CN115099294B (en) | 2024-07-19 |
Family
ID=83287883
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210275922.8A Active CN115099294B (en) | 2022-03-21 | 2022-03-21 | Flower image classification algorithm based on feature enhancement and decision fusion |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115099294B (en) |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130156301A1 (en) * | 2011-12-19 | 2013-06-20 | Industrial Technology Research Institute | Method and system for recognizing images |
GB201703310D0 (en) * | 2017-03-01 | 2017-04-12 | Toshiba Kk | A feature extraction system, an automatic speech recognition system, a feature extraction method, an automatic speech recognition method and a method of train |
CN110222562A (en) * | 2019-04-26 | 2019-09-10 | 昆明理工大学 | A kind of method for detecting human face based on Fast R-CNN |
CN110569905A (en) * | 2019-09-10 | 2019-12-13 | 江苏鸿信系统集成有限公司 | Fine-grained image classification method based on generation of confrontation network and attention network |
CN111028207A (en) * | 2019-11-22 | 2020-04-17 | 东华大学 | Button flaw detection method based on brain-like immediate-universal feature extraction network |
WO2020077940A1 (en) * | 2018-10-16 | 2020-04-23 | Boe Technology Group Co., Ltd. | Method and device for automatic identification of labels of image |
WO2020204525A1 (en) * | 2019-04-01 | 2020-10-08 | 한양대학교 산학협력단 | Combined learning method and device using transformed loss function and feature enhancement based on deep neural network for speaker recognition that is robust in noisy environment |
WO2020215984A1 (en) * | 2019-04-22 | 2020-10-29 | 腾讯科技(深圳)有限公司 | Medical image detection method based on deep learning, and related device |
US20200388287A1 (en) * | 2018-11-13 | 2020-12-10 | CurieAI, Inc. | Intelligent health monitoring |
CN112614131A (en) * | 2021-01-10 | 2021-04-06 | 复旦大学 | Pathological image analysis method based on deformation representation learning |
CN113887610A (en) * | 2021-09-29 | 2022-01-04 | 内蒙古工业大学 | Pollen image classification method based on cross attention distillation transducer |
CN114092819A (en) * | 2022-01-19 | 2022-02-25 | 成都四方伟业软件股份有限公司 | Image classification method and device |
WO2023221951A2 (en) * | 2022-05-14 | 2023-11-23 | 北京大学 | Cell differentiation based on machine learning using dynamic cell images |
CN117195148A (en) * | 2023-09-07 | 2023-12-08 | 西安科技大学 | Ore emotion recognition method based on expression, electroencephalogram and voice multi-mode fusion |
CN117350925A (en) * | 2023-10-31 | 2024-01-05 | 云南电网有限责任公司电力科学研究院 | Inspection image infrared visible light image fusion method, device and equipment |
-
2022
- 2022-03-21 CN CN202210275922.8A patent/CN115099294B/en active Active
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130156301A1 (en) * | 2011-12-19 | 2013-06-20 | Industrial Technology Research Institute | Method and system for recognizing images |
GB201703310D0 (en) * | 2017-03-01 | 2017-04-12 | Toshiba Kk | A feature extraction system, an automatic speech recognition system, a feature extraction method, an automatic speech recognition method and a method of train |
WO2020077940A1 (en) * | 2018-10-16 | 2020-04-23 | Boe Technology Group Co., Ltd. | Method and device for automatic identification of labels of image |
US20200388287A1 (en) * | 2018-11-13 | 2020-12-10 | CurieAI, Inc. | Intelligent health monitoring |
WO2020204525A1 (en) * | 2019-04-01 | 2020-10-08 | 한양대학교 산학협력단 | Combined learning method and device using transformed loss function and feature enhancement based on deep neural network for speaker recognition that is robust in noisy environment |
WO2020215984A1 (en) * | 2019-04-22 | 2020-10-29 | 腾讯科技(深圳)有限公司 | Medical image detection method based on deep learning, and related device |
CN110222562A (en) * | 2019-04-26 | 2019-09-10 | 昆明理工大学 | A kind of method for detecting human face based on Fast R-CNN |
CN110569905A (en) * | 2019-09-10 | 2019-12-13 | 江苏鸿信系统集成有限公司 | Fine-grained image classification method based on generation of confrontation network and attention network |
CN111028207A (en) * | 2019-11-22 | 2020-04-17 | 东华大学 | Button flaw detection method based on brain-like immediate-universal feature extraction network |
CN112614131A (en) * | 2021-01-10 | 2021-04-06 | 复旦大学 | Pathological image analysis method based on deformation representation learning |
CN113887610A (en) * | 2021-09-29 | 2022-01-04 | 内蒙古工业大学 | Pollen image classification method based on cross attention distillation transducer |
CN114092819A (en) * | 2022-01-19 | 2022-02-25 | 成都四方伟业软件股份有限公司 | Image classification method and device |
WO2023221951A2 (en) * | 2022-05-14 | 2023-11-23 | 北京大学 | Cell differentiation based on machine learning using dynamic cell images |
CN117195148A (en) * | 2023-09-07 | 2023-12-08 | 西安科技大学 | Ore emotion recognition method based on expression, electroencephalogram and voice multi-mode fusion |
CN117350925A (en) * | 2023-10-31 | 2024-01-05 | 云南电网有限责任公司电力科学研究院 | Inspection image infrared visible light image fusion method, device and equipment |
Non-Patent Citations (6)
Title |
---|
JIA, LY et al.: "A Parallel Convolution and Decision Fusion-Based Flower Classification Method", Mathematics, vol. 10, no. 15, 31 August 2022 (2022-08-31), pages 1-15 *
LI, R et al.: "Endoscopic Segmentation of Kidney Stone based on Transfer Learning", 2021 Proceedings of the 40th Chinese Control Conference (CCC), 1 January 2021 (2021-01-01), pages 8145-8150 *
LIU, J et al.: "A Method for Plant Diseases Detection Based on Transfer Learning and Data Enhancement", 2022 International Conference on High Performance Big Data and Intelligent Systems (HDIS), 17 March 2022 (2022-03-17), pages 154-8 *
QING LI: "Deep Hierarchical Semantic Segmentation Algorithm Based On Image Information Entropy", ICIC Express Letters, Part B: Applications, vol. 11, no. 01, 31 January 2020 (2020-01-31), pages 25-32 *
YAN, Chunman et al.: "Small ship target detection algorithm for SAR images based on feature enhancement" (基于特征增强的SAR图像舰船小目标检测算法), Control and Decision (控制与决策), vol. 38, no. 01, 8 October 2021 (2021-10-08), pages 239-247 *
CHEN, Chengjun et al.: "CNN model compression based on an activation-entropy hierarchical iterative pruning strategy" (基于激活-熵的分层迭代剪枝策略的CNN模型压缩), Journal of Computer Applications (计算机应用), vol. 40, no. 05, 13 April 2020 (2020-04-13), pages 1260-1265 *
Also Published As
Publication number | Publication date |
---|---|
CN115099294B (en) | 2024-07-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106845421B (en) | Face feature recognition method and system based on multi-region feature and metric learning | |
CN112818862B (en) | Face tampering detection method and system based on multi-source clues and mixed attention | |
CN112686331B (en) | Forged image recognition model training method and forged image recognition method | |
Shojae Chaeikar et al. | PSW statistical LSB image steganalysis | |
WO2023000872A1 (en) | Supervised learning method and apparatus for image features, device, and storage medium | |
KR102036957B1 (en) | Safety classification method of the city image using deep learning-based data feature | |
CN112949572A (en) | Slim-YOLOv 3-based mask wearing condition detection method | |
CN111160286B (en) | Video authenticity identification method | |
CN110929745A (en) | Neural network based classification method and classification device thereof | |
Laishram et al. | A novel minimal distortion-based edge adaptive image steganography scheme using local complexity: (BEASS) | |
CN114187311A (en) | Image semantic segmentation method, device, equipment and storage medium | |
CN112257758A (en) | Fine-grained image recognition method, convolutional neural network and training method thereof | |
CN114511576B (en) | Image segmentation method and system of scale self-adaptive feature enhanced deep neural network | |
CN114444566B (en) | Image forgery detection method and device and computer storage medium | |
CN110348320B (en) | Face anti-counterfeiting method based on multi-loss depth fusion | |
CN110555439A (en) | identification recognition method, training method and device of model thereof and electronic system | |
CN114549913A (en) | Semantic segmentation method and device, computer equipment and storage medium | |
CN111079864A (en) | Short video classification method and system based on optimized video key frame extraction | |
CN114500396A (en) | MFD chromatographic characteristic extraction method and system for distinguishing anonymous Tor application flow | |
US20070016576A1 (en) | Method and apparatus for blocking objectionable multimedia information | |
WO2021213188A1 (en) | Training method and apparatus for image processing model, and image processing method and apparatus | |
CN116342931A (en) | Fur image classification method, system and storage medium for multi-scale attention | |
CN113011307A (en) | Face recognition identity authentication method based on deep residual error network | |
CN113033371B (en) | Multi-level feature fusion pedestrian detection method based on CSP model | |
CN115099294B (en) | Flower image classification algorithm based on feature enhancement and decision fusion |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |