Specific embodiment
The technical scheme of the multi-source image target association method based on improved dictionary learning proposed by the present invention includes the following steps:
Step 1: Data collection and preprocessing stage. Collect multi-source image data of the same scene or the same target, manually label the multi-source image data set, and store the image data together with the manual labeling results to form the labeled multi-source image target association raw data set S.
Step 1.1: The multi-source images mainly include two widely collected types among visible-light images, multispectral images, infrared images, and SAR images; it must be guaranteed that the multi-source images used for target association depict the same scene and contain the same target or corresponding targets.
Step 1.2: The labeled multi-source image target association raw data set is S = {(y_i, y_j', a_ij), i ∈ (0, N), j ∈ (0, N')}, where y_i and y_j' denote the data representations of any two information sources and a_ij = 0 or 1 is the manual labeling result: a_ij = 0 indicates that y_i and y_j' are unassociated targets, and a_ij = 1 indicates that y_i and y_j' are associated targets. N and N' are the numbers of elements contained in the two multi-source image sets, respectively.
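A minimal sketch of how such a labeled raw data set S can be assembled in practice (function and variable names are illustrative and not part of the described method):

    import numpy as np

    def build_raw_dataset(source1_images, source2_images, label_fn):
        """Form S = {(y_i, y_j', a_ij)}; label_fn(i, j) returns the manual label a_ij."""
        S = []
        for i, y_i in enumerate(source1_images):
            for j, y_j in enumerate(source2_images):
                S.append((np.asarray(y_i, dtype=np.float32).ravel(),
                          np.asarray(y_j, dtype=np.float32).ravel(),
                          label_fn(i, j)))  # a_ij = 1 (associated) or 0 (unassociated)
        return S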
Step 2: Unified dictionary generation stage. Apply the improved dictionary learning method to all image data in the raw data set S, and generate the common multi-source image dictionary D = [d_1, d_2, ..., d_K].
Step 2.1: Dictionary learning takes the densely represented samples Y = [y_1, ..., y_N] ∈ R^{n×N} and finds an over-complete dictionary D = [d_1, ..., d_K] ∈ R^{n×K} (K > n) that expresses the original dense samples as linear combinations of the dictionary elements, while ensuring that the coefficients of the linear combination X = [x_1, ..., x_N] ∈ R^{K×N} are as sparse as possible and the degree of information loss is controlled; this simplifies the learning task and reduces model complexity, as shown in formula (1).
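Formula (1) is not reproduced in the text; a standard sparse dictionary learning objective consistent with this description is:

\min_{D,X}\; \|Y - DX\|_F^2 + \alpha \sum_{i=1}^{N} \|x_i\|_1, \qquad \text{s.t. } \|d_k\|_2 \le 1,\ k = 1,\dots,K \quad (\text{cf. formula (1)})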
Step 2.2: To give the obtained sparse coefficients a certain classification capability, dictionary learning is further improved: a label-pair constraint is added to its objective function, yielding the objective function of the improved dictionary learning, as shown in formula (2).
Here α and β are hyperparameters that control the weights of the error terms; the third term is the constraint term that gives the sparse coefficients classification capability. When the label of a target pair (y_i, y_j') is associated targets, i.e. the same category, M_ij = 1; when the label of the target pair (y_i, y_j') is unassociated targets, i.e. different categories, M_ij = -1; in other cases, for example when (y_i, y_j') is the same target, i.e. i = j, M_ij = 0, as shown in formula (3). N = diag{N_1, ..., N_N} is a diagonal matrix whose diagonal elements are the sums of the corresponding columns of the matrix M, and L = N - M.
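Formulas (2) and (3) are not reproduced in the text; a sketch of the improved objective, under the assumption that the label constraint enters as a graph-Laplacian term built from M and L = N - M, is:

\min_{D,X}\; \|Y - DX\|_F^2 + \alpha \sum_{i} \|x_i\|_1 + \beta\, \mathrm{tr}(X L X^{\mathsf T}) \quad (\text{cf. formula (2)})

M_{ij} = \begin{cases} 1 & (y_i, y_j') \text{ associated (same category)} \\ -1 & (y_i, y_j') \text{ unassociated (different categories)} \\ 0 & \text{otherwise, e.g. } i = j \end{cases} \quad (\text{cf. formula (3)})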
Step 2.3: Objective function (2) contains the two parameters D and X, and optimizing D and X simultaneously is a non-convex optimization problem; therefore one parameter is fixed while the other is solved, so the optimization of the objective function can be converted into the alternating iterative optimization of the following two formulas (4) and (5):
In formula (4) the dictionary D is fixed and the sparse coefficients X are optimized one by one; in formula (5) the sparse coefficients are fixed and the dictionary is optimized. The optimization of formula (5) can be carried out with K-SVD, updating the dictionary elements and the corresponding sparse coefficients one by one. For formula (4), the presence of the L1 norm makes the objective function not continuously differentiable, so the feature-sign search algorithm is used: the sign vector θ of the sparse coefficient is generated iteratively, converting the objective function into a standard, unconstrained quadratic optimization problem. The gradient of formula (4) is therefore computed as shown in formula (6); setting the gradient of formula (6) equal to zero yields the sparse coefficient result, as shown in formula (7).
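Formulas (4)-(7) are not reproduced in the text; a sketch consistent with the description, assuming the per-sample subproblem keeps only the reconstruction and L1 terms, is:

\min_{x_i}\; \|y_i - D x_i\|_2^2 + \alpha \|x_i\|_1 \quad (\text{cf. formula (4)})

\min_{D}\; \|Y - D X\|_F^2 \quad (\text{cf. formula (5), solved atom by atom with K-SVD})

With the sign vector θ fixed, \|x_i\|_1 = \theta^{\mathsf T} x_i, so the gradient of the quadratic objective is

\nabla_{x_i} = 2 D^{\mathsf T}(D x_i - y_i) + \alpha\,\theta \quad (\text{cf. formula (6)})

and setting it to zero gives

x_i = (D^{\mathsf T} D)^{-1}\bigl(D^{\mathsf T} y_i - \tfrac{\alpha}{2}\,\theta\bigr) \quad (\text{cf. formula (7)})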
Step 2.3.1: When the dictionary is generated for formula (5) with the K-SVD algorithm, each round of iteration updates only one dictionary element d_k while keeping the other elements fixed, finding the best d_k and updating the corresponding sparse coefficients x_T^k (the k-th row of the coefficient matrix X, corresponding to the k-th dictionary element d_k) so that formula (5) reaches its minimum. As shown in formula (8), directly applying singular value decomposition (SVD) to E_k to update d_k and x_T^k may cause the sparse coefficients x_T^k to lose their sparsity, i.e. the positions of the nonzero elements of the updated x_T^k become inconsistent with those before the update. Therefore only the nonzero values of the sparse coefficients x_T^k are retained, and E_k keeps only the columns corresponding to the nonzero positions of x_T^k, forming E_k^R. Applying singular value decomposition to E_k^R gives E_k^R = UΔV^T; d_k is updated to the first column of U, and the product of the first column of V with Δ(1,1) is taken as the updated value of x_T^k restricted to its nonzero positions, completing the update of one dictionary element.
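A minimal numpy sketch of this single-atom update, assuming E_k is the residual with atom k's contribution removed as in standard K-SVD (formula (8) is not reproduced in the text; variable names are illustrative):

    import numpy as np

    def ksvd_update_atom(Y, D, X, k):
        """Update dictionary atom d_k and the k-th row of X (Y: n x N, D: n x K, X: K x N)."""
        omega = np.nonzero(X[k, :])[0]            # samples that actually use atom k
        if omega.size == 0:
            return D, X                           # atom unused this round; leave unchanged
        # Residual without atom k's contribution, restricted to the nonzero positions.
        E_k = Y - D @ X + np.outer(D[:, k], X[k, :])
        E_kR = E_k[:, omega]
        U, s, Vt = np.linalg.svd(E_kR, full_matrices=False)
        D[:, k] = U[:, 0]                         # d_k <- first column of U
        X[k, omega] = s[0] * Vt[0, :]             # restricted row <- Delta(1,1) * first column of V
        return D, X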
Step 2.3.2: For formula (4), the feature-sign search algorithm iteratively updates an active set to generate the signs of the sparse coefficients; the elements of the active set are the indices of the sparse coefficients. The algorithm specifically includes the following steps:
Step 2.3.2.1: Initialization. The sign vector θ := 0, where each θ_i ∈ {-1, 0, 1} indicates the sign of the corresponding sparse coefficient x_i, and the active set activeset := {} is the empty set.
Step 2.3.2.2: Set the sign of each nonzero sparse coefficient and the elements of the active set, selecting as active-set elements the indices of sparse coefficients whose activation helps to reduce objective function (4), that is:
When ∂||y - Dx||_2^2 / ∂x_i is greater than the L1 penalty weight, set θ_i = -1 and activeset := {i} ∪ activeset;
When ∂||y - Dx||_2^2 / ∂x_i is less than the negative of the L1 penalty weight, set θ_i = 1 and activeset := {i} ∪ activeset;
Step 2.3.2.3: After the active set is generated, the elements of the dictionary D, the sparse coefficient x, and the sign vector θ whose indices are selected into the active set are taken to form the corresponding subsets D̂, x̂, and θ̂; formula (4) can then be converted into an unconstrained quadratic optimization problem, as shown in formula (9).
Setting the derivative of formula (9) equal to 0 yields the optimal x̂.
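A minimal numpy sketch of this restricted solve (the closed form follows from setting the derivative of the quadratic objective of formula (9) to zero; α stands for the L1 penalty weight and the names are illustrative):

    import numpy as np

    def solve_active_set(y, D, theta, active, alpha):
        """Optimal coefficients over the active set for a fixed sign vector theta."""
        D_hat = D[:, active]          # dictionary columns selected by the active set
        theta_hat = theta[active]     # corresponding signs
        # Stationarity of ||y - D_hat x||^2 + alpha * theta_hat^T x
        x_hat = np.linalg.solve(D_hat.T @ D_hat,
                                D_hat.T @ y - 0.5 * alpha * theta_hat)
        return x_hat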
Step 3: Representation learning stage. Use the common multi-source image dictionary D to compute a unified sparse representation of the multi-source image data in the multi-source image target association raw data set S, generate the sparse coefficient set X of the multi-source image data, and together with the label information in the raw data set form the multi-source image target association discrimination data set S'.
Step 3.1: Use the dictionary D generated by the improved dictionary learning in Step 2 to compute sparse representations of the multi-source image data in the multi-source image target association raw data set S; using formula (4) with the dictionary D fixed, the sparse coefficients X are optimized one by one, generating the sparse coefficient of each multi-source image datum.
Step 3.2: Combining the label information in the raw data set S with the sparse coefficient of each multi-source image datum, the multi-source image target association discrimination data set S' = {(x_i, x_j', a_ij), i ∈ (0, N), j ∈ (0, N')} is formed, where x_i and x_j' are the sparse representations of the two information sources generated in Step 3.1, and a_ij = 0 or 1 is the manual labeling result from the original data set S: a_ij = 0 indicates that x_i and x_j' are unassociated targets, and a_ij = 1 indicates that x_i and x_j' are associated targets.
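A minimal sketch of Steps 3.1-3.2, using scikit-learn's sparse_encode as a stand-in for the per-sample optimization of formula (4) (the label-constraint term is omitted here; function and variable names are illustrative):

    import numpy as np
    from sklearn.decomposition import sparse_encode

    def build_discrimination_set(S_raw, D_rows, alpha=0.1):
        """S_raw: list of (y_i, y_j', a_ij) pairs.
        D_rows: dictionary with atoms as rows, shape (K, n),
        i.e. the transpose of D in R^{n x K} from Step 2."""
        S_prime = []
        for y_i, y_j, a_ij in S_raw:
            # Sparse codes with the common dictionary fixed (Lasso-style solve).
            x_i = sparse_encode(y_i.reshape(1, -1), D_rows,
                                algorithm='lasso_lars', alpha=alpha)[0]
            x_j = sparse_encode(y_j.reshape(1, -1), D_rows,
                                algorithm='lasso_lars', alpha=alpha)[0]
            S_prime.append((x_i, x_j, a_ij))
        return S_prime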
Step 4: Classification learning stage. Establish the multi-source image target association discrimination model: a multilayer neural network model is trained on the data and labels in data set S' to discriminate multi-source image target association, yielding the neural network association discrimination model.
Step 4.1: The network structure of the multilayer neural network model uses a three-layer fully connected structure. The input layer is the L2 distance between the sparse coefficients of any two multi-source image data, as shown in formula (10), where n is the feature dimension, here 256; the second layer has 512 nodes and the third layer has 256 nodes. Except for the output layer, a dropout strategy is applied between every pair of layers to avoid overfitting.
Step 4.2: Since the problem of whether two targets are associated is converted into a binary classification discrimination problem, the number of output-layer nodes is set to 1, and the activation function of the output layer of the multilayer neural network uses the sigmoid function, as shown in formula (11). The output is a number in the range 0 to 1, where 1 represents that the two image data are associated and 0 represents that they are unassociated; the closer the output is to 1, the greater the association probability, and conversely, the smaller the association probability. The activation function of each middle layer of the network structure uses the ReLU function, as shown in formula (12).
sigmoid(w^T x + b) = 1 / (1 + exp[-(w^T x + b)])    (11)
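A minimal PyTorch sketch of the discrimination network described in Steps 4.1 and 4.2 (layer sizes follow the text; the elementwise squared-difference input is an assumption standing in for the L2-distance input of formula (10), and the dropout rate is illustrative):

    import torch
    import torch.nn as nn

    class AssociationNet(nn.Module):
        def __init__(self, n=256, dropout=0.5):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(n, 512), nn.ReLU(), nn.Dropout(dropout),    # second layer, 512 nodes
                nn.Linear(512, 256), nn.ReLU(), nn.Dropout(dropout),  # third layer, 256 nodes
                nn.Linear(256, 1), nn.Sigmoid(),                      # 1-node output, formula (11)
            )

        def forward(self, x_i, x_j):
            d = (x_i - x_j) ** 2            # per-dimension distance between sparse coefficients
            return self.net(d).squeeze(-1)  # association probability in (0, 1)

    # Usage sketch: binary cross-entropy training on labeled pairs from S'.
    model = AssociationNet()
    criterion = nn.BCELoss()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)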