CN103617431B

CN103617431B - Maximum average entropy-based scale-invariant feature transform (SIFT) descriptor binaryzation and similarity matching method

Info

Publication number: CN103617431B
Application number: CN201310539961.5A
Authority: CN
Inventors: 毋立芳; 侯亚希; 周鹏; 许晓; 曹航明; 颜凤辉; 曹瑜; 漆薇
Original assignee: Langzhao Technology Beijing Co ltd
Current assignee: Langzhao Technology Beijing Co ltd
Priority date: 2013-11-05
Filing date: 2013-11-05
Publication date: 2017-01-18
Anticipated expiration: 2033-11-05
Also published as: CN103617431A

Abstract

The invention relates to a maximum average entropy-based scale-invariant feature transform (SIFT) descriptor binaryzation and similarity matching method and belongs to the image matching field. Scale-invariant feature transform (SIFT) operators have strong matching ability, while, the scale-invariant feature transform (SIFT) operators will bring a huge amount of data, and therefore, binaryzation should be performed on the scale-invariant feature transform (SIFT) operators, however, if unified binaryzation is performed on all the operators, data redundancy or information loss will be brought about. The maximum average entropy-based scale-invariant feature transform (SIFT) descriptor binaryzation and similarity matching method of the invention comprises the following steps that: binaryzation is performed on the scale-invariant feature transform (SIFT) operators; average entropy calculation is performed on each layer of binaryzation results, such that different numbers of binaryzation layers are selected adaptively; new binaryzation descriptors are provided; the Hamming distance is utilized to replace the Euclidean distance so as to calculate the distance between two descriptors; and the distance between the two descriptors is compared with a set threshold value. With the maximum average entropy-based scale-invariant feature transform (SIFT) descriptor binaryzation and similarity matching method of the invention adopted, information of original features is reserved; the amount of data storage can be greatly reduced; computational complexity can be reduced; a requirement for a real-time property can be realized better; the matching results equivalent to original scale-invariant feature transform (SIFT) descriptors can be obtained and are far superior to results of matching by using the unified binaryzation.

Description

SIFT Descriptor Binarization and Similarity Matching Method Based on Maximum Bit Average Entropy

技术领域technical field

本发明涉及图像匹配技术领域，具体涉及一种基于最大位平均熵的SIFT描述子二值化方法及其相似度匹配方案。The invention relates to the technical field of image matching, in particular to a SIFT descriptor binarization method based on maximum bit average entropy and a similarity matching scheme thereof.

背景技术Background technique

随着现代计算机技术的发展，人脸识别技术在安全认证、人机交流、公安系统等方面得到了广泛的使用，并且在视频会议、档案管理、医学医疗等方面也发挥着很大的作用。在美国911恐怖袭击事件以及网络CSDN用户信息遭泄露事件发生之后，生物特征识别技术更加受到大家关注，而人脸生物特征的识别一直是生物特征识别领域研究的热点，人脸识别在可控的情况下可以获得很好的识别性能，但在实际应用中，人脸识别往往受到很多因素影响，当人脸姿态发生变化，表情发生变化，外界光照发生变化，人脸存在遮挡(戴围巾，墨镜)等情况时，人脸识别的性能将会下降很多，这就制约了人脸识别在实际中的应用。因此，很多的研究学者致力于人脸识别方法的研究，各种人脸识别方法层出不穷。With the development of modern computer technology, face recognition technology has been widely used in security authentication, human-computer communication, public security system, etc., and also plays a great role in video conferencing, file management, medical treatment, etc. After the 911 terrorist attacks in the United States and the leakage of network CSDN user information, biometric technology has attracted more attention, and the recognition of facial biometrics has always been a hot spot in the field of biometrics. However, in practical applications, face recognition is often affected by many factors. When the face posture changes, the expression changes, the external light changes, and the face is blocked (wearing a scarf, sunglasses, etc.) ) and other situations, the performance of face recognition will drop a lot, which restricts the practical application of face recognition. Therefore, many researchers are devoted to the research of face recognition methods, and various face recognition methods emerge in endlessly.

SIFT(Scale-Invariant Feature Transform)特征匹配算法是目前国内外特征点匹配算法研究的热点与难点，它是在1999年由加拿大的David G.Lowe提出初步思想的局部特征描述子，并于2004年在原基础上进行了更深入的发展并加以完善。SIFT描述子是一种图像的局部描述子，具有尺度、旋转、平移的不变性，而且对光照变化、仿射变换和3维投影变换具有一定的鲁棒性。在Mikolajczyk对包括SIFT描述子在内的十种局部描述子所做的不变性对比实验中，SIFT及其扩展算法已被证实在同类描述子中具有最强的健壮性。SIFT描述子匹配能力较强，对大多数图像变换具备很强的不变性，特别适合于处理两幅图像间发生平移、旋转、仿射变换时的匹配问题，其稳定的特征匹配能力甚至可对任意角度拍摄的图像应用。SIFT特征还具有很好的独特性，适于在海量特征数据库中进行比较快速、准确的匹配。但是，用SIFT来表示人脸图像的数据量是很巨大的。一般一幅人脸图像有3000个SIFT点，每个SIFT由128个描述子构成，每个描述子由8比特表示，总的数据量为3072000比特。Wang等人提出了二值化SIFT描述子的思想。本发明是在对SIFT描述子进行二值化的时候，以最大位平均熵为准则来确定二值化的层数，形成新的二值化描述子，然后用汉明距离代替欧式距离计算两个描述子之间的距离，将其跟设定阈值进行比较，最终得到匹配结果。这样极大的减少了数据量，降低了计算的复杂度，可以更好实现实时性的要求。并且二值化后的描述子在进行匹配运算时可以得到原始SIFT描述子基本一致的匹配结果，远远优于将SIFT描述子进行统一二值化进行匹配的结果。本发明提出了一种基于最大位平均熵的SIFT描述子二值化方法及相似度匹配方案。The SIFT (Scale-Invariant Feature Transform) feature matching algorithm is a hot spot and difficulty in the research of feature point matching algorithms at home and abroad. It is a local feature descriptor proposed by David G. Lowe of Canada in 1999, and was developed in 2004 On the original basis, it has been further developed and improved. The SIFT descriptor is a local descriptor of an image, which is invariant to scale, rotation, and translation, and has certain robustness to illumination changes, affine transformations, and 3D projection transformations. In Mikolajczyk's invariance comparison experiments on ten kinds of local descriptors including SIFT descriptors, SIFT and its extended algorithm have been proved to have the strongest robustness among similar descriptors. The SIFT descriptor has strong matching ability, and has strong invariance to most image transformations. It is especially suitable for dealing with the matching problems when translation, rotation, and affine transformation occur between two images. Its stable feature matching ability can even be used for Application of images taken from any angle. SIFT features also have good uniqueness, which is suitable for relatively fast and accurate matching in massive feature databases. However, the amount of data to represent face images with SIFT is huge. Generally, a face image has 3000 SIFT points, each SIFT consists of 128 descriptors, each descriptor is represented by 8 bits, and the total data volume is 3072000 bits. Wang et al. proposed the idea of binarizing SIFT descriptors. In the present invention, when performing binarization on SIFT descriptors, the number of layers of binarization is determined with the maximum bit average entropy as a criterion to form a new binarization descriptor, and then the Hamming distance is used to replace the Euclidean distance to calculate two The distance between each descriptor is compared with the set threshold, and finally the matching result is obtained. In this way, the amount of data is greatly reduced, the complexity of calculation is reduced, and the requirement of real-time performance can be better realized. And the binarized descriptor can obtain the matching result that is basically the same as the original SIFT descriptor when performing the matching operation, which is far better than the result of uniformly binarizing the SIFT descriptor for matching. The invention proposes a SIFT descriptor binarization method and a similarity matching scheme based on maximum bit average entropy.

发明内容Contents of the invention

本发明提供了一种基于最大位平均熵准则的SIFT描述子二值化方法及相似度匹配方案，该方法可以利用位平均熵最大来有效的确定二值化进行的层数，形成新的二值化描述子，然后用汉明距离代替欧式距离计算两个描述子之间的距离，将其跟设定阈值进行比较，最终得到匹配结果。可以保留大量原始SIFT描述子信息，在很大程度上减少数据量，降低了计算的复杂度，可以更好实现实时性的要求。并且可以保证匹配结果和使用原始SIFT描述子得到的匹配结果基本一致，远远优于将SIFT描述子进行统一二值化进行匹配的结果。The present invention provides a SIFT descriptor binarization method and a similarity matching scheme based on the maximum bit average entropy criterion. The method can use the maximum bit average entropy to effectively determine the number of layers for binarization to form a new binary Value descriptors, and then use Hamming distance instead of Euclidean distance to calculate the distance between two descriptors, compare it with the set threshold, and finally get the matching result. A large amount of original SIFT descriptor information can be retained, the amount of data is reduced to a large extent, the complexity of calculation is reduced, and the requirement of real-time performance can be better realized. And it can guarantee that the matching result is basically the same as the matching result obtained by using the original SIFT descriptor, which is far better than the result of uniform binarization of the SIFT descriptor for matching.

对SIFT描述子进行二值化，如果仅仅进行一层二值化，那么会丢失很多信息，保留原特征的信息量会很少。在进行多层二值化时，包含四种二值化策略：(1)在做完第一层二值化之后，得到0和1，然后一直对0部分进行二值化。(2)在做完第一层二值化之后，得到0和1，选择0部分进行二值化得到0和1，然后一直对1部分进行二值化。(3)在做完第一层二值化之后，得到0和1，选择1部分进行二值化得到0和1，然后一直对0部分进行二值化。(4)在做完第一层二值化之后，得到0和1，然后一直对1部分进行二值化。因为SIFT描述子中包含很多个0，对于每一层二值化结果，一直对1进行二值化的策略信息熵总是最大的，所以采取第四种策略进行多层二值化。熵可以用来度量信息量的大小。SIFT描述子进行二值化层数增加，信息熵会随之增大。但是，与之相随的是数据量的大大增加。而且，如果对提取的所有SIFT描述子进行统一层数的二值化，那么会包含下面两种情况：(1)本身不需要二值化进行到指定层数，位平均熵已经达到最大。比如做两层二值化后的位平均熵是最大的，则只需要保留两层的结果。如果做三层或更多层，会增大数据冗余，冗余的数据可能会引起匹配的错误；(2)在做完指定层数的二值化后，位平均熵还未达到最大，这样会导致二值化后的描述子不能充分的保留原始SIFT描述子携带的信息，会导致误匹配度的增加。但是，从数据量上来考虑，最多保留四层二值化结果。本发明提出了基于最大位平均熵的二值化SIFT描述子方法及相似度匹配方案，通过对每一个描述子进行位平均熵的计算，来自适应的选择不同的二值化层数，形成新的二值化描述子，然后用汉明距离代替欧式距离计算两个描述子之间的距离，将其跟设定阈值进行比较，最终得到匹配结果。本发明提出的算法在很大程度上保留了原始特征的信息并大大减少了数据存储量，降低了计算的复杂度，可以更好实现实时性的要求。并且可以取得等同于原始SIFT描述子的匹配结果，远远优于将SIFT描述子进行统一二值化进行匹配的结果。因此，本发明具有一定的应用价值和意义。To binarize the SIFT descriptor, if only one layer of binarization is performed, a lot of information will be lost, and the amount of information retaining the original features will be very small. When performing multi-layer binarization, four binarization strategies are included: (1) After the first layer of binarization is completed, 0 and 1 are obtained, and then the 0 part is always binarized. (2) After the first layer of binarization is completed, 0 and 1 are obtained, and the 0 part is selected for binarization to obtain 0 and 1, and then the 1 part is always binarized. (3) After the first layer of binarization is completed, 0 and 1 are obtained, and the 1 part is selected for binarization to obtain 0 and 1, and then the 0 part is always binarized. (4) After the first layer of binarization is done, 0 and 1 are obtained, and then the 1 part is always binarized. Because the SIFT descriptor contains many 0s, for each layer of binarization results, the strategy information entropy of binarizing 1 is always the largest, so the fourth strategy is adopted for multi-layer binarization. Entropy can be used to measure the amount of information. As the number of binarization layers of the SIFT descriptor increases, the information entropy will increase accordingly. However, it is accompanied by a great increase in the amount of data. Moreover, if all extracted SIFT descriptors are binarized with a uniform number of layers, the following two situations will be included: (1) Binarization itself does not need to be performed to a specified number of layers, and the average bit entropy has reached the maximum. For example, the average bit entropy after two layers of binarization is the largest, and only the results of the two layers need to be retained. If three or more layers are used, data redundancy will be increased, and redundant data may cause matching errors; (2) After the binarization of the specified number of layers is completed, the average bit entropy has not yet reached the maximum, This will lead to the fact that the binarized descriptor cannot fully retain the information carried by the original SIFT descriptor, which will lead to an increase in the degree of false matching. However, considering the amount of data, up to four layers of binarization results are retained. The present invention proposes a binarized SIFT descriptor method and a similarity matching scheme based on maximum bit average entropy. By calculating bit average entropy for each descriptor, different binarization layers are adaptively selected to form a new The binary descriptor, and then use the Hamming distance instead of the Euclidean distance to calculate the distance between the two descriptors, compare it with the set threshold, and finally get the matching result. The algorithm proposed by the invention retains the original feature information to a large extent, greatly reduces the amount of data storage, reduces the complexity of calculation, and can better realize the real-time requirement. And the matching result equivalent to the original SIFT descriptor can be obtained, which is far better than the result of uniform binarization of the SIFT descriptor for matching. Therefore, the present invention has certain application value and significance.

为了实现上述问题，本发明提出了一种基于最大位平均熵的SIFT描述子二值化的方法及相似度匹配方案，该方法具体包括：In order to realize the above-mentioned problem, the present invention proposes a kind of method and similarity matching scheme based on the SIFT descriptor binarization of maximum bit average entropy, and this method specifically comprises:

A、二值化阶段，对于每一幅人脸图像，首先提取SIFT特征描述子，然后对SIFT描述子进行多层二值化，求得各个层次的信息熵，根据各层0和1出现的概率和保留的总比特数，求得位平均熵，然后找出最大位平均熵对应的二值化层数，保留这几层二值化结果，形成新的二值化特征描述子。A. In the binarization stage, for each face image, first extract the SIFT feature descriptor, and then perform multi-layer binarization on the SIFT descriptor to obtain the information entropy of each level, according to the appearance of 0 and 1 in each layer The probability and the total number of reserved bits are used to obtain the average bit entropy, and then the number of binarization layers corresponding to the maximum bit average entropy is found, and the binarization results of these layers are retained to form a new binary feature descriptor.

B、匹配阶段，对于任意两幅人脸图像，分别提取其新的二值化特征描述子之后，然后用汉明距离代替欧式距离计算两个描述子之间的距离，将其跟设定阈值进行比较，如果汉明距离小于等于设定阈值，则认为匹配成功；否则，则认为匹配不成功。对于提取的不同层次的二值化特征描述子，需要选择不同的阈值。B. In the matching stage, for any two face images, after extracting their new binary feature descriptors, the Hamming distance is used instead of the Euclidean distance to calculate the distance between the two descriptors, which is compared with the set threshold For comparison, if the Hamming distance is less than or equal to the set threshold, the matching is considered successful; otherwise, the matching is considered unsuccessful. For the extracted binary feature descriptors of different levels, different thresholds need to be selected.

所述步骤A具体包括：Described step A specifically comprises:

A1、对于每一幅人脸图像，首先提取SIFT特征描述子；A1, for each face image, first extract the SIFT feature descriptor;

A2、对于每一幅人脸图像的每一个SIFT描述子进行二值化；A2, perform binarization for each SIFT descriptor of each face image;

A3、统计0和1的个数，分别为n₁₀和n₁₁；A3, count the number of 0 and 1, respectively n ₁₀ and n ₁₁ ;

A4、标记为1的那些字节对应的描述子，继续进行二值化，统计此时0和1的个数，分别为n₂₀和n₂₁，继续进行上述二值化过程，依次得到n₃₀和n₃₁，n₄₀和n₄₁ A4. The descriptors corresponding to those bytes marked as 1 continue to be binarized, and the numbers of 0 and 1 are counted at this time, which are n ₂₀ and n ₂₁ respectively. Continue the above binarization process to obtain n ₃₀ in turn and n ₃₁ , n ₄₀ and n ₄₁

A5、分别计算做一层、二层、三层、四层二值化的信息熵；A5. Calculate the information entropy of binarization of the first layer, the second layer, the third layer, and the fourth layer respectively;

A6、根据二值化后保留的总比特数，位平均熵可以由信息熵除以比特数得到；A6. According to the total number of bits retained after binarization, the average bit entropy can be obtained by dividing the information entropy by the number of bits;

A7、求得最大位平均熵所对应的层数，保留对应层数的二值化结果，形成新的二值化描述子。A7. Obtain the number of layers corresponding to the maximum bit average entropy, retain the binarization results of the corresponding layers, and form a new binarization descriptor.

所述步骤B具体包括：Described step B specifically comprises:

B1、对于任意两幅人脸图像，提取其新的二值化描述子；B1. For any two face images, extract their new binary descriptors;

B2、对于每两个描述子，计算其汉明距离dis_H；B2. For every two descriptors, calculate the Hamming distance dis _H ;

B3、提取不同层次的二值化描述子，选择的阈值T也不相同，做一层、二层、三层和四层二值化的阈值分别为：T₁，T₂，T₃，T₄ B3. To extract binary descriptors of different levels, the selected threshold T is also different. The thresholds for one-layer, two-layer, three-layer and four-layer binarization are: T ₁ , T ₂ , T ₃ , T ₄

B4、如果dis_H小于等于阈值，则认为匹配成功；否则，认为匹配不成功。本发明与现有技术相比，具有以下明显的优势和有益效果：B4. If dis _H is less than or equal to the threshold, it is considered that the matching is successful; otherwise, it is considered that the matching is unsuccessful. Compared with the prior art, the present invention has the following obvious advantages and beneficial effects:

(1)本发明对SIFT描述子进行二值化，是根据最大位平均熵原则来自适应的根据每一个描述子来确定二值化层数，与进行统一二值化相比，不仅减少了数据冗余，而且在很大程度上保留原始SIFT描述子携带的信息。(1) The present invention carries out binarization to SIFT descriptor, is to determine binarization layer number according to each descriptor self-adaptively according to the principle of maximum bit average entropy, compared with carrying out unified binarization, not only reduces Data redundancy, and to a large extent retain the information carried by the original SIFT descriptor.

(2)本发明自适应得到新的二值化描述子，通过汉明距离来进行匹配，计算简单快捷，复杂度降低，可以更好的满足实时性的要求。(2) The present invention self-adaptively obtains a new binary descriptor, and performs matching through Hamming distance, which is simple and quick to calculate, reduces complexity, and can better meet real-time requirements.

(3)本发明通过最大位平均熵得到的二值化SIFT描述子来进行匹配，经实验分析，匹配结果等同于原始SIFT描述子的匹配结果，远远优于将SIFT描述子进行统一二值化进行匹配得到的结果。(3) The present invention matches through the binarized SIFT descriptor obtained by the maximum bit average entropy. Through experimental analysis, the matching result is equal to the matching result of the original SIFT descriptor, which is far superior to unifying the SIFT descriptor. The result obtained by value-based matching.

附图说明：Description of drawings:

图1是技术方案的整体流程图。Fig. 1 is an overall flow chart of the technical solution.

图2(a)是二值化策略1(左左)Figure 2(a) is binarization strategy 1 (left left)

图2(b)是二值化策略1(左右)Figure 2(b) is binarization strategy 1 (left and right)

图2(c)是二值化策略1(右左)Figure 2(c) is binarization strategy 1 (right left)

图2(d)是二值化策略1(右右)Figure 2(d) is binarization strategy 1 (right right)

图3是相似度匹配结果比较图。Figure 3 is a comparison chart of similarity matching results.

具体实施方式：detailed description:

本发明技术方案的整体流程如说明书附图1所示。我们的方法大大减少了数据存储量，降低了计算的复杂度，可以更好实现实时性的要求。并且可以取得等同于原始SIFT描述子的匹配结果，远远优于将SIFT描述子进行统一二值化进行匹配的结果。The overall flow of the technical solution of the present invention is shown in Figure 1 of the description. Our method greatly reduces the amount of data storage, reduces the complexity of calculation, and can better meet the requirements of real-time performance. And the matching result equivalent to the original SIFT descriptor can be obtained, which is far better than the result of uniform binarization of the SIFT descriptor for matching.

A、对于每一幅人脸图像，首先提取SIFT特征描述子，然后对SIFT描述子进行二值化，求得各个层次的信息熵，根据各层0和1出现的概率和保留的总比特数，求得位平均熵，然后找出最大位平均熵对应的二值化层数，保留这几层二值化结果，最终形成新的二值化描述子。具体步骤包括：A. For each face image, first extract the SIFT feature descriptor, and then binarize the SIFT descriptor to obtain the information entropy of each level, according to the probability of occurrence of 0 and 1 in each layer and the total number of bits reserved , to obtain the bit average entropy, and then find out the number of binarization layers corresponding to the maximum bit average entropy, keep the binarization results of these layers, and finally form a new binarization descriptor. Specific steps include:

A1、对于每一幅人脸图像，首先提取SIFT特征描述子A1. For each face image, first extract the SIFT feature descriptor

D＝(f₁,f₂,…,f₁₂₈)^T∈R¹²⁸ D=(f ₁ ,f ₂ ,…,f ₁₂₈ ) ^T ∈ R ¹²⁸

A2、对于每一幅人脸图像的每一个SIFT描述子进行二值化A2. Binarize each SIFT descriptor of each face image

为向量D的中值 is the median value of the vector D

A5、计算做各层二值化后的信息熵A5. Calculate the information entropy after binarization of each layer

做一层二值化的信息熵为：The information entropy for one layer of binarization is:

H₁＝-P(n₁₀)log₂P(n₁₀)-P(n₁₁)log₂P(n₁₁)H ₁ =-P(n ₁₀ )log ₂ P(n ₁₀ )-P(n ₁₁ )log ₂ P(n ₁₁ )

做二层二值化的信息熵为：The information entropy for two-layer binarization is:

${H h}_{22} = = - - {Σ Σ}_{i i = = 11}^{22} P P (({n no}_{i i 00})) {log log}_{22} P P (({n no}_{i i 00})) - - P P (({n no}_{21 twenty one})) {log log}_{22} P P (({n no}_{21 twenty one}))$

做三层二值化的信息熵为：The information entropy for three-layer binarization is:

${H h}_{33} = = - - {Σ Σ}_{i i = = 11}^{33} P P (({n no}_{i i 00})) {log log}_{22} P P (({n no}_{i i 00})) - - P P (({n no}_{3131})) {log log}_{22} P P (({n no}_{3131}))$

做四层二值化的信息熵为：The information entropy of four-layer binarization is:

${H h}_{44} = = - - {Σ Σ}_{i i = = 11}^{44} P P (({n no}_{i i 00})) {log log}_{22} P P (({n no}_{i i 00})) - - P P (({n no}_{4141})) {log log}_{22} P P (({n no}_{4141}))$

A6、计算经过各层二值化后保留的比特数A6. Calculate the number of bits retained after each layer of binarization

做一层二值化后保留的总字节数为：The total number of bytes reserved after one layer of binarization is:

N₁＝n₁₀+n₁₁ N ₁ =n ₁₀ +n ₁₁

做二层二值化后保留的总字节数为：The total number of bytes reserved after the second layer binarization is:

${N N}_{22} = = {Σ Σ}_{i i = = 11}^{22} {n no}_{i i 00} + + {n no}_{21 twenty one}$

做三层二值化后保留的总字节数为：The total number of bytes reserved after three layers of binarization is:

${N N}_{33} = = {Σ Σ}_{i i = = 11}^{33} {n no}_{i i 00} + + {n no}_{3131}$

做四层二值化后保留的总字节数为：The total number of bytes reserved after four layers of binarization is:

${N N}_{44} = = {Σ Σ}_{i i = = 11}^{44} {n no}_{i i 00} + + {n no}_{4141}$

A7、位平均熵可以由信息熵除以位数得到A7. The average bit entropy can be obtained by dividing the information entropy by the number of bits

做一层二值化后的位平均熵为：The average bit entropy after one layer of binarization is:

${\overset{&OverBar; &OverBar;}{H h}}_{11} = = {H h}_{11} / / {N N}_{11} = = [[- - P P (({n no}_{1010})) {log log}_{22} P P (({n no}_{1010})) - - P P (({n no}_{1111})) {log log}_{22} P P (({n no}_{1111}))]] / / (({n no}_{1010} + + {n no}_{1111}))$

做二层二值化后的位平均熵为：The average bit entropy after two-layer binarization is:

${\overset{&OverBar; &OverBar;}{H h}}_{22} = = {H h}_{22} / / {N N}_{22} = = [[- - {Σ Σ}_{i i = = 11}^{22} P P (({n no}_{i i 00})) {log log}_{22} P P (({n no}_{i i 00})) - - P P (({n no}_{21 twenty one})) {log log}_{22} P P (({n no}_{21 twenty one}))]] / / (({Σ Σ}_{i i = = 11}^{22} {n no}_{i i 00} + + {n no}_{21 twenty one}))$

做三层二值化后的位平均熵为：The average bit entropy after three layers of binarization is:

${\overset{&OverBar; &OverBar;}{H h}}_{33} = = {H h}_{33} / / {N N}_{33} = = [[- - {Σ Σ}_{i i = = 11}^{33} P P (({n no}_{i i 00})) {log log}_{22} P P (({n no}_{i i 00})) - - P P (({n no}_{3131})) {log log}_{22} P P (({n no}_{3131}))]] / / (({Σ Σ}_{i i = = 11}^{33} {n no}_{i i 00} + + {n no}_{3131}))$

做四层二值化后的位平均熵为：The average bit entropy after four layers of binarization is:

${\overset{&OverBar; &OverBar;}{H h}}_{44} = = {H h}_{44} / / {N N}_{44} = = [[- - {Σ Σ}_{i i = = 11}^{44} P P (({n no}_{i i 00})) {log log}_{22} P P (({n no}_{i i 00})) - - P P (({n no}_{4141})) {log log}_{22} P P (({n no}_{4141}))]] / / (({Σ Σ}_{i i = = 11}^{44} {n no}_{i i 00} + + {n no}_{4141}))$

A8、求得最大位平均熵所对应的层数，保留对应层数的二值化结果，形成新的二值化描述子。A8. Obtain the number of layers corresponding to the maximum bit average entropy, keep the binarization results of the corresponding layers, and form a new binarization descriptor.

B、匹配阶段，对于任意两幅人脸图像，分别提取其新的二值化特征描述子之后，然后用汉明距离代替欧式距离计算两个描述子之间的距离，将其跟设定阈值进行比较，如果汉明距离小于等于设定阈值，则认为匹配成功；否则，则认为匹配不成功。对于提取的不同层次的二值化特征描述子，需要选择不同的阈值。具体步骤包括：B. In the matching stage, for any two face images, after extracting their new binary feature descriptors, the Hamming distance is used instead of the Euclidean distance to calculate the distance between the two descriptors, and the threshold is set For comparison, if the Hamming distance is less than or equal to the set threshold, the matching is considered successful; otherwise, the matching is considered unsuccessful. For the extracted binary feature descriptors of different levels, different thresholds need to be selected. Specific steps include:

B2、对于每两个描述子x＝[b₁₁,b₁₂,b₁₃,…]和y＝[b₂₁,b₂₂,b₂₃,…]，计算其汉明距离B2. For every two descriptors x=[b ₁₁ ,b ₁₂ ,b ₁₃ ,…] and y=[b ₂₁ ,b ₂₂ ,b ₂₃ ,…], calculate their Hamming distance

${dis dis}_{H h} ((x x,, y the y)) = = x x &CirclePlus; &CirclePlus; y the y$

B3、将得到的汉明距离和阈值进行比较，对于不同层次的二值化描述子，选择的阈值也不相同，设定i层二值化的阈值为T_i(i＝1,2,3,4,T₁＝17；T₂＝28；T₃＝34；T₄＝39)B3. Comparing the obtained Hamming distance with the threshold value, the selected threshold value is also different for the binary descriptors of different levels, and the threshold value of the i-layer binarization is set to T _i (i=1,2,3 ,4, T ₁ =17; T ₂ =28; T ₃ =34; T ₄ =39)

$S S G G N N (({dis dis}_{H h})) = = \{\begin{matrix} 11,, d d i i {s the s}_{H h} \leq \leq {T T}_{i i} \\ 00,, {dis dis}_{H h} > > {T T}_{i i} \end{matrix}$

B4、如果SGN(Dis_H)＝1，则认为匹配成功；否则，认为匹配不成功。B4. If SGN(Dis _H )=1, it is considered that the matching is successful; otherwise, it is considered that the matching is not successful.

匹配结果如说明书附图3中所示，(a)是原始SIFT描述子的匹配结果，匹配个数为1242；(b)是统一二值化后的匹配结果，产生了很多错误的匹配。(c)是本发明方法的匹配结果，匹配个数为1161。本发明方法的匹配结果等同于原始SIFT描述子的匹配结果，远远优于将SIFT描述子进行统一二值化进行匹配得到的结果。The matching results are shown in Figure 3 of the specification, (a) is the matching result of the original SIFT descriptor, and the number of matches is 1242; (b) is the matching result after unified binarization, and many wrong matches have been generated. (c) is the matching result of the method of the present invention, and the matching number is 1161. The matching result of the method of the present invention is equal to the matching result of the original SIFT descriptor, which is far superior to the result obtained by uniformly binarizing the SIFT descriptor for matching.

Claims

1. a kind of sub- binarization method and similarity mode scheme are described based on the sift of dominant bit mean entropy, walk including following Rapid:

A, binaryzation stage, for each width facial image, extract sift Feature Descriptor first, then son is described to sift and enter Row multilamellar binaryzation, tries to achieve comentropy at all levels, the total bit number of the probability being occurred according to each layer 0 and 1 and reservation, tries to achieve Position mean entropy, then finds out the dominant bit mean entropy corresponding binaryzation number of plies, retains this which floor binaryzation result, forms new two Value Feature Descriptor；

Described step a specifically includes:

A1, for each width facial image, extract sift Feature Descriptor first

D=(f₁,f₂,…,f₁₂₈)^t∈r¹²⁸

A2, each sift for each width facial image describe son and carry out binaryzation

Intermediate value for vectorial d

A3, the number of statistics 0 and 1, respectively n₁₀And n₁₁；

A4, be labeled as 1 those bytes corresponding description, proceed binaryzation, statistics now 0 and 1 number, respectively n₂₀And n₂₁, proceed above-mentioned binarization, obtain n successively₃₀And n₃₁, n₄₀And n₄₁；

A5, calculating do the comentropy after each layer binaryzation

The comentropy doing one layer of binaryzation is:

h₁=-p (n₁₀)log₂p(n₁₀)-p(n₁₁)log₂p(n₁₁)

The comentropy doing two layers of binaryzation is:

h_{2} = - σ_{i = 1}^{2} p (n_{i 0}) \log_{2} p (n_{i 0}) - p (n_{21}) \log_{2} p (n_{21})

The comentropy doing three layers of binaryzation is:

h_{3} = - σ_{i = 1}^{3} p (n_{i 0}) \log_{2} p (h_{0}) - p (n_{31}) \log_{2} p (n_{31})

The comentropy doing four layers of binaryzation is:

h_{4} = - σ_{i = 1}^{4} p (n_{i 0}) \log_{2} p (n_{i 0}) - p (n_{41}) \log_{2} p (n_{41})

The bit number that a6, calculating retain after each layer binaryzation

The total bytes retaining after doing one layer of binaryzation are:

n₁=n₁₀+n₁₁

The total bytes retaining after doing two layers of binaryzation are:

n_{2} = σ_{i = 1}^{2} n_{i 0} + n_{21}

The total bytes retaining after doing three layers of binaryzation are:

n_{3} = σ_{i = 1}^{3} n_{i 0} + n_{31}

The total bytes retaining after doing four layers of binaryzation are:

n_{4} = σ_{i = 1}^{4} n_{i 0} + n_{41}

A7, position mean entropy can be obtained divided by digit by comentropy

Doing the position mean entropy after one layer of binaryzation is:

{\overset{&overbar;}{h}}_{1} = h_{1} / n_{1} = [- p (n_{10}) \log_{2} p (n_{10}) - p (n_{11}) \log_{2} p (n_{11})] / (n_{10} + n_{11})

Doing the position mean entropy after two layers of binaryzation is:

{\overset{&overbar;}{h}}_{2} = h_{2} / n_{2} = [- σ_{i = 1}^{2} p (n_{i 0}) \log_{2} p (n_{i 0}) - p (n_{21}) \log_{2} p (n_{21})] / (σ_{i = 1}^{2} n_{i 0} + n_{21})

Doing the position mean entropy after three layers of binaryzation is:

{\overset{&overbar;}{h}}_{3} = h_{3} / n_{3} = [- σ_{i = 1}^{3} p (n_{i 0}) \log_{2} p (n_{i 0}) - p (n_{31}) \log_{2} p (n_{31})] / (σ_{i = 1}^{3} n_{i 0} + n_{31})

Doing the position mean entropy after four layers of binaryzation is:

{\overset{&overbar;}{h}}_{4} = h_{4} / n_{4} = [- σ_{i = 1}^{4} p (n_{i 0}) \log_{2} p (n_{i 0}) - p (n_{41}) \log_{2} p (n_{41})] / (σ_{i = 1}^{4} n_{i 0} + n_{41})

A8, the number of plies tried to achieve corresponding to dominant bit mean entropy retain the binaryzation result of the corresponding number of plies, form new binaryzation and retouch State son；

B, matching stage, for any two width facial images, after extracting its new binaryzation Feature Descriptor respectively, Ran Houyong Hamming distance replaces Euclidean distance to calculate the distance between two description, it is compared with given threshold, if Hamming Distance is less than or equal to given threshold then it is assumed that the match is successful；Otherwise then it is assumed that coupling is unsuccessful；For the different levels extracted Binaryzation Feature Descriptor, need to select different threshold values；

Described step b specifically includes:

B1, for any two width facial images, extract its new binaryzation description；

B2, sub- x=[b is described for each two₁₁,b₁₂,b₁₃...] and y=[b₂₁,b₂₂,b₂₃...], calculate its Hamming distance

{dis}_{h} (x, y) = x &circleplus; y

B3, the Hamming distance obtaining and threshold value are compared, the binaryzation for different levels describes son, the threshold value of selection Differ, set the threshold value of i layer binaryzation as t_i, wherein i=1,2,3,4, t₁=17；t₂=28；t₃=34；t₄=39；

s g n ({dis}_{h}) = \{\begin{matrix} 1, d i s_{h} \leq t_{i} \\ 0, d i s_{h} > t_{i} \end{matrix}

If b4 is sgn (dis_h)=1 is then it is assumed that the match is successful；Otherwise it is assumed that coupling is unsuccessful.