Abstract
Locality sensitive hashing (LSH), one of the most popular hashing techniques, has attracted considerable attention for nearest neighbor search in image retrieval. It achieves promising performance only if the number of generated hash bits is large enough. However, binary codes assembled from many hash bits contain a great deal of redundant information and incur higher time and storage costs. To alleviate this limitation, we propose a novel bit selection framework that picks the important bits out of those generated by hashing techniques. Within this framework, we further exploit eleven evaluation criteria to measure the importance and similarity of each bit generated by LSH, so that bits with high importance and low similarity are selected to assemble new binary codes. To demonstrate the effectiveness of the proposed framework, we evaluated it with these criteria on five commonly used data sets. Experimental results show that the framework works effectively in different cases, and that the performance of LSH is not degraded significantly after redundant hash bits are removed according to the evaluation criteria.
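The general idea of selecting bits with high importance and low mutual similarity can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the eleven criteria, so the sketch assumes bit entropy as a stand-in importance measure, absolute Pearson correlation as a stand-in similarity measure, and a simple greedy selection rule. The function names (`lsh_codes`, `bit_entropy`, `bit_similarity`, `select_bits`) and the `trade_off` parameter are hypothetical.

```python
import math
import random


def lsh_codes(points, n_bits, seed=0):
    """Random-hyperplane LSH: each bit is the sign of a random projection."""
    rng = random.Random(seed)
    dim = len(points[0])
    planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]
    return [
        [1 if sum(w * x for w, x in zip(plane, p)) >= 0 else 0 for plane in planes]
        for p in points
    ]


def bit_entropy(column):
    """Assumed importance measure: Shannon entropy of one bit (1.0 when balanced)."""
    p = sum(column) / len(column)
    if p in (0.0, 1.0):
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def bit_similarity(a, b):
    """Assumed similarity measure: absolute Pearson correlation of two bits."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant bit carries no correlation information
    return abs(cov) / math.sqrt(var_a * var_b)


def select_bits(codes, k, trade_off=1.0):
    """Greedily pick k bit indices, rewarding importance and penalizing
    the largest similarity to any bit that has already been selected."""
    n_bits = len(codes[0])
    columns = [[row[j] for row in codes] for j in range(n_bits)]
    importance = [bit_entropy(c) for c in columns]
    selected = []
    while len(selected) < min(k, n_bits):
        best, best_score = None, -math.inf
        for j in range(n_bits):
            if j in selected:
                continue
            redundancy = max(
                (bit_similarity(columns[j], columns[s]) for s in selected),
                default=0.0,
            )
            score = importance[j] - trade_off * redundancy
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected


# Usage on synthetic data: generate 32-bit codes, then keep 16 selected bits.
rng = random.Random(1)
points = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(200)]
codes = lsh_codes(points, n_bits=32)
keep = select_bits(codes, k=16)
short_codes = [[row[j] for j in keep] for row in codes]
```

The greedy rule is only one way to combine the two kinds of criteria; other importance or similarity measures could be substituted in `bit_entropy` and `bit_similarity` without changing the selection loop.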
Acknowledgments
The authors would like to thank the anonymous referees and the editors for their valuable comments and suggestions, which helped to improve the paper significantly. This work was partially supported by the National Natural Science Foundation of China (NSFC) (61976195) and the Natural Science Foundation of Zhejiang Province (LY18F020019).
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Special Issue on Multi-view Learning
Guest Editors: Guoqing Chao, Xingquan Zhu, Weiping Ding, Jinbo Bi and Shiliang Sun
Cite this article
Zhou, W., Liu, H., Lou, J. et al. Locality sensitive hashing with bit selection. Appl Intell 52, 14724–14738 (2022). https://doi.org/10.1007/s10489-022-03546-9
DOI: https://doi.org/10.1007/s10489-022-03546-9