[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3206025.3206058acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Binary Coding by Matrix Classifier for Efficient Subspace Retrieval

Published: 05 June 2018 Publication History

Abstract

Fast retrieval in large-scale database with high-dimensional subspaces is an important task in many applications, such as image retrieval, video retrieval and visual recognition. This can be facilitated by approximate nearest subspace (ANS) retrieval which requires effective subspace representation. Most of the existing methods for this problem represent subspace by point in the Euclidean space or the Grassmannian space before applying the approximate nearest neighbor (ANN) search. However, the efficiency of these methods can not be guaranteed because the subspace representation step can be very time consuming when coping with high dimensional data. Moreover, the transforming process for subspace to point will cause subspace structural information loss which influence the retrieval accuracy. In this paper, we present a new approach for hashing-based ANS retrieval. The proposed method learns the binary codes for given subspace set following a similarity preserving criterion. It simultaneously leverages the learned binary codes to train matrix classifiers as hash functions. This method can directly binarize a subspace without transforming it into a vector. Therefore, it can efficiently solve the large-scale and high-dimensional multimedia data retrieval problem. Experiments on face recognition and video retrieval show that our method outperforms several state-of-the-art methods in both efficiency and accuracy.

References

[1]
Xiao Bai, Haichuan Yang, Jun Zhou, Peng Ren, and Jian Cheng. 2014. Data-Dependent Hashing Based on p-Stable Distribution. IEEE Transactions on Image Processing Vol. 23, 12 (2014), 5033--5046.
[2]
Ronen Basri, Tal Hassner, and Lihi Zelnikmanor. 2011. Approximate Nearest Subspace Search. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 33, 2 (2011), 266--278.
[3]
Ronen Basri and David W Jacobs. 2003. Lambertian reflectance and linear subspaces. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 25, 2 (2003), 218--233.
[4]
Martin Bauml, Makarand Tapaswi, and Rainer Stiefelhagen. 2013. Semi-supervised Learning with Constraints for Person Identification in Multimedia Data. In Computer Vision and Pattern Recognition. 3602--3609.
[5]
Xinyuan Cai, Chunheng Wang, Baihua Xiao, Xue Chen, and Ji Zhou. 2012. Deep nonlinear metric learning with independent subspace analysis for face verification. In Proceedings of the 2012 ACM International Conference on Multimedia. ACM, 749--752.
[6]
Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab S Mirrokni. 2004. Locality-sensitive hashing scheme based on p-stable distributions Symposium on Computational Geometry. 253--262.
[7]
Simone Ercoli, Marco Bertini, and Alberto Del Bimbo. 2017. Compact Hash Codes for Efficient Visual Descriptors Retrieval in Large Scale Databases. IEEE Transactions on Multimedia Vol. 19, 11 (2017), 2521--2532.
[8]
Athinodoros S Georghiades, Peter N Belhumeur, and David Kriegman. 2001. From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 23, 6 (2001), 643--660.
[9]
Tom Goldstein, Donoghue, and Simon Setzer. 2014. Fast Alternating Direction Optimization Methods. Siam Journal on Imaging Sciences Vol. 7, 3 (2014).
[10]
Gene H Golub and Charles F Van Loan. 1996. Matrix computations. 1996. Johns Hopkins University, Press, Baltimore, MD, USA (1996), 374--426.
[11]
Yunchao Gong and Svetlana Lazebnik. 2013. Iterative quantization: A procrustean approach to learning binary codes Computer Vision and Pattern Recognition. 817--824.
[12]
Xiaofei He. 2004. Incremental semi-supervised subspace learning for image retrieval Proceedings of the 2004 ACM International Conference on Multimedia. ACM, 2--8.
[13]
J. Ji, J. Li, Q. Tian, S. Yan, and B. Zhang. 2015. Angular-Similarity-Preserving Binary Signatures for Linear Subspaces. IEEE Transactions on Image Processing Vol. 24, 11 (2015), 4372--80.
[14]
Yu-Gang Jiang, Jun Wang, Xiangyang Xue, and Shih-Fu Chang. 2013. Query-adaptive image search with hash codes. IEEE Transactions on Multimedia Vol. 15, 2 (2013), 442--453.
[15]
Yin-Hsi Kuo, Kuan-Ting Chen, Chien-Hsing Chiang, and Winston H Hsu. 2009. Query expansion for hash-based image object retrieval Proceedings of the 2009 ACM International Conference on Multimedia. ACM, 65--74.
[16]
Kuangchih Lee, Jeffrey Ho, and David Kriegman. 2005. Acquiring linear subspaces for face recognition under variable lighting. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 27, 5 (2005), 684--698.
[17]
Yan Li, Ruiping Wang, Zhiwu Huang, Shiguang Shan, and Xilin Chen. 2015. Face video retrieval with image query via hashing across Euclidean space and Riemannian manifold. In Computer Vision and Pattern Recognition. 4758--4767.
[18]
Luo Luo, Yubo Xie, Zhihua Zhang, and Wujun Li. 2015. Support Matrix Machines. International Conference on Machine Learning (2015), 938--947.
[19]
Nam Nguyen, Wanquan Liu, and Svetha Venkatesh. 2007. Random subspace two-dimensional PCA for face recognition Pacific-Rim Conference on Multimedia. Springer, 655--664.
[20]
S O'Hara and B. A Draper. 2012. Scalable action recognition with a subspace forest Computer Vision and Pattern Recognition. 1210--1217.
[21]
Fumin Shen, Wei Liu, Shaoting Zhang, Yang Yang, and Heng Tao Shen. 2015. Learning Binary Codes for Maximum Inner Product Search International Conference on Computer Vision. 4148--4156.
[22]
Mei-Ling Shyu, Zongxing Xie, Min Chen, and Shu-Ching Chen. 2008. Video semantic event/concept detection using a subspace-based multimedia data mining framework. IEEE Transactions on Multimedia Vol. 10, 2 (2008), 252--259.
[23]
Litian Sun and Kiyoharu Aizawa. 2013. Action recognition using invariant features under unexampled viewing conditions Proceedings of the 2013 ACM International Conference on Multimedia. ACM, 389--392.
[24]
René Vidal. 2011. Subspace clustering. IEEE Signal Processing Magazine Vol. 28, 2 (2011), 52--68.
[25]
Daixin Wang, Peng Cui, Mingdong Ou, and Wenwu Zhu. 2015. Learning compact hash codes for multimodal representations using orthogonal deep structure. IEEE Transactions on Multimedia Vol. 17, 9 (2015), 1404--1416.
[26]
Xu Wang, Stefan Atev, John Wright, and Gilad Lerman. 2013. Fast Subspace Search via Grassmannian Based Hashing International Conference on Computer Vision. 2776--2783.
[27]
Yair Weiss, Antonio Torralba, and Robert Fergus. 2008. Spectral Hashing. In Advances in Neural Information Processing Systems. 1753--1760.
[28]
L. Wolf, T. Hassner, and I. Maoz. 2011. Face recognition in unconstrained videos with matched background similarity Computer Vision and Pattern Recognition. 529--534.
[29]
Dayan Wu, Zheng Lin, Bo Li, Mingzhen Ye, and Weiping Wang. 2017. Deep Supervised Hashing for Multi-Label and Large-Scale Image Retrieval Proceedings of the 2017 ACM International Conference on Multimedia Retrieval. ACM, 150--158.
[30]
Ruicong Xu, Yang Yang, Fumin Shen, Ning Xie, and Heng Tao Shen. 2017. Efficient Binary Coding for Subspace-based Query-by-Image Video Retrieval Proceedings of the 2017 ACM International Conference on Multimedia. ACM, 1354--1362.
[31]
Mingmin Zhen, Wenmin Wang, and Ronggang Wang. 2016. Regional Subspace Projection Coding for Image Retrieval Proceedings of the 2016 ACM International Conference on Multimedia Retrieval. ACM, 205--212.

Cited By

View all
  • (2023)Generalized Zero-Shot Learning via Implicit Attribute Composition2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC53992.2023.10394354(384-389)Online publication date: 1-Oct-2023
  • (2023)Attribute subspaces for zero-shot learningPattern Recognition10.1016/j.patcog.2023.109869144(109869)Online publication date: Dec-2023
  • (2021)Fast Nearest Subspace Search via Random Angular HashingIEEE Transactions on Multimedia10.1109/TMM.2020.297745923(342-352)Online publication date: 2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval
June 2018
550 pages
ISBN:9781450350464
DOI:10.1145/3206025
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. binary coding
  2. hashing
  3. large-scale multimedia data retrieval
  4. matrix classifier
  5. subspace retrieval

Qualifiers

  • Research-article

Funding Sources

  • Beijing Natural Science Foundation
  • State Key Lab. of Software Development Environment
  • the National Natural Science Foundation of China

Conference

ICMR '18
Sponsor:

Acceptance Rates

ICMR '18 Paper Acceptance Rate 44 of 136 submissions, 32%;
Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Generalized Zero-Shot Learning via Implicit Attribute Composition2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC53992.2023.10394354(384-389)Online publication date: 1-Oct-2023
  • (2023)Attribute subspaces for zero-shot learningPattern Recognition10.1016/j.patcog.2023.109869144(109869)Online publication date: Dec-2023
  • (2021)Fast Nearest Subspace Search via Random Angular HashingIEEE Transactions on Multimedia10.1109/TMM.2020.297745923(342-352)Online publication date: 2021
  • (2021)Fast Subspace Clustering Based on the Kronecker Product2020 25th International Conference on Pattern Recognition (ICPR)10.1109/ICPR48806.2021.9412287(1558-1565)Online publication date: 10-Jan-2021
  • (2021)GmFace: An explicit function for face image representationDisplays10.1016/j.displa.2021.10202268(102022)Online publication date: Jul-2021
  • (2020)Matrix Classifier On Dynamic Functional Connectivity For Mci Identification2020 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP40778.2020.9191280(325-329)Online publication date: Oct-2020
  • (2019)Latent distribution preserving deep subspace clusteringProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367471.3367659(4440-4446)Online publication date: 10-Aug-2019
  • (2019)Accelerating Deep Convnets via Sparse Subspace ClusteringImage and Graphics10.1007/978-3-030-34110-7_50(595-606)Online publication date: 28-Nov-2019
  • (2018)Cross-Model Retrieval with Reconstruct HashingStructural, Syntactic, and Statistical Pattern Recognition10.1007/978-3-319-97785-0_37(386-394)Online publication date: 2-Aug-2018
  • (2018)Random Angular Projection for Fast Nearest Subspace SearchAdvances in Multimedia Information Processing – PCM 201810.1007/978-3-030-00776-8_2(15-26)Online publication date: 19-Sep-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media