[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2964284.2967206acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Dictionary Learning Based Hashing for Cross-Modal Retrieval

Published: 01 October 2016 Publication History

Abstract

Recent years have witnessed the growing popularity of cross-modal hashing for fast multi-modal data retrieval. Most existing cross-modal hashing methods project heterogeneous data directly into a common space with linear projection matrices. However, such scheme will lead to large error as there will probably be some heterogeneous data with semantic similarity hard to be close in latent space when linear projection is used. In this paper, we propose a dictionary learning cross-modal hashing (DLCMH) to perform cross-modal similarity search. Instead of projecting data directly, DLCMH learns dictionaries and generates sparse representation for each instance, which is more suitable to be projected to latent space. Then, it assumes that all modalities of one instance have identical hash codes, and gets final binary codes by minimizing quantization error. Experimental results on two real-world datasets show that DLCMH outperforms or is comparable to several state-of-the-art hashing models.

References

[1]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[2]
M. M. Bronstein, A. M. Bronstein, F. Michel, and N. Paragios. Data fusion through cross-modality metric learning using similarity-sensitive hashing. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pages 3594--3601, 2010.
[3]
T.-S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng. Nus-wide: a real-world web image database from national university of singapore. In Proceedings of ACM International Conference on Image and Video Retrieval, page 48, 2009.
[4]
G. Ding, Y. Guo, and J. Zhou. Collective matrix factorization hashing for multimodal data. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 2075--2082, 2014.
[5]
S. Kim, Y. Kang, and S. Choi. Sequential spectral learning to hash with multiple representations. In Proceedings of European Conference on Computer Vision, pages 538--551, 2012.
[6]
S. Kumar and R. Udupa. Learning hash functions for cross-view similarity search. In Proceedings of International Joint Conference on Artificial Intelligence, pages 1360--1365, 2011.
[7]
Z. Lin, G. Ding, M. Hu, and J. Wang. Semantics-preserving hashing for cross-view retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3864--3872, 2015.
[8]
W. Liu, J. Wang, R. Ji, Y.-G. Jiang, and S.-F. Chang. Supervised hashing with kernels. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 2074--2081, 2012.
[9]
M. Ou, P. Cui, F. Wang, J. Wang, W. Zhu, and S. Yang. Comparing apples to oranges: a scalable solution with heterogeneous hashing. In Proceedings of ACM International Conference on Knowledge Discovery and Data Mining, pages 230--238, 2013.
[10]
N. Rasiwasia, J. Costa Pereira, E. Coviello, G. Doyle, G. R. Lanckriet, R. Levy, and N. Vasconcelos. A new approach to cross-modal multimedia retrieval. In Proceedings of ACM International Conference on Multimedia, pages 251--260, 2010.
[11]
J. Song, Y. Yang, Y. Yang, Z. Huang, and H. T. Shen. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In Proceedings of ACM International Conference on Management of Data, pages 785--796, 2013.
[12]
D. Wang, X. Gao, X. Wang, and L. He. Semantic topic multimodal hashing for cross-media retrieval. In Proceedings of International Joint Conference on Artificial Intelligence, pages 3890--3896, 2015.
[13]
J. Wang, X.-S. Xu, S. Guo, L. Cui, and X.-L. Wang. Linear unsupervised hashing for ann search in euclidean space. Neurocomputing, 171(C):283--292, 2016.
[14]
S.-S. Wang, Z. Huang, and X.-S. Xu. A multi-label least-squares hashing for scalable image search. In Proceedings of SIAM International Conference on Data Mining, pages 954--962, 2015.
[15]
Y. Yang, Z.-J. Zha, Y. Gao, X. Zhu, and T.-S. Chua. Exploiting web images for robust semantic video indexing via sample-specific loss. IEEE Transactions on Multimedia, 16(6):1677--1689, 2014.
[16]
Y. Yang, H. Zhang, M. Zhang, F. Shen, and X. Li. Visual coding in a semantic hierarchy. In Proceedings of ACM International Conference on Multimedia, pages 59--68, 2015.
[17]
Z. Yu, F. Wu, Y. Yang, Q. Tian, J. Luo, and Y. Zhuang. Discriminative coupled dictionary hashing for fast cross-media retrieval. In Proceedings of ACM International Conference on Research and Development in Information Retrieval, pages 395--404, 2014.
[18]
D. Zhai, H. Chang, Y. Zhen, X. Liu, X. Chen, and W. Gao. Parametric local multimodal hashing for cross-view similarity search. In Proceedings of International Joint Conference on Artificial Intelligence, pages 2754--2760, 2013.
[19]
D. Zhang and W.-J. Li. Large-scale supervised multimodal hashing with semantic correlation maximization. In Proceedings of AAAI Conference on Artificial Intelligence, pages 2177--2183, 2014.
[20]
D. Zhang, F. Wang, and L. Si. Composite hashing with multiple information sources. In Proceedings of ACM SIGIR International Conference on Research and Development in Information Retrieval, pages 225--234, 2011.
[21]
Y. Zhen and D.-Y. Yeung. Co-regularized hashing for multimodal data. In Advances in Neural Information Processing Systems 25, pages 1376--1384, 2012.
[22]
Y. Zhen and D.-Y. Yeung. A probabilistic model for multimodal hash function learning. In Proceedings of ACM International Conference on Knowledge Discovery and Data Mining, pages 940--948, 2012.
[23]
J. Zhou, G. Ding, and Y. Guo. Latent semantic sparse hashing for cross-modal similarity search. In Proceedings of ACM International Conference on Research and Development in Information Retrieval, pages 415--424, 2014.

Cited By

View all
  • (2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
  • (2024)Cross-Modal Retrieval: A Systematic Review of Methods and Future DirectionsProceedings of the IEEE10.1109/JPROC.2024.3525147112:11(1716-1754)Online publication date: Nov-2024
  • (2022)Two-Stage Supervised Discrete Hashing for Cross-Modal RetrievalIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2021.313093952:11(7014-7026)Online publication date: Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '16: Proceedings of the 24th ACM international conference on Multimedia
October 2016
1542 pages
ISBN:9781450336031
DOI:10.1145/2964284
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cross-modal
  2. dictionary learning
  3. hashing
  4. sparse representation

Qualifiers

  • Short-paper

Conference

MM '16
Sponsor:
MM '16: ACM Multimedia Conference
October 15 - 19, 2016
Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)2
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
  • (2024)Cross-Modal Retrieval: A Systematic Review of Methods and Future DirectionsProceedings of the IEEE10.1109/JPROC.2024.3525147112:11(1716-1754)Online publication date: Nov-2024
  • (2022)Two-Stage Supervised Discrete Hashing for Cross-Modal RetrievalIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2021.313093952:11(7014-7026)Online publication date: Nov-2022
  • (2022)Multiview Graph Convolutional Hashing for Multisource Remote Sensing Image RetrievalIEEE Geoscience and Remote Sensing Letters10.1109/LGRS.2021.309388419(1-5)Online publication date: 2022
  • (2022)Multi-label enhancement based self-supervised deep cross-modal hashingNeurocomputing10.1016/j.neucom.2021.09.053467(138-162)Online publication date: Jan-2022
  • (2021)BRUSH: Label Reconstructing and Similarity Preserving Hashing for Cross-modal RetrievalProceedings of the 3rd ACM International Conference on Multimedia in Asia10.1145/3469877.3490589(1-7)Online publication date: 1-Dec-2021
  • (2021)Fine-Grained Image-Text Retrieval via Discriminative Latent Space LearningIEEE Signal Processing Letters10.1109/LSP.2021.306559528(643-647)Online publication date: 2021
  • (2021)Convolutional neural network based dictionary learning to create hash codes for content-based image retrievalProcedia Computer Science10.1016/j.procs.2021.02.106183(624-629)Online publication date: 2021
  • (2020)Modality-specific matrix factorization hashing for cross-modal retrievalJournal of Ambient Intelligence and Humanized Computing10.1007/s12652-020-02177-713:11(5067-5081)Online publication date: 19-Jun-2020
  • (2019)Supervised Robust Discrete Multimodal Hashing for Cross-Media RetrievalIEEE Transactions on Multimedia10.1109/TMM.2019.291271421:11(2863-2877)Online publication date: Nov-2019
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media