More Web Proxy on the site http://driver.im/

short-paper

Dictionary Learning Based Hashing for Cross-Modal Retrieval

Author:

Xin-Shun XuAuthors Info & Claims

MM '16: Proceedings of the 24th ACM international conference on Multimedia

Pages 177 - 181

https://doi.org/10.1145/2964284.2967206

Published: 01 October 2016 Publication History

Abstract

Recent years have witnessed the growing popularity of cross-modal hashing for fast multi-modal data retrieval. Most existing cross-modal hashing methods project heterogeneous data directly into a common space with linear projection matrices. However, such scheme will lead to large error as there will probably be some heterogeneous data with semantic similarity hard to be close in latent space when linear projection is used. In this paper, we propose a dictionary learning cross-modal hashing (DLCMH) to perform cross-modal similarity search. Instead of projecting data directly, DLCMH learns dictionaries and generates sparse representation for each instance, which is more suitable to be projected to latent space. Then, it assumes that all modalities of one instance have identical hash codes, and gets final binary codes by minimizing quantization error. Experimental results on two real-world datasets show that DLCMH outperforms or is comparable to several state-of-the-art hashing models.

References

[1]

D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.

Digital Library

[2]

M. M. Bronstein, A. M. Bronstein, F. Michel, and N. Paragios. Data fusion through cross-modality metric learning using similarity-sensitive hashing. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pages 3594--3601, 2010.

[3]

T.-S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y. Zheng. Nus-wide: a real-world web image database from national university of singapore. In Proceedings of ACM International Conference on Image and Video Retrieval, page 48, 2009.

Digital Library

[4]

G. Ding, Y. Guo, and J. Zhou. Collective matrix factorization hashing for multimodal data. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 2075--2082, 2014.

Digital Library

[5]

S. Kim, Y. Kang, and S. Choi. Sequential spectral learning to hash with multiple representations. In Proceedings of European Conference on Computer Vision, pages 538--551, 2012.

Digital Library

[6]

S. Kumar and R. Udupa. Learning hash functions for cross-view similarity search. In Proceedings of International Joint Conference on Artificial Intelligence, pages 1360--1365, 2011.

Digital Library

[7]

Z. Lin, G. Ding, M. Hu, and J. Wang. Semantics-preserving hashing for cross-view retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3864--3872, 2015.

[8]

W. Liu, J. Wang, R. Ji, Y.-G. Jiang, and S.-F. Chang. Supervised hashing with kernels. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 2074--2081, 2012.

Digital Library

[9]

M. Ou, P. Cui, F. Wang, J. Wang, W. Zhu, and S. Yang. Comparing apples to oranges: a scalable solution with heterogeneous hashing. In Proceedings of ACM International Conference on Knowledge Discovery and Data Mining, pages 230--238, 2013.

Digital Library

[10]

N. Rasiwasia, J. Costa Pereira, E. Coviello, G. Doyle, G. R. Lanckriet, R. Levy, and N. Vasconcelos. A new approach to cross-modal multimedia retrieval. In Proceedings of ACM International Conference on Multimedia, pages 251--260, 2010.

Digital Library

[11]

J. Song, Y. Yang, Y. Yang, Z. Huang, and H. T. Shen. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In Proceedings of ACM International Conference on Management of Data, pages 785--796, 2013.

Digital Library

[12]

D. Wang, X. Gao, X. Wang, and L. He. Semantic topic multimodal hashing for cross-media retrieval. In Proceedings of International Joint Conference on Artificial Intelligence, pages 3890--3896, 2015.

Digital Library

[13]

J. Wang, X.-S. Xu, S. Guo, L. Cui, and X.-L. Wang. Linear unsupervised hashing for ann search in euclidean space. Neurocomputing, 171(C):283--292, 2016.

Digital Library

[14]

S.-S. Wang, Z. Huang, and X.-S. Xu. A multi-label least-squares hashing for scalable image search. In Proceedings of SIAM International Conference on Data Mining, pages 954--962, 2015.

[15]

Y. Yang, Z.-J. Zha, Y. Gao, X. Zhu, and T.-S. Chua. Exploiting web images for robust semantic video indexing via sample-specific loss. IEEE Transactions on Multimedia, 16(6):1677--1689, 2014.

[16]

Y. Yang, H. Zhang, M. Zhang, F. Shen, and X. Li. Visual coding in a semantic hierarchy. In Proceedings of ACM International Conference on Multimedia, pages 59--68, 2015.

Digital Library

[17]

Z. Yu, F. Wu, Y. Yang, Q. Tian, J. Luo, and Y. Zhuang. Discriminative coupled dictionary hashing for fast cross-media retrieval. In Proceedings of ACM International Conference on Research and Development in Information Retrieval, pages 395--404, 2014.

Digital Library

[18]

D. Zhai, H. Chang, Y. Zhen, X. Liu, X. Chen, and W. Gao. Parametric local multimodal hashing for cross-view similarity search. In Proceedings of International Joint Conference on Artificial Intelligence, pages 2754--2760, 2013.

Digital Library

[19]

D. Zhang and W.-J. Li. Large-scale supervised multimodal hashing with semantic correlation maximization. In Proceedings of AAAI Conference on Artificial Intelligence, pages 2177--2183, 2014.

Digital Library

[20]

D. Zhang, F. Wang, and L. Si. Composite hashing with multiple information sources. In Proceedings of ACM SIGIR International Conference on Research and Development in Information Retrieval, pages 225--234, 2011.

Digital Library

[21]

Y. Zhen and D.-Y. Yeung. Co-regularized hashing for multimodal data. In Advances in Neural Information Processing Systems 25, pages 1376--1384, 2012.

Digital Library

[22]

Y. Zhen and D.-Y. Yeung. A probabilistic model for multimodal hash function learning. In Proceedings of ACM International Conference on Knowledge Discovery and Data Mining, pages 940--948, 2012.

Digital Library

[23]

J. Zhou, G. Ding, and Y. Guo. Latent semantic sparse hashing for cross-modal similarity search. In Proceedings of ACM International Conference on Research and Development in Information Retrieval, pages 415--424, 2014.

Digital Library

Cited By

Fan WZhang CLi HJia XWang G(2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3263221
Wang TLi FZhu LLi JZhang ZShen H(2024)Cross-Modal Retrieval: A Systematic Review of Methods and Future DirectionsProceedings of the IEEE10.1109/JPROC.2024.3525147112:11(1716-1754)Online publication date: Nov-2024
https://doi.org/10.1109/JPROC.2024.3525147
Zhang DWu XXu TKittler J(2022)Two-Stage Supervised Discrete Hashing for Cross-Modal RetrievalIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2021.313093952:11(7014-7026)Online publication date: Nov-2022
https://doi.org/10.1109/TSMC.2021.3130939
Show More Cited By

Index Terms

Dictionary Learning Based Hashing for Cross-Modal Retrieval
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Visual content-based indexing and retrieval
  2. Machine learning

Recommendations

Dictionary Learning based Supervised Discrete Hashing for Cross-Media Retrieval
ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval

Hashing technique has attracted considerable attention for large-scale multimedia retrieval due to its low storage cost and fast query speed. Moreover, many hashing models have been proposed for cross-modal retrieval task. However, there are still some ...
Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval
MM '17: Proceedings of the 25th ACM international conference on Multimedia

Recently, some cross-modal hashing methods have been devised for cross-modal search task. Essentially, given a similarity matrix, most of these methods tackle a discrete optimization problem by separating it into two stages, i.e., first relaxing the ...
Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search
MM '16: Proceedings of the 24th ACM international conference on Multimedia

In the literature of cross-modal search, most methods employ linear models to pursue hash codes that preserve data similarity, in terms of Euclidean distance, both within-modal and across-modal. However, data dimensionality can be quite different across ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '16: Proceedings of the 24th ACM international conference on Multimedia

October 2016

1542 pages

ISBN:9781450336031

DOI:10.1145/2964284

General Chairs:
Alan Hanjalic
Delft University of Technology
,
Cees Snoek
Qualcomm Research Netherlands / University of Amsterdam
,
Marcel Worring
University of Amsterdam
,
Moderator:
Dick Bulterman
CWI / VU University Amsterdam
,
Program Chairs:
Benoit Huet
EURECOM
,
Aisling Kelliher
Virginia Tech
,
Yiannis Kompatsiaris
CERTH-ITI
,
Jin Li
Microsoft

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MM '16

Sponsor:

SIGMM

MM '16: ACM Multimedia Conference

October 15 - 19, 2016

Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

20
Total Citations
View Citations
479
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)2

Reflects downloads up to 02 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Fan WZhang CLi HJia XWang G(2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3263221
Wang TLi FZhu LLi JZhang ZShen H(2024)Cross-Modal Retrieval: A Systematic Review of Methods and Future DirectionsProceedings of the IEEE10.1109/JPROC.2024.3525147112:11(1716-1754)Online publication date: Nov-2024
https://doi.org/10.1109/JPROC.2024.3525147
Zhang DWu XXu TKittler J(2022)Two-Stage Supervised Discrete Hashing for Cross-Modal RetrievalIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2021.313093952:11(7014-7026)Online publication date: Nov-2022
https://doi.org/10.1109/TSMC.2021.3130939
Gao JShen XFu PJi ZWang T(2022)Multiview Graph Convolutional Hashing for Multisource Remote Sensing Image RetrievalIEEE Geoscience and Remote Sensing Letters10.1109/LGRS.2021.309388419(1-5)Online publication date: 2022
https://doi.org/10.1109/LGRS.2021.3093884
Zou XWu SBakker EWang X(2022)Multi-label enhancement based self-supervised deep cross-modal hashingNeurocomputing10.1016/j.neucom.2021.09.053467(138-162)Online publication date: Jan-2022
https://doi.org/10.1016/j.neucom.2021.09.053
Zhang PZhao PLuo XXu X(2021)BRUSH: Label Reconstructing and Similarity Preserving Hashing for Cross-modal RetrievalProceedings of the 3rd ACM International Conference on Multimedia in Asia10.1145/3469877.3490589(1-7)Online publication date: 1-Dec-2021
https://dl.acm.org/doi/10.1145/3469877.3490589
Zheng MWang WLi Q(2021)Fine-Grained Image-Text Retrieval via Discriminative Latent Space LearningIEEE Signal Processing Letters10.1109/LSP.2021.306559528(643-647)Online publication date: 2021
https://doi.org/10.1109/LSP.2021.3065595
Öztürk Ş(2021)Convolutional neural network based dictionary learning to create hash codes for content-based image retrievalProcedia Computer Science10.1016/j.procs.2021.02.106183(624-629)Online publication date: 2021
https://doi.org/10.1016/j.procs.2021.02.106
Xiong HOu WYan ZGou JZhou QWang A(2020)Modality-specific matrix factorization hashing for cross-modal retrievalJournal of Ambient Intelligence and Humanized Computing10.1007/s12652-020-02177-713:11(5067-5081)Online publication date: 19-Jun-2020
https://doi.org/10.1007/s12652-020-02177-7
Li CYan TLuo XNie LXu X(2019)Supervised Robust Discrete Multimodal Hashing for Cross-Media RetrievalIEEE Transactions on Multimedia10.1109/TMM.2019.291271421:11(2863-2877)Online publication date: Nov-2019
https://doi.org/10.1109/TMM.2019.2912714
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten