DOI: 10.1145/2671188.2749297

A Two-step Approach to Cross-modal Hashing

Published: 22 June 2015

Abstract

With the rapid growth of multimedia data, it is highly desirable to search objects of interest effectively and efficiently across different modalities in large-scale databases. Cross-modal hashing offers a promising way to address this problem. In this paper, we propose a two-step cross-modal hashing approach that obtains compact hash codes and learns hash functions from multimodal data. Our approach decomposes the cross-modal hashing problem into two steps: hash code generation and hash function learning. In the first step, we obtain the hash codes for all modalities of data via a joint multi-modal graph, which takes into account both intra-modality and inter-modality similarity. In the second step, hash function learning is formulated as a binary classification problem: we train binary classifiers to predict the hash code of any previously unseen data object. Experimental results on two cross-modal datasets show the effectiveness of the proposed approach.
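The two steps described in the abstract can be sketched in code. This is a minimal illustration under simplifying assumptions, not the authors' exact formulation: the joint multi-modal graph is taken as a given affinity matrix `W`, step one uses a plain spectral relaxation (Laplacian eigenvectors binarized by sign), and step two trains one linear SVM per bit (the author tags mention support vector machines). The function names are hypothetical.

```python
import numpy as np
from scipy.linalg import eigh
from sklearn.svm import LinearSVC

def two_step_hash(W, X, n_bits=8):
    """Step 1: binary codes from a joint similarity graph; Step 2: per-bit classifiers.

    W : (n, n) symmetric joint affinity matrix (intra- + inter-modality similarity)
    X : (n, d) feature matrix for one modality
    """
    # Step 1: spectral relaxation -- the smallest nontrivial eigenvectors of the
    # graph Laplacian yield real-valued embeddings that preserve graph similarity.
    D = np.diag(W.sum(axis=1))
    L = D - W
    _, vecs = eigh(L, subset_by_index=[1, n_bits])  # skip the trivial eigenvector
    codes = (vecs > 0).astype(int)                  # binarize by sign
    # Step 2: treat each hash bit as a binary label and train one classifier
    # per bit, so unseen objects can be hashed from their features alone.
    clfs = [LinearSVC().fit(X, codes[:, b]) for b in range(n_bits)]
    return codes, clfs

def hash_unseen(clfs, x_new):
    """Predict the hash code of a previously unseen object (x_new: shape (1, d))."""
    return np.array([clf.predict(x_new)[0] for clf in clfs])
```

In this sketch the per-bit classifiers play the role of the learned hash functions: at query time an unseen object is mapped to a code without rebuilding the graph.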



    Published In
    ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval
    June 2015
    700 pages
    ISBN:9781450332743
    DOI:10.1145/2671188

    Publisher

    Association for Computing Machinery, New York, NY, United States


    Author Tags

    1. cross-modal hashing
    2. joint multimodal graph
    3. similarity search
    4. support vector machine

    Qualifiers

    • Short-paper

    Funding Sources

    • National Basic Research Program of China
    • National Natural Science Foundation of China

    Conference

    ICMR '15

    Acceptance Rates

    ICMR '15 Paper Acceptance Rate: 48 of 127 submissions, 38%
    Overall Acceptance Rate: 254 of 830 submissions, 31%
