More Web Proxy on the site http://driver.im/

research-article

Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval

Authors:

Peng-Fei Zhang,

Chuan-Xiang Li,

Xin-Shun XuAuthors Info & Claims

MM '17: Proceedings of the 25th ACM international conference on Multimedia

Pages 1762 - 1770

https://doi.org/10.1145/3123266.3123320

Published: 23 October 2017 Publication History

Abstract

Recently, some cross-modal hashing methods have been devised for cross-modal search task. Essentially, given a similarity matrix, most of these methods tackle a discrete optimization problem by separating it into two stages, i.e., first relaxing the binary constraints and finding a solution of the relaxed optimization problem, then quantizing the solution to obtain the binary codes. This scheme will generate large quantization error. Some discrete optimization methods have been proposed to tackle this; however, the generation of the binary codes is independent of the features in the original space, which makes it not robust to noise. To consider these problems, in this paper, we propose a novel supervised cross-modal hashing method---Semi-Relaxation Supervised Hashing (SRSH). It can learn the hash functions and the binary codes simultaneously. At the same time, to tackle the optimization problem, it relaxes a part of binary constraints, instead of all of them, by introducing an intermediate representation variable. By doing this, the quantization error can be reduced and the optimization problem can also be easily solved by an iterative algorithm proposed in this paper. Extensive experimental results on three benchmark datasets demonstrate that SRSH can obtain competitive results and outperform state-of-the-art unsupervised and supervised cross-modal hashing methods.

References

[1]

Michael M Bronstein, Alexander M Bronstein, Fabrice Michel, and Nikos Paragios. 2010. Data fusion through cross-modality metric learning using similarity-sensitive hashing Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 3594--3601.

[2]

Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, and Philip S Yu. 2016. Deep Visual-Semantic Hashing for Cross-Modal Retrieval Proceedings of ACM International Conference on Knowledge Discovery and Data Mining. 1445--1454.

Digital Library

[3]

Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from national university of singapore Proceedings of ACM International Conference on Image and Video Retrieval. 48.

Digital Library

[4]

Qi Dai, Jianguo Li, Jingdong Wang, and Yu-Gang Jiang. 2016. Binary optimized hashing. In Proceedings of ACM International Conference on Multimedia. 1247--1256.

Digital Library

[5]

Guiguang Ding, Yuchen Guo, and Jile Zhou. 2014. Collective matrix factorization hashing for multimodal data Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2075--2082.

Digital Library

[6]

Aristides Gionis, Piotr Indyk, Rajeev Motwani, and others. 1999. Similarity search in high dimensions via hashing. Proceedings of International Conference on Very Large Data Bases. 518--529.

Digital Library

[7]

Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, 35 (2013), 2916--2929.

Digital Library

[8]

Mark J Huiskes and Michael S Lew. 2008. The MIR flickr retrieval evaluation. In Proceedings of ACM International Conference on Multimedia Information Retrieval. 39--43.

Digital Library

[9]

Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable graph hashing with feature transformation. Proceedings of International Joint Conference on Artificial Intelligence. 2248--2254.

Digital Library

[10]

Qing-Yuan Jiang and Wu-Jun Li. 2016. Deep Cross-Modal Hashing. arXiv preprint arXiv:1602.02255 (2016).

[11]

Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou. 2016. Column sampling based discrete supervised hashing. Proceedings of AAAI Conference on Artificial Intelligence. 1230--1236.

Digital Library

[12]

Weihao Kong and Wu-Jun Li. 2012. Isotropic hashing Proceedings of Advances in Neural Information Processing Systems. 1646--1654.

Digital Library

[13]

Shaishav Kumar and Raghavendra Udupa. 2011. Learning hash functions for cross-view similarity search Proceedings of International Joint Conference on Artificial Intelligence. 1360.

Digital Library

[14]

Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton van den Hengel, and David Suter. 2014. Fast supervised hashing with decision trees for high-dimensional data Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 1963--1970.

Digital Library

[15]

Guosheng Lin, Chunhua Shen, David Suter, and Anton Van Den Hengel. 2013. A general two-step approach to learning-based hashing Proceedings of IEEE International Conference on Computer Vision. 2552--2559.

Digital Library

[16]

Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 3864--3872.

[17]

Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete graph hashing. In Proceedings of Advances in Neural Information Processing Systems. 3419--3427.

Digital Library

[18]

Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2074--2081.

Digital Library

[19]

Mingsheng Long, Yue Cao, Jianmin Wang, and Philip S Yu. 2016. Composite correlation quantization for efficient multimodal retrieval Proceedings of ACM International Conference on Research and Development in Information Retrieval. 579--588.

Digital Library

[20]

Liqiang Nie, Meng Wang, Zheng-Jun Zha, and Tat-Seng Chua. 2012 a. Oracle in Image Search: A Content-Based Approach to Performance Prediction. ACM Transactions on Information System Vol. 30, 2 (2012), 13:1--13:23.

Digital Library

[21]

Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, and Tat-Seng Chua. 2012 b. Harvesting Visual Concepts for Image Search with Complex Queries Proceedings of ACM International Conference on Multimedia. 59--68.

Digital Library

[22]

Mohammad Norouzi and David M Blei. 2011. Minimal loss hashing for compact binary codes. In Proceedings of International Conference on Machine Learning. 353--360.

Digital Library

[23]

Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert RG Lanckriet, Roger Levy, and Nuno Vasconcelos. 2010. A new approach to cross-modal multimedia retrieval Proceedings of ACM International Conference on Multimedia. 251--260.

Digital Library

[24]

Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 37--45.

[25]

Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media hashing for large-scale retrieval from heterogeneous data sources Proceedings of ACM International Conference on Management of Data. 785--796.

Digital Library

[26]

Christoph Strecha, Alex Bronstein, Michael Bronstein, and Pascal Fua. 2012. LDAHash: Improved matching with smaller descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 34, 1 (2012), 66--78.

Digital Library

[27]

Jinhui Tang, Zechao Li, Meng Wang, and Ruizhen Zhao. 2015. Neighborhood Discriminant Hashing for Large-Scale Image Retrieval. IEEE Transactions on Image Processing Vol. 24, 9 (2015), 2827--2840.

Digital Library

[28]

Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2010. Sequential projection learning for hashing with compact codes Proceedings of International Conference on Machine Learning. 1127--1134.

Digital Library

[29]

Jian Wang, Xin-Shun Xu, Shanqing Guo, Lizhen Cui, and Xiao-Lin Wang. 2016 a. Linear unsupervised hashing for ANN search in Euclidean space. Neurocomputing Vol. 171 (2016), 283--292.

Digital Library

[30]

Min Wang, Wengang Zhou, Qi Tian, Zhengjun Zha, and Houqiang Li. 2016 b. Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing Proceedings of ACM International Conference on Multimedia. 1257--1266.

Digital Library

[31]

Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing Proceedings of Advances in Neural Information Processing Systems. 1753--1760.

Digital Library

[32]

Botong Wu, Qiang Yang, Wei-Shi Zheng, Yizhou Wang, and Jingdong Wang. 2015. Quantized correlation hashing for fast cross-modal search. Proceedings of International Joint Conference on Artificial Intelligence. 3946--3952.

Digital Library

[33]

Xin-Shun Xu. 2016. Dictionary learning based hashing for cross-modal retrieval Proceedings of ACM International Conference on Multimedia. 177--181.

Digital Library

[34]

Ting-Kun Yan, Xin-Shun Xu, Shanqing Guo, Zi Huang, and Xiao-Lin Wang. 2016. Supervised robust discrete multimodal hashing for cross-media retrieval Proceedings of ACM International on Conference on Information and Knowledge Management. 1271--1280.

Digital Library

[35]

Erkun Yang, Cheng Deng, Wei Liu, Xianglong Liu, Dacheng Tao, and Xinbo Gao. 2017. Pairwise Relationship Guided Deep Hashing for Cross-Modal Retrieval Proceedings of AAAI Conference on Artificial Intelligence. 1618--1625.

[36]

Yang Yang, Zhigang Ma, Yi Yang, Feiping Nie, and Heng Tao Shen. 2015. Multitask spectral clustering by exploring intertask correlation. IEEE Transactions on Cybernetics Vol. 45, 5 (2015), 1083--1094.

[37]

Yang Yang, Zheng-Jun Zha, Yue Gao, Xiaofeng Zhu, and Tat-Seng Chua. 2014. Exploiting web images for semantic video indexing via robust sample-specific loss. IEEE Transactions on Multimedia Vol. 16, 6 (2014), 1677--1689.

[38]

Dongqing Zhang and Wu-Jun Li. 2014. Large-scale supervised multimodal hashing with semantic correlation maximization. Proceedings of AAAI Conference on Artificial Intelligence. 7.

Digital Library

[39]

Yi Zhen and Dit-Yan Yeung. 2012. Co-regularized hashing for multimodal data. In Proceedings of Advances in Neural Information Processing Systems. 1376--1384.

Digital Library

[40]

Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014. Latent semantic sparse hashing for cross-modal similarity search Proceedings of ACM International Conference on Research and Development in Information Retrieval. 415--424.

Digital Library

[41]

Xiaofeng Zhu, Zi Huang, Heng Tao Shen, and Xin Zhao. 2013. Linear cross-modal hashing for efficient multimedia search Proceedings of ACM International Conference on Multimedia. 143--152.

Digital Library

Cited By

Fan WZhang CLi HJia XWang G(2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3263221
Chen B(2025) Robust Partially Observed Data Sensing via ℓ₂, ₚ Norms With Flexible Adaptive Label Marginal Space for Visual IoT IEEE Internet of Things Journal10.1109/JIOT.2024.349025212:5(5435-5448)Online publication date: 1-Mar-2025
https://doi.org/10.1109/JIOT.2024.3490252
Teng SLi JTeng LFei LWu NZhang W(2024)Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.337287626(7917-7932)Online publication date: 5-Mar-2024
https://dl.acm.org/doi/10.1109/TMM.2024.3372876
Show More Cited By

Index Terms

Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval

Recommendations

Supervised Discriminative Discrete Hashing for Cross-Modal Retrieval
Advanced Data Mining and Applications
Abstract
With the growing interest in cross-modal retrieval technology, cross-modal hashing has become a mainstream trend for comparing and searching between different modalities. However, when faced with multi-label information, existing research has ... $^{}$ $^{}$
Semi-supervised semantic factorization hashing for fast cross-modal retrieval

Cross-modal hashing can effectively solve the large-scale cross-modal retrieval by integrating the advantages of traditional cross-modal analysis and hashing techniques. In cross-modal hashing, preserving semantic correlation is important and ...
Class consistent hashing for fast Web data searching

Hashing based ANN search has drawn lots of attention due to its low storage and time cost. Supervised hashing methods can leverage label information to generate compact and accurate hash codes and have achieved promising results. However, when dealing ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '17: Proceedings of the 25th ACM international conference on Multimedia

October 2017

2028 pages

ISBN:9781450349062

DOI:10.1145/3123266

General Chairs:
Qiong Liu
FXPAL, USA
,
Rainer Lienhart
Universität Augsburg, Germany
,
Haohong Wang
TCL America, USA
,
Program Chairs:
Sheng-Wei "Kuan-Ta" Chen
Academia Sinica, Taiwan
,
Susanne Boll
University of Oldenburg, Germany
,
Phoebe Chen
La Trobe University, Australia
,
Gerald Friedland
Lawrence Livermore National Lab, USA
,
Jia Li
Google, USA
,
Shuicheng Yan
Qihoo 360, China

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 October 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Key Research and Development Program of Shandong Province
National Natural Science Foundation of China

Conference

MM '17

Sponsor:

SIGMM

MM '17: ACM Multimedia Conference

October 23 - 27, 2017

California, Mountain View, USA

Acceptance Rates

MM '17 Paper Acceptance Rate 189 of 684 submissions, 28%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

40
Total Citations
View Citations
446
Total Downloads

Downloads (Last 12 months)25
Downloads (Last 6 weeks)0

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Fan WZhang CLi HJia XWang G(2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3263221
Chen B(2025) Robust Partially Observed Data Sensing via ℓ₂, ₚ Norms With Flexible Adaptive Label Marginal Space for Visual IoT IEEE Internet of Things Journal10.1109/JIOT.2024.349025212:5(5435-5448)Online publication date: 1-Mar-2025
https://doi.org/10.1109/JIOT.2024.3490252
Teng SLi JTeng LFei LWu NZhang W(2024)Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.337287626(7917-7932)Online publication date: 5-Mar-2024
https://dl.acm.org/doi/10.1109/TMM.2024.3372876
Zhu LZheng CGuan WLi JYang YShen H(2024)Multi-Modal Hashing for Efficient Multimedia Retrieval: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.328292136:1(239-260)Online publication date: Jan-2024
https://doi.org/10.1109/TKDE.2023.3282921
Yang FHan MMa FLiu YDing XTong D(2024)Disperse Asymmetric Subspace Relation Hashing for Cross-Modal RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.328730134:1(603-617)Online publication date: Jan-2024
https://doi.org/10.1109/TCSVT.2023.3287301
Teng SXu TZheng ZWu NZhang WTeng L(2024)Robust Asymmetric Cross-Modal Hashing Retrieval With Dual Semantic EnhancementIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.335249411:3(4340-4353)Online publication date: Jun-2024
https://doi.org/10.1109/TCSS.2024.3352494
Chen BWang T(2024)Robust Perceptual Data Sensing Based on Majorization-Minimized Low-Rank Semidefinite Relaxation for Visual IoTIEEE Internet of Things Journal10.1109/JIOT.2024.339901211:16(27201-27213)Online publication date: 15-Aug-2024
https://doi.org/10.1109/JIOT.2024.3399012
Chen RWang H(2024)Supervised Consensus Anchor Graph Hashing for Cross Modal RetrievalIEEE Access10.1109/ACCESS.2023.334850812(1805-1821)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2023.3348508
Wang HChen RShu ZZhang YLi H(2024)Supervised adaptive similarity consistent latent representation hashingNeurocomputing10.1016/j.neucom.2023.127113570(127113)Online publication date: Feb-2024
https://doi.org/10.1016/j.neucom.2023.127113
Wang YDong FWang KNie XChen Z(2024)Weighted cross-modal hashing with label enhancementKnowledge-Based Systems10.1016/j.knosys.2024.111657293(111657)Online publication date: Jun-2024
https://doi.org/10.1016/j.knosys.2024.111657
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten