[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3123266.3123320acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval

Published: 23 October 2017 Publication History

Abstract

Recently, some cross-modal hashing methods have been devised for cross-modal search task. Essentially, given a similarity matrix, most of these methods tackle a discrete optimization problem by separating it into two stages, i.e., first relaxing the binary constraints and finding a solution of the relaxed optimization problem, then quantizing the solution to obtain the binary codes. This scheme will generate large quantization error. Some discrete optimization methods have been proposed to tackle this; however, the generation of the binary codes is independent of the features in the original space, which makes it not robust to noise. To consider these problems, in this paper, we propose a novel supervised cross-modal hashing method---Semi-Relaxation Supervised Hashing (SRSH). It can learn the hash functions and the binary codes simultaneously. At the same time, to tackle the optimization problem, it relaxes a part of binary constraints, instead of all of them, by introducing an intermediate representation variable. By doing this, the quantization error can be reduced and the optimization problem can also be easily solved by an iterative algorithm proposed in this paper. Extensive experimental results on three benchmark datasets demonstrate that SRSH can obtain competitive results and outperform state-of-the-art unsupervised and supervised cross-modal hashing methods.

References

[1]
Michael M Bronstein, Alexander M Bronstein, Fabrice Michel, and Nikos Paragios. 2010. Data fusion through cross-modality metric learning using similarity-sensitive hashing Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 3594--3601.
[2]
Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, and Philip S Yu. 2016. Deep Visual-Semantic Hashing for Cross-Modal Retrieval Proceedings of ACM International Conference on Knowledge Discovery and Data Mining. 1445--1454.
[3]
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from national university of singapore Proceedings of ACM International Conference on Image and Video Retrieval. 48.
[4]
Qi Dai, Jianguo Li, Jingdong Wang, and Yu-Gang Jiang. 2016. Binary optimized hashing. In Proceedings of ACM International Conference on Multimedia. 1247--1256.
[5]
Guiguang Ding, Yuchen Guo, and Jile Zhou. 2014. Collective matrix factorization hashing for multimodal data Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2075--2082.
[6]
Aristides Gionis, Piotr Indyk, Rajeev Motwani, and others. 1999. Similarity search in high dimensions via hashing. Proceedings of International Conference on Very Large Data Bases. 518--529.
[7]
Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, 35 (2013), 2916--2929.
[8]
Mark J Huiskes and Michael S Lew. 2008. The MIR flickr retrieval evaluation. In Proceedings of ACM International Conference on Multimedia Information Retrieval. 39--43.
[9]
Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable graph hashing with feature transformation. Proceedings of International Joint Conference on Artificial Intelligence. 2248--2254.
[10]
Qing-Yuan Jiang and Wu-Jun Li. 2016. Deep Cross-Modal Hashing. arXiv preprint arXiv:1602.02255 (2016).
[11]
Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou. 2016. Column sampling based discrete supervised hashing. Proceedings of AAAI Conference on Artificial Intelligence. 1230--1236.
[12]
Weihao Kong and Wu-Jun Li. 2012. Isotropic hashing Proceedings of Advances in Neural Information Processing Systems. 1646--1654.
[13]
Shaishav Kumar and Raghavendra Udupa. 2011. Learning hash functions for cross-view similarity search Proceedings of International Joint Conference on Artificial Intelligence. 1360.
[14]
Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton van den Hengel, and David Suter. 2014. Fast supervised hashing with decision trees for high-dimensional data Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 1963--1970.
[15]
Guosheng Lin, Chunhua Shen, David Suter, and Anton Van Den Hengel. 2013. A general two-step approach to learning-based hashing Proceedings of IEEE International Conference on Computer Vision. 2552--2559.
[16]
Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 3864--3872.
[17]
Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete graph hashing. In Proceedings of Advances in Neural Information Processing Systems. 3419--3427.
[18]
Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2074--2081.
[19]
Mingsheng Long, Yue Cao, Jianmin Wang, and Philip S Yu. 2016. Composite correlation quantization for efficient multimodal retrieval Proceedings of ACM International Conference on Research and Development in Information Retrieval. 579--588.
[20]
Liqiang Nie, Meng Wang, Zheng-Jun Zha, and Tat-Seng Chua. 2012 a. Oracle in Image Search: A Content-Based Approach to Performance Prediction. ACM Transactions on Information System Vol. 30, 2 (2012), 13:1--13:23.
[21]
Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, and Tat-Seng Chua. 2012 b. Harvesting Visual Concepts for Image Search with Complex Queries Proceedings of ACM International Conference on Multimedia. 59--68.
[22]
Mohammad Norouzi and David M Blei. 2011. Minimal loss hashing for compact binary codes. In Proceedings of International Conference on Machine Learning. 353--360.
[23]
Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert RG Lanckriet, Roger Levy, and Nuno Vasconcelos. 2010. A new approach to cross-modal multimedia retrieval Proceedings of ACM International Conference on Multimedia. 251--260.
[24]
Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 37--45.
[25]
Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, and Heng Tao Shen. 2013. Inter-media hashing for large-scale retrieval from heterogeneous data sources Proceedings of ACM International Conference on Management of Data. 785--796.
[26]
Christoph Strecha, Alex Bronstein, Michael Bronstein, and Pascal Fua. 2012. LDAHash: Improved matching with smaller descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 34, 1 (2012), 66--78.
[27]
Jinhui Tang, Zechao Li, Meng Wang, and Ruizhen Zhao. 2015. Neighborhood Discriminant Hashing for Large-Scale Image Retrieval. IEEE Transactions on Image Processing Vol. 24, 9 (2015), 2827--2840.
[28]
Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2010. Sequential projection learning for hashing with compact codes Proceedings of International Conference on Machine Learning. 1127--1134.
[29]
Jian Wang, Xin-Shun Xu, Shanqing Guo, Lizhen Cui, and Xiao-Lin Wang. 2016 a. Linear unsupervised hashing for ANN search in Euclidean space. Neurocomputing Vol. 171 (2016), 283--292.
[30]
Min Wang, Wengang Zhou, Qi Tian, Zhengjun Zha, and Houqiang Li. 2016 b. Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing Proceedings of ACM International Conference on Multimedia. 1257--1266.
[31]
Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing Proceedings of Advances in Neural Information Processing Systems. 1753--1760.
[32]
Botong Wu, Qiang Yang, Wei-Shi Zheng, Yizhou Wang, and Jingdong Wang. 2015. Quantized correlation hashing for fast cross-modal search. Proceedings of International Joint Conference on Artificial Intelligence. 3946--3952.
[33]
Xin-Shun Xu. 2016. Dictionary learning based hashing for cross-modal retrieval Proceedings of ACM International Conference on Multimedia. 177--181.
[34]
Ting-Kun Yan, Xin-Shun Xu, Shanqing Guo, Zi Huang, and Xiao-Lin Wang. 2016. Supervised robust discrete multimodal hashing for cross-media retrieval Proceedings of ACM International on Conference on Information and Knowledge Management. 1271--1280.
[35]
Erkun Yang, Cheng Deng, Wei Liu, Xianglong Liu, Dacheng Tao, and Xinbo Gao. 2017. Pairwise Relationship Guided Deep Hashing for Cross-Modal Retrieval Proceedings of AAAI Conference on Artificial Intelligence. 1618--1625.
[36]
Yang Yang, Zhigang Ma, Yi Yang, Feiping Nie, and Heng Tao Shen. 2015. Multitask spectral clustering by exploring intertask correlation. IEEE Transactions on Cybernetics Vol. 45, 5 (2015), 1083--1094.
[37]
Yang Yang, Zheng-Jun Zha, Yue Gao, Xiaofeng Zhu, and Tat-Seng Chua. 2014. Exploiting web images for semantic video indexing via robust sample-specific loss. IEEE Transactions on Multimedia Vol. 16, 6 (2014), 1677--1689.
[38]
Dongqing Zhang and Wu-Jun Li. 2014. Large-scale supervised multimodal hashing with semantic correlation maximization. Proceedings of AAAI Conference on Artificial Intelligence. 7.
[39]
Yi Zhen and Dit-Yan Yeung. 2012. Co-regularized hashing for multimodal data. In Proceedings of Advances in Neural Information Processing Systems. 1376--1384.
[40]
Jile Zhou, Guiguang Ding, and Yuchen Guo. 2014. Latent semantic sparse hashing for cross-modal similarity search Proceedings of ACM International Conference on Research and Development in Information Retrieval. 415--424.
[41]
Xiaofeng Zhu, Zi Huang, Heng Tao Shen, and Xin Zhao. 2013. Linear cross-modal hashing for efficient multimedia search Proceedings of ACM International Conference on Multimedia. 143--152.

Cited By

View all
  • (2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
  • (2025) Robust Partially Observed Data Sensing via ℓ₂, ₚ Norms With Flexible Adaptive Label Marginal Space for Visual IoT IEEE Internet of Things Journal10.1109/JIOT.2024.349025212:5(5435-5448)Online publication date: 1-Mar-2025
  • (2024)Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.337287626(7917-7932)Online publication date: 5-Mar-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 October 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. approximate nearest neighbor search
  2. cross-modal search
  3. hashing
  4. multimodal

Qualifiers

  • Research-article

Funding Sources

  • Key Research and Development Program of Shandong Province
  • National Natural Science Foundation of China

Conference

MM '17
Sponsor:
MM '17: ACM Multimedia Conference
October 23 - 27, 2017
California, Mountain View, USA

Acceptance Rates

MM '17 Paper Acceptance Rate 189 of 684 submissions, 28%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Three-Stage Semisupervised Cross-Modal Hashing With Pairwise Relations ExploitationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.326322136:1(260-273)Online publication date: Jan-2025
  • (2025) Robust Partially Observed Data Sensing via ℓ₂, ₚ Norms With Flexible Adaptive Label Marginal Space for Visual IoT IEEE Internet of Things Journal10.1109/JIOT.2024.349025212:5(5435-5448)Online publication date: 1-Mar-2025
  • (2024)Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2024.337287626(7917-7932)Online publication date: 5-Mar-2024
  • (2024)Multi-Modal Hashing for Efficient Multimedia Retrieval: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.328292136:1(239-260)Online publication date: Jan-2024
  • (2024)Disperse Asymmetric Subspace Relation Hashing for Cross-Modal RetrievalIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.328730134:1(603-617)Online publication date: Jan-2024
  • (2024)Robust Asymmetric Cross-Modal Hashing Retrieval With Dual Semantic EnhancementIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.335249411:3(4340-4353)Online publication date: Jun-2024
  • (2024)Robust Perceptual Data Sensing Based on Majorization-Minimized Low-Rank Semidefinite Relaxation for Visual IoTIEEE Internet of Things Journal10.1109/JIOT.2024.339901211:16(27201-27213)Online publication date: 15-Aug-2024
  • (2024)Supervised Consensus Anchor Graph Hashing for Cross Modal RetrievalIEEE Access10.1109/ACCESS.2023.334850812(1805-1821)Online publication date: 2024
  • (2024)Supervised adaptive similarity consistent latent representation hashingNeurocomputing10.1016/j.neucom.2023.127113570(127113)Online publication date: Feb-2024
  • (2024)Weighted cross-modal hashing with label enhancementKnowledge-Based Systems10.1016/j.knosys.2024.111657293(111657)Online publication date: Jun-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media