Abstract
Retrieving similar images based on its visual content is an important yet difficult problem. We propose in this paper a new method to improve the accuracy of content-based image retrieval systems. Typically, given a query image, existing retrieval methods return a ranked list based on the similarity scores between the query and individual images in the database. Our method goes further by relying on an analysis of the underlying connections among individual images in the database to improve this list. Initially, we consider each image in the database as a query and use an existing baseline method to search for its likely similar images. Then, the database is modeled as a graph where images are nodes and connections among possibly similar images are edges. Next, we introduce an algorithm to split this graph into stronger subgraphs, based on our notion of graph’s strength, so that images in each subgraph are expected to be truly similar to each other. We create for each subgraph a structure called integrated image which contains the visual features of all images in the subgraph. At query time, we compute the similarity scores not only between the query and individual database images but also between the query and the integrated images. The final similarity score of a database image is computed based on both its individual score and the score of the integrated image that it belongs to. This leads effectively to a re-ranking of the retrieved images. We evaluate our method on a common image retrieval benchmark and demonstrate a significant improvement over the traditional bag-of-words retrieval model.
Similar content being viewed by others
References
Bay H, Ess A, Tuytelaars T, Gool LV (2008) Speeded-up robust features (surf). Comput Vis Image Underst 110(3):346–359
Cao Y, Wang C, Li Z, Zhang L, Zhang L (2010) Spatial-bag-of-features. In: CVPR, pp 3352–3359
Chi M, Zhang P, Zhao Y, Feng R, Xue X (2009) Web image retrieval reranking with multi-view clustering. In: WWW, pp 1189–1190
Chum O, Mikulík A, Perdoch M, Matas J (2011) Total recall ii: query expansion revisited. In: CVPR, pp 889–896
Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: ICCV, pp 1–8
Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Symposium on computational geometry, pp 253–262
Elsayad I, Martinet J, Urruty T, Djeraba C (2012) Toward a higher-level visual representation for content-based image retrieval. Multimed Tools Appl 60(2):455–482
Fischler MA, Bolles RC (1981) Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun ACM 24(6):381–395
Gionis A, Indyk P, Motwani R (1999) Similarity search in high dimensions via hashing. In: VLDB, pp 518–529
Hayter AJ (2007) Probability and Statistics for Engineering and Scientists, 3rd edn. Thomson Brooks/Cole, Belmont, CA
Hsiao JH, Chen MS (2009) Intention-focused active reranking for image object retrieval. In: CIKM, pp 157–166
Huang Y, Zhang J, Zhao Y, Ma D (2012) A new re-ranking method using enhanced pseudo-relevance feedback for content-based medical image retrieval. IEICE Trans 95-D(2):694–698
Jain P, Kulis B, Grauman K (2008) Fast image search for learned metrics. In: CVPR
Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: ECCV (1), pp 304–317
Jégou H, Douze M, Schmid C (2009) Packing bag-of-features. In: ICCV, pp 2357–2364
Jegou H, Douze M, Schmid C (2010) Improving bag-of-features for large scale image search. Int J Comput Vis 87(3):316–336
Jegou H, Douze M, Schmid C, Pérez P (2010) Aggregating local descriptors into a compact image representation. In: CVPR, pp 3304–3311
Jegou H, Harzallah H, Schmid C (2007) A contextual dissimilarity measure for accurate and efficient image search. In: CVPR
Kilinç D, Alpkocak A (2011) An expansion and reranking approach for annotation-based image retrieval from web. Expert Syst. Appl. 38(10):13121–13127
Kulis B, Grauman K (2009) Kernelized locality-sensitive hashing for scalable image search. In: ICCV, pp 2130–2137
Li J, Ma Q, Asano Y, Yoshikawa M (2012) Re-ranking by multi-modal relevance feedback for content-based social image retrieval. In: APWeb, pp 399–410
Lin WH, Jin R, Hauptmann AG (2003) Web image retrieval re-ranking with relevance model. In: Web Intelligence, pp 242–248
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Lu H, Ooi BC, Shen HT, Xue X (2006) Hierarchical indexing structure for efficient similarity search in video retrieval. IEEE Trans Knowl Data Eng 18(11):1544–1559
Mai HT, Kim MH (2012) Integrating similar images to effectively improve image retrieval accuracy. In: ICCM, pp 410–415
Nistér D, Stewénius H (2006) Scalable recognition with a vocabulary tree. In: CVPR (2), pp 2161–2168
Park G, Baek Y, Lee HK (2005) Re-ranking algorithm using post-retrieval clustering for content-based image retrieval. Inf Process Manag 41(2):177–194
Perdoch M, Chum O, Matas J (2009) Efficient representation of local geometry for large scale object retrieval. In: CVPR, pp 9–16
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: CVPR
Qin D, Gammeter S, Bossard L, Quack T, Gool LJV (2011) Hello neighbor: accurate object retrieval with k-reciprocal nearest neighbors. In: CVPR, pp 777–784
Rahman MM, Bhattacharya P (2009) Image retrieval with automatic query expansion based on local analysis in a semantical concept feature space. In: CIVR
Ramaswamy S, Rose K (2011) Adaptive cluster distance bounding for high-dimensional indexing. IEEE Trans Knowl Data Eng 23(6):815–830
Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: ICCV, pp 1470–1477
Wang J, Kumar O, Chang SF (2010) Semi-supervised hashing for scalable image retrieval. In: CVPR, pp 3424–3431
Weber R, Schek HJ, Blott S (1998) A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: VLDB, pp 194–205
Weiss Y, Torralba A, Fergus R (2008) Spectral hashing. In: NIPS, pp 1753–1760
Zhang Y, Jia Z, Chen T (2011) Image retrieval with geometry-preserving visual phrases. In: CVPR, pp 809–816
Acknowledgements
This work was supported by the Brain Korea 21 Project, the Department of Computer Science, KAIST in 2012 and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST) (No. 2012R1A2A2A01046694). The authors also thank anonymous reviewers for valuable comments to improve this work.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mai, H.T., Kim, M.H. Utilizing similarity relationships among existing data for high accuracy processing of content-based image retrieval. Multimed Tools Appl 72, 331–360 (2014). https://doi.org/10.1007/s11042-013-1360-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-013-1360-9