[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1459359.1459389acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Locality condensation: a new dimensionality reduction method for image retrieval

Published: 26 October 2008 Publication History

Abstract

Content-based image similarity search plays a key role in multimedia retrieval. Each image is usually represented as a point in a high-dimensional feature space. The key challenge of searching similar images from a large database is the high computational overhead due to the "curse of dimensionality". Reducing the dimensionality is an important means to tackle the problem. In this paper, we study dimensionality reduction for top-k image retrieval. Intuitively, an effective dimensionality reduction method should not only preserve the close locations of similar images (or points), but also separate those dissimilar ones far apart in the reduced subspace. Existing dimensionality reduction methods mainly focused on the former. We propose a novel idea called Locality Condensation (LC) to not only preserve localities determined by neighborhood information and their global similarity relationship, but also ensure that different localities will not invade each other in the low-dimensional subspace. To generate non-overlapping localities in the subspace, LC first performs an elliptical condensation, which condenses each locality with an elliptical shape into a more compact hypersphere to enlarge the margins among different localities and estimate the projection in the subspace for overlap analysis. Through a convex optimization, LC further performs a scaling condensation on the obtained hyperspheres based on their projections in the subspace with minimal condensation degrees. By condensing the localities effectively, the potential overlaps among different localities in the low-dimensional subspace are prevented. Consequently, for similarity search in the subspace, the number of false hits (i.e., distant points that are falsely retrieved) will be reduced. Extensive experimental comparisons with existing methods demonstrate the superiority of our proposal.

References

[1]
M. Belkin and P. Niyogi. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation, 15(6):1373--1396, 2003.
[2]
Y. Bengio, J.-F. Paiement, P. Vincent, O. Delalleau, N. L. Roux, and M. Ouimet. Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In NIPS, 2003.
[3]
C. Böhm, S. Berchtold, and D. A. Keim. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Comput. Surv., 33(3):322--373, 2001.
[4]
S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
[5]
D. Cai and X. He. Orthogonal locality preserving indexing. In SIGIR, pages 3--10, 2005.
[6]
D. Cai, X. He, and J. Han. Spectral regression: a unified subspace learning framework for content-based image retrieval. In ACM Multimedia, pages 403--412, 2007.
[7]
K. Chakrabarti, E. J. Keogh, S. Mehrotra, and M. J. Pazzani. Locally adaptive dimensionality reduction for indexing large time series databases. ACM Trans. Database Syst., 27(2):188--228, 2002.
[8]
K. Chakrabarti and S. Mehrotra. Local dimensionality reduction: A new approach to indexing high dimensional spaces. In VLDB, pages 89--100, 2000.
[9]
V. de Silva and J. B. Tenenbaum. Global versus local methods in nonlinear dimensionality reduction. In NIPS, pages 705--712, 2002.
[10]
S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.
[11]
C. Elkan. Using the triangle inequality to accelerate k-means. In ICML, pages 147--153, 2003.
[12]
V. Gaede and O. Günther. Multidimensional access methods. ACM Comput. Surv., 30(2):170--231, 1998.
[13]
X. He, D. Cai, and J. Han. Learning a maximum margin subspace for image retrieval. IEEE Trans. Knowl. Data Eng., 20(2):189--201, 2008.
[14]
X. He, D. Cai, H. Liu, and W.-Y. Ma. Locality preserving indexing for document representation. In SIGIR, pages 96--103, 2004.
[15]
X. He, W. Min, D. Cai, and K. Zhou. Laplacian optimal design for image retrieval. In SIGIR, pages 119--126, 2007.
[16]
X. He and P. Niyogi. Locality preserving projections. In NIPS, 2003.
[17]
T. Hofmann. Probabilistic latent semantic indexing. In SIGIR, pages 50--57, 1999.
[18]
A. Hyvärinen, J. Karhunen, and E. Oja. Independent Component Analysis. John Wiley & Sons, 2001.
[19]
H. V. Jagadish, B. C. Ooi, K.-L. Tan, C. Yu, and R. Zhang. idistance: An adaptive b+-tree based indexing method for nearest neighbor search. ACM Trans. Database Syst., 30(2):364--397, 2005.
[20]
I. T. Jolliffe. Principal Component Analysis. Springer-Verlag, second edition, 2002.
[21]
D. A. Keim and B. Bustos. Similarity search in multimedia databases. In ICDE, page 873, 2004.
[22]
K. Mikolajczyk and J. Matas. Improving descriptors for fast tree matching by optimal linear projection. In ICCV, 2007.
[23]
S. T. Roweis and L. K. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323--2326, 2000.
[24]
H. T. Shen, X. Zhou, and A. Zhou. An adaptive and dynamic dimensionality reduction method for high-dimensional indexing. VLDB J., 16(2):219--234, 2007.
[25]
J. B. Tenenbaum, V. d. Silva, and J. C. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319--2323, 2000.
[26]
M. E. Tipping and C. M. Bishop. Probabilistic principal component analysis. Journal of The Royal Statistical Society Series B, 61(3):611--622, 1999.
[27]
R. Weber, H.-J. Schek, and S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In VLDB, pages 194--205, 1998.
[28]
Y.-L. Wu, D. Agrawal, and A. E. Abbadi. A comparison of dft and dwt based similarity search in time-series databases. In CIKM, pages 488--495, 2000.
[29]
A. Yavlinsky, E. Schofield, and S. M. Rüger. Automated image annotation using global features and robust nonparametric density estimation. In CIVR, pages 507--517, 2005.
[30]
F. W. Young and R. M. Hamer. Multidimensional Scaling: History, Theory and Applications. Lawrence Erlbaum Associates, 1987.
[31]
J. Yu and Q. Tian. Learning image manifolds by semantic subspace projection. In ACM Multimedia, pages 297--306, 2006.
[32]
X. Zheng, D. Cai, X. He, W.-Y. Ma, and X. Lin. Locality preserving clustering for image database. In ACM Multimedia, pages 885--891, 2004.

Cited By

View all
  • (2024)Diversified Semantic Distribution Matching for Dataset DistillationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680900(7542-7550)Online publication date: 28-Oct-2024
  • (2019)Hierarchical Multi-Clue Modelling for POI Popularity Prediction with Heterogeneous Tourist InformationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2018.284219031:4(757-768)Online publication date: 1-Apr-2019
  • (2018)Indexing Techniques for Multimedia Data RetrievalEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_80631(1890-1895)Online publication date: 7-Dec-2018
  • Show More Cited By

Index Terms

  1. Locality condensation: a new dimensionality reduction method for image retrieval

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '08: Proceedings of the 16th ACM international conference on Multimedia
    October 2008
    1206 pages
    ISBN:9781605583037
    DOI:10.1145/1459359
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 October 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dimensionality reduction
    2. locality condensation
    3. top-k image retrieval

    Qualifiers

    • Research-article

    Conference

    MM08
    Sponsor:
    MM08: ACM Multimedia Conference 2008
    October 26 - 31, 2008
    British Columbia, Vancouver, Canada

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Diversified Semantic Distribution Matching for Dataset DistillationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680900(7542-7550)Online publication date: 28-Oct-2024
    • (2019)Hierarchical Multi-Clue Modelling for POI Popularity Prediction with Heterogeneous Tourist InformationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2018.284219031:4(757-768)Online publication date: 1-Apr-2019
    • (2018)Indexing Techniques for Multimedia Data RetrievalEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_80631(1890-1895)Online publication date: 7-Dec-2018
    • (2018)Principal Component AnalysisEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_540(2800-2800)Online publication date: 7-Dec-2018
    • (2017)Incremental Accelerated Kernel Discriminant AnalysisProceedings of the 25th ACM international conference on Multimedia10.1145/3123266.3123401(1575-1583)Online publication date: 23-Oct-2017
    • (2017)Principal Component AnalysisEncyclopedia of Database Systems10.1007/978-1-4899-7993-3_540-2(1-2)Online publication date: 14-Aug-2017
    • (2015)Benchmarking Lymph Node Metastasis ClassificationFor Gastric Cancer StagingBiomedical Image Understanding10.1002/9781118715321.ch10(361-392)Online publication date: 13-Feb-2015
    • (2014)SK-LSHProceedings of the VLDB Endowment10.14778/2732939.27329477:9(745-756)Online publication date: 1-May-2014
    • (2014)The influence of image descriptors’ dimensions’ value cardinalities on large-scale similarity searchInternational Journal of Multimedia Information Retrieval10.1007/s13735-014-0073-94:3(187-204)Online publication date: 26-Nov-2014
    • (2013)MSIDXIEEE Transactions on Multimedia10.1109/TMM.2013.224798915:6(1415-1430)Online publication date: 1-Oct-2013
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media