research-article

Structured metric learning for high dimensional problems

Authors:

Jason V. Davis,

Inderjit S. DhillonAuthors Info & Claims

KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 195 - 203

https://doi.org/10.1145/1401890.1401918

Published: 24 August 2008 Publication History

Get Access

Abstract

The success of popular algorithms such as k-means clustering or nearest neighbor searches depend on the assumption that the underlying distance functions reflect domain-specific notions of similarity for the problem at hand. The distance metric learning problem seeks to optimize a distance function subject to constraints that arise from fully-supervised or semisupervised information. Several recent algorithms have been proposed to learn such distance functions in low dimensional settings. One major shortcoming of these methods is their failure to scale to high dimensional problems that are becoming increasingly ubiquitous in modern data mining applications. In this paper, we present metric learning algorithms that scale linearly with dimensionality, permitting efficient optimization, storage, and evaluation of the learned metric. This is achieved through our main technical contribution which provides a framework based on the log-determinant matrix divergence which enables efficient optimization of structured, low-parameter Mahalanobis distances. Experimentally, we evaluate our methods across a variety of high dimensional domains, including text, statistical software analysis, and collaborative filtering, showing that our methods scale to data sets with tens of thousands or more features. We show that our learned metric can achieve excellent quality with respect to various criteria. For example, in the context of metric learning for nearest neighbor classification, we show that our methods achieve 24% higher accuracy over the baseline distance. Additionally, our methods yield very good precision while providing recall measures up to 20% higher than other baseline methods such as latent semantic analysis.

References

[1]

R. A. Baeza-Yates and B. A. Ribeiro-Neto. Modern Information Retrieval. ACM Press / Addison-Wesley, 1999.

Digital Library

Google Scholar

[2]

S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, March 2004.

Digital Library

Google Scholar

[3]

J. V. Davis, B. Kulis, P. Jain, S. Sra, and I. S. Dhillon. Information-theoretic Metric Learning. In ICML, June 2007.

Digital Library

Google Scholar

[4]

S. C. Deerwester, S. T. Dumais, T. K. Landauer, G.W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.

Crossref

Google Scholar

[5]

R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. Wiley-Interscience Publication, 2000.

Digital Library

Google Scholar

[6]

A. Globerson and S. Roweis. Metric Learning by Collapsing Classes. In NIPS, 2005

Digital Library

Google Scholar

[7]

G.H. Golub and C.F. Van Loan. Matrix Computations. Johns Hopkins University Press, Baltimore, MD, second edition, 1989.

Google Scholar

[8]

J. Ha, C. Rossbach, J. Davis, I. Roy, D. Chen, H. Ramadan, and E. Witchel. Improved Error Reporting for Software that Uses Black Box Components. In Programming Language Design and Implementation (PLDI), 2007.

Digital Library

Google Scholar

[9]

B. Kulis, M. Sustik, and I. S. Dhillon. Learning Low-rank Kernel Matrices. In ICML, 2006.

Digital Library

Google Scholar

[10]

K. Q. Weinberger, J. Blitzer, and L. K. Saul. Distance Metric Learning for Large Margin Nearest Neighbor Classification. In NIPS, 2005.

Digital Library

Google Scholar

[11]

E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell. Distance metric learning with application to clustering with side-information. In NIPS, 2002.

Digital Library

Google Scholar

Cited By

View all

Pesántez GGuamán WCórdova JTorres MBenalcazar P(2024)Reinforcement Learning for Efficient Power Systems Planning: A Review of Operational and Expansion StrategiesEnergies10.3390/en1709216717:9(2167)Online publication date: 1-May-2024
https://doi.org/10.3390/en17092167
Hosny AReda S(2023)Automatic MILP solver configuration by learning problem similaritiesAnnals of Operations Research10.1007/s10479-023-05508-x339:1-2(909-936)Online publication date: 14-Jul-2023
https://doi.org/10.1007/s10479-023-05508-x
James NMenzies M(2021)A new measure between sets of probability distributions with applications to erratic financial behaviorJournal of Statistical Mechanics: Theory and Experiment10.1088/1742-5468/ac3d912021:12(123404)Online publication date: 23-Dec-2021
https://doi.org/10.1088/1742-5468/ac3d91
Show More Cited By

Index Terms

Structured metric learning for high dimensional problems
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing

Recommendations

Learning neighborhoods for metric learning
ECMLPKDD'12: Proceedings of the 2012th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I

Metric learning methods have been shown to perform well on different learning tasks. Many of them rely on target neighborhood relationships that are computed in the original feature space and remain fixed throughout learning. As a result, the learned ...
Nonlinear adaptive distance metric learning for clustering
KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining

A good distance metric is crucial for many data mining tasks. To learn a metric in the unsupervised setting, most metric learning algorithms project observed data to a low-dimensional manifold, where geometric relationships such as pairwise distances ...
Volume of Metric Balls in High-Dimensional Complex Grassmann Manifolds

Volume of metric balls relates to rate-distortion theory and packing bounds on codes. In this paper, the volume of balls in complex Grassmann manifolds is evaluated for an arbitrary radius. The ball is defined as a set of hyperplanes of a fixed ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining

August 2008

1116 pages

ISBN:9781605581934

DOI:10.1145/1401890

General Chair:
Ying Li
Microsoft adCenter Labs
,
Program Chairs:
Bing Liu
University of Illinois at Chicago
,
Sunita Sarawagi
Indian Institute of Technology, Bombay

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2008

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

KDD08

Sponsor:

KDD08: The 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 24 - 27, 2008

Nevada, Las Vegas, USA

Acceptance Rates

KDD '08 Paper Acceptance Rate 118 of 593 submissions, 20%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

45
Total Citations
View Citations
808
Total Downloads

Downloads (Last 12 months)6
Downloads (Last 6 weeks)0

Reflects downloads up to 13 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Pesántez GGuamán WCórdova JTorres MBenalcazar P(2024)Reinforcement Learning for Efficient Power Systems Planning: A Review of Operational and Expansion StrategiesEnergies10.3390/en1709216717:9(2167)Online publication date: 1-May-2024
https://doi.org/10.3390/en17092167
Hosny AReda S(2023)Automatic MILP solver configuration by learning problem similaritiesAnnals of Operations Research10.1007/s10479-023-05508-x339:1-2(909-936)Online publication date: 14-Jul-2023
https://doi.org/10.1007/s10479-023-05508-x
James NMenzies M(2021)A new measure between sets of probability distributions with applications to erratic financial behaviorJournal of Statistical Mechanics: Theory and Experiment10.1088/1742-5468/ac3d912021:12(123404)Online publication date: 23-Dec-2021
https://doi.org/10.1088/1742-5468/ac3d91
Ali BMoriyama KNumao MFukui K(2020)Reinforcement Learning based Evolutionary Metric Filtering for High Dimensional Problems2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)10.1109/ICMLA51294.2020.00045(226-233)Online publication date: Dec-2020
https://doi.org/10.1109/ICMLA51294.2020.00045
Deng XWu WWang F(2020)Deep Metric Learning for text data based on Triplet NetworkIOP Conference Series: Materials Science and Engineering10.1088/1757-899X/806/1/012038806(012038)Online publication date: 5-May-2020
https://doi.org/10.1088/1757-899X/806/1/012038
Yao DZhao PPham TCong G(2018)High-dimensional similarity learning via dual-sparse random projectionProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3305078(3005-3011)Online publication date: 13-Jul-2018
https://dl.acm.org/doi/10.5555/3304889.3305078
Lai SZheng WHu JZhang J(2018)Global-Local Temporal Saliency Action PredictionIEEE Transactions on Image Processing10.1109/TIP.2017.275114527:5(2272-2285)Online publication date: May-2018
https://doi.org/10.1109/TIP.2017.2751145
(2018)Discriminative Transformation Learning for Fuzzy Sparse Subspace ClusteringIEEE Transactions on Cybernetics10.1109/TCYB.2017.272954248:8(2218-2231)Online publication date: Aug-2018
https://doi.org/10.1109/TCYB.2017.2729542
Zhang JZhang LSingh SMarkovitch S(2017)Efficient stochastic optimization for low-rank distance metric learningProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298239.3298375(933-939)Online publication date: 4-Feb-2017
https://dl.acm.org/doi/10.5555/3298239.3298375
St.Amand JHuan JMatwin SYu SFarooq F(2017)Sparse Compositional Local Metric LearningProceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/3097983.3098153(1097-1104)Online publication date: 13-Aug-2017
https://dl.acm.org/doi/10.1145/3097983.3098153
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Learning neighborhoods for metric learning

Nonlinear adaptive distance metric learning for clustering

Volume of Metric Balls in High-Dimensional Complex Grassmann Manifolds