[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A multiresolution manifold distance for invariant image similarity

Published: 01 February 2005 Publication History

Abstract

Accounting for spatial image transformations is a requirement for multimedia problems such as video classification and retrieval, face/object recognition or the creation of image mosaics from video sequences. We analyze a transformation invariant metric recently proposed in the machine learning literature to measure the distance between image manifolds - the tangent distance (TD) - and show that it is closely related to alignment techniques from the motion analysis literature. Exposing these relationships results in benefits for the two domains. On one hand, it allows leveraging on the knowledge acquired in the alignment literature to build better classifiers. On the other, it provides a new interpretation of alignment techniques as one component of a decomposition that has interesting properties for the classification of video. In particular, we embed the TD into a multiresolution framework that makes it significantly less prone to local minima. The new metric - multiresolution tangent distance (MRTD) - can be easily combined with robust estimation procedures, and exhibits significantly higher invariance to image transformations than the TD and the Euclidean distance (ED). For classification, this translates into significant improvements in face recognition accuracy. For video characterization, it leads to a decomposition of image dissimilarity into "differences due to camera motion" plus "differences due to scene activity" that is useful for classification. Experimental results on a movie database indicate that the distance could be used as a basis for the extraction of semantic primitives such as action and romance.

Cited By

View all
  1. A multiresolution manifold distance for invariant image similarity

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image IEEE Transactions on Multimedia
      IEEE Transactions on Multimedia  Volume 7, Issue 1
      February 2005
      182 pages

      Publisher

      IEEE Press

      Publication History

      Published: 01 February 2005

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 20 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2017)Retrieval on Parametric Shape CollectionsACM Transactions on Graphics10.1145/3072959.298361836:4(1)Online publication date: 16-Jul-2017
      • (2017)Retrieval on Parametric Shape CollectionsACM Transactions on Graphics10.1145/298361836:1(1-14)Online publication date: 26-Jan-2017
      • (2016)Local manifold distance based on neighborhood graph reorderingPattern Recognition10.1016/j.patcog.2015.12.00653:C(195-211)Online publication date: 1-May-2016
      • (2008)Image retrievalACM Computing Surveys10.1145/1348246.134824840:2(1-60)Online publication date: 8-May-2008
      • (2007)Recognition of digital images of the human face at ultra low resolution via illumination spacesProceedings of the 8th Asian conference on Computer vision - Volume Part II10.5555/1775728.1775814(733-743)Online publication date: 18-Nov-2007
      • (2007)Learning the Lie Groups of Visual InvarianceNeural Computation10.1162/neco.2007.19.10.266519:10(2665-2693)Online publication date: 1-Oct-2007
      • (2007)Recognition of Digital Images of the Human Face at Ultra Low Resolution Via Illumination SpacesComputer Vision – ACCV 200710.1007/978-3-540-76390-1_72(733-743)Online publication date: 18-Nov-2007
      • (2006)Real-time computerized annotation of picturesProceedings of the 14th ACM international conference on Multimedia10.1145/1180639.1180841(911-920)Online publication date: 23-Oct-2006
      • (2005)Content-based image retrievalProceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval10.1145/1101826.1101866(253-262)Online publication date: 10-Nov-2005

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media