[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

An incremental Hausdorff distance calculation algorithm

Published: 01 May 2011 Publication History

Abstract

The Hausdorff distance is commonly used as a similarity measure between two point sets. Using this measure, a set X is considered similar to Y iff every point in X is close to at least one point in Y. Formally, the Hausdorff distance HausDist(X, Y) can be computed as the Max-Min distance from X to Y, i.e., find the maximum of the distance from an element in X to its nearest neighbor (NN) in Y. Although this is similar to the closest pair and farthest pair problems, computing the Hausdorff distance is a more challenging problem since its Max-Min nature involves both maximization and minimization rather than just one or the other. A traditional approach to computing HausDist(X, Y) performs a linear scan over X and utilizes an index to help compute the NN in Y for each x in X. We present a pair of basic solutions that avoid scanning X by applying the concept of aggregate NN search to searching for the element in X that yields the Hausdorff distance. In addition, we propose a novel method which incrementally explores the indexes of the two sets X and Y simultaneously. As an example application of our techniques, we use the Hausdorff distance as a measure of similarity between two trajectories (represented as point sets). We also use this example application to compare the performance of our proposed method with the traditional approach and the basic solutions. Experimental results show that our proposed method outperforms all competitors by one order of magnitude in terms of the tree traversal cost and total response time.

References

[1]
P. K. Agarwal, S. Har-Peled, M. Sharir, and Y. Wang. Hausdorff distance under translation for points and balls. In SCG, pages 282--291, 2003.
[2]
H. Alt, O. Aichholzer, and G. Rote. Matching shapes with a reference point. In SCG, pages 85--92, 1994.
[3]
H. Alt, B. Behrends, and J. Blömer. Approximate matching of polygonal shapes (extended abstract). In SCG, pages 186--193, 1991.
[4]
C. Andújar, P. Brunet, and D. Ayala. Topology-reducing surface simplification using a discrete solid representation. ACM Trans. Graph., 21(2):88--105, 2002.
[5]
W. G. Aref and I. F. Ilyas. An extensible index for spatial databases. In SSDBM, pages 49--58, 2001.
[6]
F. Aurenhammer. Voronoi diagrams - a survey of a fundamental geometric data structure. ACM Comput. Surv., 23(3):345--405, 1991.
[7]
N. Beckmann, H. Kriegel, R. Schneider, and B. Seeger. The R*-tree: an efficient and robust access method for points and rectangles. In SIGMOD, pages 322--331, 1990.
[8]
N. Beckmann and B. Seeger. A revised R*-tree in comparison with related index structures. In SIGMOD, pages 799--812, 2009.
[9]
D. Berndt and J. Clifford. Using dynamic time warping to find patterns in time series. AAAI Workshop on Knowledge Discovery in Databases, pages 229--248, 1994.
[10]
L. Chen and R. Ng. On the marriage of lp-norms and edit distance. VLDB, pages 792--803, 2004.
[11]
L. Chen, M. T. Özsu, and V. Oria. Robust and fast similarity search for moving object trajectories. SIGMOD, pages 491--502, 2005.
[12]
A. Corral, Y. Manolopoulos, Y. Theodoridis, and M. Vassilakopoulos. Closest pair queries in spatial databases. In SIGMOD, pages 189--200, 2000.
[13]
A. Guttman. R-trees: A dynamic index structure for spatial searching. In SIGMOD, pages 47--57, 1984.
[14]
J. Hershberger and J. Snoeyink. Speeding up the douglas-peucker line-simplification algorithm. In Intl. Symp. on Spatial Data Handling, pages 134--143, 1992.
[15]
G. R. Hjaltason and H. Samet. Incremental distance join algorithms for spatial databases. In SIGMOD, pages 237--248, 1998.
[16]
G. R. Hjaltason and H. Samet. Distance browsing in spatial databases. ACM Trans. Database Syst., 24(2):265--318, 1999.
[17]
D. P. Huttenlocher, K. Kedem, and J. M. Kleinberg. On dynamic voronoi diagrams and the minimum hausdorff distance for point sets under euclidean motion in the plane. In SCG, pages 110--119, 1992.
[18]
D. P. Huttenlocher, G. A. Klanderman, and W. Rucklidge. Comparing images using the hausdorff distance. IEEE Trans. Pattern Anal. Mach. Intell., 15(9):850--863, 1993.
[19]
P. Indyk and S. Venkatasubramanian. Approximate congruence in nearly linear time. In SODA, pages 354--360, 2000.
[20]
E. J. Keogh and C. Ratanamahatana. Exact indexing of dynamic time warping. Knowl. Inf. Syst., 7(3):358--386, 2005.
[21]
P.-F. Marteau. Time warp edit distance with stiffness adjustment for time series matching. IEEE Trans. Pattern Anal. Mach. Intell., 31(2):306--318, 2009.
[22]
D. Papadias, Y. Tao, K. Mouratidis, and C. K. Hui. Aggregate nearest neighbor queries in spatial databases. ACM Trans. Database Syst., 30(2):529--576, 2005.
[23]
N. Roussopoulos, S. Kelley, and F. Vincent. Nearest neighbor queries. In SIGMOD, pages 71--79, 1995.
[24]
H. Samet. Foundations of Multidimensional and Metric Data Structures. Morgan-Kaufmann, San Francisco, 2006.
[25]
H. Shin, B. Moon, and S. Lee. Adaptive multi-stage distance join processing. In SIGMOD, pages 343--354, 2000.
[26]
M. Vlachos, D. Gunopulos, and G. Kollios. Discovering similar multidimensional trajectories. In ICDE, pages 673--684, 2002.
[27]
C. Xia, H. Lu, B. C. Ooi, and J. Hu. Gorder: An efficient method for KNN join processing. In VLDB, pages 756--767, 2004.
[28]
Y. Zheng, Q. Li, Y. Chen, X. Xie, and W.-Y. Ma. Understanding mobility based on gps data. In UbiComp, pages 312--321, 2008.
[29]
Y. Zheng, L. Zhang, X. Xie, and W.-Y. Ma. Mining interesting locations and travel sequences from gps trajectories. In WWW, pages 791--800, 2009.

Cited By

View all
  • (2024)Privacy-preserving Spatial Dataset Search in CloudProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679733(1245-1254)Online publication date: 21-Oct-2024
  • (2023)SQUID: subtrajectory query in trillion-scale GPS databaseThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-022-00777-732:4(887-904)Online publication date: 19-Jan-2023
  • (2023)Federated Trajectory Search via a Lightweight Similarity Computation FrameworkWeb and Big Data10.1007/978-981-97-2390-4_32(469-485)Online publication date: 6-Oct-2023
  • Show More Cited By

Index Terms

  1. An incremental Hausdorff distance calculation algorithm

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 4, Issue 8
    May 2011
    58 pages

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 May 2011
    Published in PVLDB Volume 4, Issue 8

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)33
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 14 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Privacy-preserving Spatial Dataset Search in CloudProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679733(1245-1254)Online publication date: 21-Oct-2024
    • (2023)SQUID: subtrajectory query in trillion-scale GPS databaseThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-022-00777-732:4(887-904)Online publication date: 19-Jan-2023
    • (2023)Federated Trajectory Search via a Lightweight Similarity Computation FrameworkWeb and Big Data10.1007/978-981-97-2390-4_32(469-485)Online publication date: 6-Oct-2023
    • (2022)VREProceedings of the VLDB Endowment10.14778/3554821.355483115:12(3398-3410)Online publication date: 1-Aug-2022
    • (2022)Fast dataset search with earth mover's distanceProceedings of the VLDB Endowment10.14778/3551793.355181115:11(2517-2529)Online publication date: 29-Sep-2022
    • (2022)Representative Routes Discovery from Massive TrajectoriesProceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3534678.3539079(4059-4069)Online publication date: 14-Aug-2022
    • (2022)Indexable sub-trajectory matching using multi-segment approximation: a partition-and-stitch frameworkThe Journal of Supercomputing10.1007/s11227-019-02813-w75:9(6129-6157)Online publication date: 10-Mar-2022
    • (2021)TrajDistLearnProceedings of the 14th ACM SIGSPATIAL International Workshop on Computational Transportation Science10.1145/3486629.3490693(1-9)Online publication date: 2-Nov-2021
    • (2020)Querying Recurrent Convoys over Trajectory DataACM Transactions on Intelligent Systems and Technology10.1145/340073011:5(1-24)Online publication date: 3-Aug-2020
    • (2019)Fast large-scale trajectory clusteringProceedings of the VLDB Endowment10.14778/3357377.335738013:1(29-42)Online publication date: 1-Sep-2019
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media