[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2484838.2484855acmotherconferencesArticle/Chapter ViewAbstractPublication PagesssdbmConference Proceedingsconference-collections
research-article

GIPSY: joining spatial datasets with contrasting density

Published: 29 July 2013 Publication History

Abstract

Many scientific and geographical applications rely on the efficient execution of spatial joins. Past research has produced several efficient spatial join approaches and while each of them can join two datasets, the problem of efficiently joining two datasets with contrasting density, i.e., with the same spatial extent but with a wildly different number of spatial elements, has so far been overlooked. State-of-the-art data-oriented spatial join approaches (e.g., based on the R-Tree) suffer from degraded performance due to overlap, whereas space-oriented approaches excessively read data from disk.
In this paper we develop GIPSY, a novel approach for the spatial join of two datasets with contrasting density. GIPSY uses fine-grained data-oriented partitioning and thus only retrieves the data needed for the join. At the same time it avoids the overlap related problems associated with data-oriented partitioning by using a crawling approach, i.e., without using a hierarchical tree. Our experiments show that GIPSY outperforms state-of-the-art disk-based spatial join algorithms by a factor of 2 to 18 and is particularly efficient when joining a dense dataset with several sparse datasets.

References

[1]
W. G. Aref and H. Samet. Cascaded Spatial Join Algorithms with Spatially Sorted Output. In International Workshop on Advances in Geographic Information Systems (AGIS '96).
[2]
L. Arge, M. D. Berg, H. Haverkort, and K. Yi. The Priority R-tree: A Practically Efficient and Worst-case Optimal R-tree. ACM Transactions on Algorithms, 4(1):1--30, 2008.
[3]
L. Arge, O. Procopiuc, S. Ramaswamy, T. Suel, and J. S. Vitter. Scalable Sweeping-Based Spatial Join. In VLDB '98.
[4]
N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R*-tree: an Efficient and Robust Access Method for Points and Rectangles. In SIGMOD '90.
[5]
T. Brinkhoff, H.-P. Kriegel, and B. Seeger. Efficient Processing of Spatial Joins using R-Trees. In SIGMOD '93.
[6]
J.-P. Dittrich and B. Seeger. Data Redundancy and Duplicate Detection in Spatial Join Processing. In ICDE 2000.
[7]
R. Elmasri and S. B. Navathe. Fundamentals of Database Systems. Addison Wesley, 3rd edition edition, 2000.
[8]
A. Farris, A. Sharma, C. Niedermayr, D. Brat, D. Foran, F. Wang, J. Saltz, J. Kong, L. Cooper, T. Oh, T. Kurc, T. Pan, and W. Chen. A Data Model and Database for High-resolution Pathology Analytical Image Informatics. Journal of Pathology Informatics, 2(1):32, 2011.
[9]
Y. J. Garcia, M. A. Lopez, and S. T. Leutenegger. A Greedy Algorithm for Bulk Loading R-trees. In International Workshop on Advances in Geographic Information Systems (AGIS '98).
[10]
A. Guttman. R-trees: a Dynamic Index Structure for Spatial Searching. In SIGMOD '84.
[11]
D. Hilbert. Über die stetige Abbildung einer Linie auf ein Flächenstück. Mathematische Annalen, 38:459--460, 1891.
[12]
E. H. Jacox and H. Samet. Spatial Join Techniques. ACM Transactions on Database Systems, 32(1):7, 2007.
[13]
I. Kamel and C. Faloutsos. On Packing R-trees. In CIKM '93.
[14]
N. Koudas and K. C. Sevcik. Size Separation Spatial Join. In SIGMOD '97.
[15]
J. Kozloski, K. Sfyrakis, S. Hill, F. Schurmann, C. Peck, and H. Markram. Identifying, tabulating, and analyzing contacts between branched neuron morphologies. IBM Journal of Research and Development, 52(1.2):43--55, 2008.
[16]
S. T. Leutenegger, M. A. Lopez, and J. Edgington. STR: a Simple and Efficient Algorithm for R-tree Packing. In ICDE '97.
[17]
M.-L. Lo and C. V. Ravishankar. Spatial Hash-joins. In SIGMOD '96.
[18]
M.-L. Lo and C. V. Ravishankar. Spatial Joins Using Seeded Trees. In International Conference on Management of Data (SIGMOD '94).
[19]
N. Mamoulis and D. Papadias. Slot Index Spatial Join. IEEE Transactions on Knowledge and Data Engineering, 15(1), 2003.
[20]
H. Markram. The Blue Brain Project. Nature Reviews Neuroscience, 7(2):153--160, 2006.
[21]
S. Nobari, F. Tauheed, T. Heinis, P. Karras, S. Bressan, and A. Ailamaki. TOUCH: In-Memory Spatial Join by Hierarchical Data-Oriented Partitioning. In SIGMOD '13.
[22]
S. Papadomanolakis, A. Ailamaki, J. C. Lopez, T. Tu, D. R. O'Hallaron, and G. Heber. Efficient Query Processing on Unstructured Tetrahedral Meshes. In SIGMOD '06.
[23]
J. M. Patel and D. J. DeWitt. Partition Based Spatial-Merge Join. In SIGMOD '96.
[24]
T. K. Sellis, N. Roussopoulos, and C. Faloutsos. The R+-Tree: A Dynamic Index for Multi-Dimensional Objects. In VLDB '87.
[25]
F. Tauheed, L. Biveinis, T. Heinis, F. Schürmann, H. Markram, and A. Ailamaki. Accelerating range queries for brain simulations. In ICDE '12.
[26]
M. Ubell. The Montage Extensible DataBlade Architecture. In SIGMOD '94.

Cited By

View all
  • (2019)Spatial joinsSIGSPATIAL Special10.1145/3355491.335549411:1(13-21)Online publication date: 5-Aug-2019
  • (2016)GCMFProceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/2996913.2996982(1-10)Online publication date: 31-Oct-2016

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
SSDBM '13: Proceedings of the 25th International Conference on Scientific and Statistical Database Management
July 2013
401 pages
ISBN:9781450319218
DOI:10.1145/2484838
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 July 2013

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

SSDBM '13

Acceptance Rates

Overall Acceptance Rate 56 of 146 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Spatial joinsSIGSPATIAL Special10.1145/3355491.335549411:1(13-21)Online publication date: 5-Aug-2019
  • (2016)GCMFProceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/2996913.2996982(1-10)Online publication date: 31-Oct-2016

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media