Demonstration. DOI: 10.1145/2949689.2949707

Selective Scan for Filter Operator of SciDB

Published: 18 July 2016

Abstract

Interest in analyzing scientific data generated by observations and experiments has grown in recent years. SciDB, a multi-dimensional array-based DBMS, has been proposed to manage such data efficiently. When SciDB processes a query with where predicates, it internally uses the filter operator to produce a result array that matches the predicates. Most queries for scientific data analysis use spatial information. However, SciDB's filter operator reads all of the data, taking into account neither the characteristics of array-based DBMSs nor the spatial information in the query. In this demo, we present an efficient query processing scheme that exploits the characteristics of array-based data by using coordinates. It uses a selective scan that retrieves only the data within the coordinate range that satisfies the query conditions. In our experiments, the selective scan is up to 30x faster than the original scan. We demonstrate that our implementation of the filter operator significantly reduces the processing time of a selection query and enables SciDB to handle massive amounts of scientific data in a more scalable manner.
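The contrast between a full scan and a coordinate-driven selective scan can be illustrated with a small sketch. This is not SciDB's implementation and does not use SciDB's API; it is a toy model that assumes a dense 2-D array with origin (0, 0), split into regular chunks stored in a dictionary keyed by chunk origin. The names `make_chunks`, `full_scan`, and `selective_scan` are hypothetical and exist only for illustration.

```python
def make_chunks(shape, chunk, value_fn):
    """Split a dense 2-D array into regular chunks keyed by chunk origin."""
    chunks = {}
    for oi in range(0, shape[0], chunk[0]):
        for oj in range(0, shape[1], chunk[1]):
            cells = {}
            for i in range(oi, min(oi + chunk[0], shape[0])):
                for j in range(oj, min(oj + chunk[1], shape[1])):
                    cells[(i, j)] = value_fn(i, j)
            chunks[(oi, oj)] = cells
    return chunks


def in_range(coord, lo, hi):
    """True if coord lies inside the inclusive box [lo, hi]."""
    return all(lo[d] <= coord[d] <= hi[d] for d in range(2))


def full_scan(chunks, lo, hi):
    """Baseline: read every chunk and test every cell against the range."""
    chunks_read = 0
    result = {}
    for cells in chunks.values():
        chunks_read += 1
        for coord, value in cells.items():
            if in_range(coord, lo, hi):
                result[coord] = value
    return result, chunks_read


def selective_scan(chunks, chunk, lo, hi):
    """Read only the chunks whose coordinate box overlaps [lo, hi].

    Assumes non-negative coordinates and chunk origins at multiples
    of the chunk size, so overlapping origins can be computed directly.
    """
    chunks_read = 0
    result = {}
    first = [(lo[d] // chunk[d]) * chunk[d] for d in range(2)]
    last = [(hi[d] // chunk[d]) * chunk[d] for d in range(2)]
    for oi in range(first[0], last[0] + 1, chunk[0]):
        for oj in range(first[1], last[1] + 1, chunk[1]):
            cells = chunks.get((oi, oj))
            if cells is None:  # range extends past the array boundary
                continue
            chunks_read += 1
            for coord, value in cells.items():
                if in_range(coord, lo, hi):
                    result[coord] = value
    return result, chunks_read


# Toy data: a 100 x 100 array in 10 x 10 chunks, value = i * 100 + j.
chunk_size = (10, 10)
array_chunks = make_chunks((100, 100), chunk_size, lambda i, j: i * 100 + j)
lo, hi = (20, 30), (34, 45)

full_result, full_read = full_scan(array_chunks, lo, hi)
sel_result, sel_read = selective_scan(array_chunks, chunk_size, lo, hi)
print("same result:", full_result == sel_result)
print("chunks read (full scan):", full_read)
print("chunks read (selective scan):", sel_read)
```

In this toy setup both scans return the same cells, but the selective scan touches 4 of the 100 chunks. The speedup reported in the abstract comes from the authors' own experiments on SciDB and should not be inferred from this sketch.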

Cited By

  • (2022) 2 mm: A new technique for sorting data. Theoretical Computer Science. DOI: 10.1016/j.tcs.2022.01.037. Online publication date: Jan-2022
  • (2022) On the performance limits of thread placement for array databases in non-uniform memory architectures. Computing 105:5 (1059-1075). DOI: 10.1007/s00607-021-01043-4. Online publication date: 17-Jan-2022
  • (2021) Performance Analysis of Array Database Systems in Non-Uniform Memory Architecture. 2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (169-176). DOI: 10.1109/PDP52278.2021.00034. Online publication date: Mar-2021

Published In

SSDBM '16: Proceedings of the 28th International Conference on Scientific and Statistical Database Management
July 2016
290 pages
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Array Database Model
  2. Performance
  3. Query optimization

Qualifiers

  • Demonstration
  • Research
  • Refereed limited

Conference

SSDBM '16

Acceptance Rates

Overall Acceptance Rate 56 of 146 submissions, 38%

Bibliometrics

Article Metrics

  • Downloads (Last 12 months): 5
  • Downloads (Last 6 weeks): 0
Reflects downloads up to 17 Dec 2024
