[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Optimizing Nonindexed Join Processing in Flash Storage-Based Systems

Published: 01 July 2013 Publication History

Abstract

Flash memory-based disks (or simply flash disks) have been widely used in today's computer systems. With their continuously increasing capacity and dropping price, it is envisioned that some database systems will operate on flash disks in the near future. However, the I/O characteristics of flash disks are different from those of magnetic hard disks. Motivated by this, we study the core of query processing in row-based database systems—join processing—on flash storage media. More specifically, we propose a new framework, called DigestJoin, to optimize nonindexed join processing by reducing the intermediate result size and exploiting fast random reads of flash disks. DigestJoin consists of two phases: 1) projecting the join attributes followed by a join on the projected attributes, and 2) fetching the full tuples that satisfy the join to produce the final join results. While the problem of tuple/page fetching with the minimum I/O cost (in the second phase) is intractable, we propose three heuristic page-fetching strategies for flash disks. We have implemented DigestJoin and conducted extensive experiments on a real flash disk. Our evaluation results based on TPC-H data sets show that DigestJoin clearly outperforms the traditional sort-merge join and hash join under a wide range of system configurations.

Cited By

View all
  • (2017)Advanced Block Nested Loop Join for Extending SSD LifetimeIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.265180329:4(743-756)Online publication date: 1-Apr-2017

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Computers
IEEE Transactions on Computers  Volume 62, Issue 7
July 2013
202 pages

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 July 2013

Author Tags

  1. Ash
  2. Central Processing Unit
  3. Hard disks
  4. Indexes
  5. Query processing
  6. Schedules
  7. flash memory
  8. joins
  9. relational databases

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Advanced Block Nested Loop Join for Extending SSD LifetimeIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.265180329:4(743-756)Online publication date: 1-Apr-2017

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media