[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

Optimizing database architecture for the new bottleneck: memory access

Published: 01 December 2000 Publication History

Abstract

In the past decade, advances in the speed of commodity CPUs have far out-paced advances in memory latency. Main-memory access is therefore increasingly a performance bottleneck for many computer applications, including database systems. In this article, we use a simple scan test to show the severe impact of this bottleneck. The insights gained are translated into guidelines for database architecture, in terms of both data structures and algorithms. We discuss how vertically fragmented data structures optimize cache performance on sequential data access. We then focus on equi-join, typically a random-access operation, and introduce radix algorithms for partitioned hash-join. The performance of these algorithms is quantified using a detailed analytical model that incorporates memory access cost. Experiments that validate this model were performed on the Monet database system. We obtained exact statistics on events such as TLB misses and L1 and L2 cache misses by using hardware performance counters found in modern CPUs. Using our cost model, we show how the carefully tuned memory access pattern of our radix algorithms makes them perform well, which is confirmed by experimental results.

References

[1]
{AvdBF+92} Apers P.M.G., van den Berg C.A., Flokstra J., Grefen P.W.P.J., Kersten M., Wilschut A.N. PRISMA/DB: A parallel main memory relational DBMS. IEEE Transactionson Knowledge and Data Engineering 4(6): 541-554, December 1992.
[2]
{BK95} Boncz P., Kersten M. Monet: An impressionist sketch of an advanced database system. In: Proc. Basque Int. Workshop on Information Technology, San Sebastian, Spain, July 1995.
[3]
{BK99} Boncz P., Kersten M. MIL primitives for querying a fragmented world. The VLDB Journal 8(2): 101-119, Oct 1999.
[4]
{BMK99} Boncz P., Manegold S., Kersten M. Database architecture optimized for the new bottleneck: memory access. In: Proc. Int. Conf. on Very Large Data Bases, pp. 54- 65, Edinburgh, Scotland, UK, September 1999.
[5]
{BQK96} Boncz P., Quak W., Kersten M. Monet and its geographical extensions: a novel approach to high-performance GIS processing. In: Proc. Int. Conf. on Extending Database Technology, pp. 147-166, Avignon, France, June 1996.
[6]
{BRK98} Boncz P., Rühl T., Kwakkel F. The drill down benchmark. In: Proc. Int. Conf. on Very Large Data Bases, pp. 628-632, New York, USA, June 1998.
[7]
{BWK98} Boncz P., Wilschut A.N., Kersten M. Flattening an object algebra to provide performance. In: Proc IEEE Int. Conf. on Data Engineering, pp. 568-577, Orlando, Fla., USA, February 1998.
[8]
{CK85} Copeland G.P., Khoshafian S.A decomposition storage model. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 268-279, Austin, Tex., USA, May 1985.
[9]
{Knu68} Knuth D.E. The art of computer programming, vol. 1. Addison-Wesley, Reading, Mass., USA, 1968.
[10]
{LC86} Lehman T.J., Carey M.J. A study of index structures for main memory database management systems. In: Proc. Int. Conf. on Very Large Data Bases, pp. 294- 303, Kyoto, Japan, August 1986.
[11]
{LN96} Listgarten S., Neimat M.-A. Modelling costs for a MM-DBMS. In: Proc. In.t Workshop on Real-Time Databases, Issues and Applications, pp. 72-78, Newport Beach, Calif., USA, March 1996.
[12]
{MBK99} Manegold S., Boncz P., Kersten M. Optimizing main-memory join on modern hardware. Technical Report INS-R9912, CWI, Amsterdam, The Netherlands, October 1999.
[13]
{MBK00} Manegold S., Boncz P.A., Kersten M.L. What happens during a join? - Dissecting CPU and memory optimization effects. In: Proc. Int. Conf. on Very Large Data Bases, Cairo, Egypt, September 2000, pp. 339-350.
[14]
{McC95} J.D. McCalpin. Memory bandwidth and machine balance in current high performance computers. IEEE Technical Committee on Computer Architecture Newsletter, December 1995.
[15]
{MKW+98} McKee S., Klenke R., Wright K., Wulf W., Salinas M., Aylor J., Batson A. Smarter memory: improving bandwidth for streamed references. IEEE Computer 31(7): 54-63, July 1998.
[16]
{Mow94} Mowry T.C. Tolerating latency through software-controlled data prefetching. PhD thesis, Stanford University, Computer Science Department, 1994.
[17]
{NBC+94} Nyberg C., Barclay T., Cvetanovic Z., Gray J., Lomet D. AlphaSort: a RISC machine sort. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 233- 242, Minneapolis, Minn., USA, May 1994.
[18]
{Ram96} Rambus Technologies, Inc. Direct rambus technology disclosure, 1996. http://www.rambus.com/docs/drtechov.pdf.
[19]
{Ron98} Ronström M. Design and Modeling of a parallel data server for telecom applications. PhD thesis, Linköping University, Sweden, 1998.
[20]
{Sil97} Silicon Graphics, Inc., Mountain View, Calif. Performance Tuning and Optimization for Origin2000 and Onyx2, January 1997.
[21]
{SKN94} Shatdal A., Kant C., Naughton J. Cache conscious algorithms for relational query processing. In: Proc. Int. Conf. on Very Large Data Bases, pp. 510-512, Santiago, Chile, September 1994.
[22]
{SLD97} SLDRAM Inc. SyncLink DRAM Whitepaper, 1997. http://www.sldram.com/Documents/SL- DRAMwhite970910.pdf.
[23]
{Val87} Valduriez P. Join indices. ACM Transactions on Database Systems 12(2): 218-246, June 1987.
[24]
{WK90} Whang K.-Y., Krishnamurthy R. Query Optimization in a Memory-Resident Domain Relational Calculus Database System. ACM Transactions on Database Systems 15(1): 67-95, March 1990.

Cited By

View all
  • (2024)Simple, Efficient, and Robust Hash Tables for Join ProcessingProceedings of the 20th International Workshop on Data Management on New Hardware10.1145/3662010.3663442(1-9)Online publication date: 10-Jun-2024
  • (2023)Cracking-Like Join for Trusted Execution EnvironmentsProceedings of the VLDB Endowment10.14778/3598581.359860216:9(2330-2343)Online publication date: 10-Jul-2023
  • (2023)Microarchitectural Analysis of Graph BI Queries on RDBMSProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595321(102-106)Online publication date: 18-Jun-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image The VLDB Journal — The International Journal on Very Large Data Bases
The VLDB Journal — The International Journal on Very Large Data Bases  Volume 9, Issue 3
December 2000
104 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 December 2000

Author Tags

  1. Decomposed storage model
  2. Implementation techniques
  3. Join algorithms
  4. Main-memory databases
  5. Memory access optimization
  6. Query processing

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)39
  • Downloads (Last 6 weeks)5
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Simple, Efficient, and Robust Hash Tables for Join ProcessingProceedings of the 20th International Workshop on Data Management on New Hardware10.1145/3662010.3663442(1-9)Online publication date: 10-Jun-2024
  • (2023)Cracking-Like Join for Trusted Execution EnvironmentsProceedings of the VLDB Endowment10.14778/3598581.359860216:9(2330-2343)Online publication date: 10-Jul-2023
  • (2023)Microarchitectural Analysis of Graph BI Queries on RDBMSProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595321(102-106)Online publication date: 18-Jun-2023
  • (2022)Routing brain traffic through the von Neumann bottleneckParallel Computing10.1016/j.parco.2022.102952113:COnline publication date: 1-Oct-2022
  • (2022)-join: Efficient Join with Versioned Dimension TablesDatabase Systems for Advanced Applications10.1007/978-3-031-00123-9_6(88-95)Online publication date: 11-Apr-2022
  • (2021)Adaptive code generation for data-intensive analyticsProceedings of the VLDB Endowment10.14778/3447689.344769714:6(929-942)Online publication date: 12-Apr-2021
  • (2021)Rare Variants Analysis in Genetic Association Studies with Privacy Protection via Hybrid SystemInformation and Communications Security10.1007/978-3-030-88052-1_11(174-191)Online publication date: 17-Sep-2021
  • (2020)Hardware-aided update acceleration in a hybrid Semantic Web database systemThe Journal of Supercomputing10.1007/s11227-018-2462-y76:10(7961-7984)Online publication date: 1-Oct-2020
  • (2020)VIP: A SIMD vectorized analytical query engineThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-020-00621-w29:6(1243-1261)Online publication date: 13-Jul-2020
  • (2019)Interleaved multi-vectorizingProceedings of the VLDB Endowment10.14778/3368289.336829013:3(226-238)Online publication date: 1-Nov-2019
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media