[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1128022.1128064acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
Article

Cache miss behavior: is it √2?

Published: 03 May 2006 Publication History

Abstract

It has long been empirically observed that the cache miss rate decreased as a power law of cache size, where the power was approximately -1/2. In this paper, we examine the dependence of the cache miss rate on cache size both theoretically and through simulation. By combining the observed time dependence of the cache reference pattern with a statistical treatment of cache entry replacement, we predict that the cache miss rate should vary with cache size as an inverse power law for a first level cache. The exponent in the power law is directly related to the time dependence of cache references, and lies between -0.3 to -0.7. Results are presented for both direct mapped and set associative caches, and for various levels of the cache hierarchy. Our results demonstrate that the dependence of cache miss rate on cache size arises from the temporal dependence of the cache access pattern.

References

[1]
R. Kalla, B.Sinharoy, and J. Tendler. "IBM Power5 Chip: A dual-core multi-threaded processor", IEEE Micro 24(2), 40-47, 2004.
[2]
D. Boggs et al. "The microarchitecture of the Intel Pentium 4 processor on 90 nm technology", Intel Technology Journal 8, Issue 1, 1997.
[3]
http://www.amd.com, Technical Documentation. "AMD Opteron Product Data Sheet", Publication number 23932, 2004.
[4]
http://www.sun.com/processors/manuals/USIV_v1.0.pdf. "UltraSPARC IV Processor", User's Manual Supplement, Version 1.0, 2004.
[5]
C. K. Chow. "On Optimization of Storage Hierarchies", IBM Journal of R & D 18, 194--203, 1974.
[6]
C. K. Chow. "Determination of Cache's Capacity and its Matching Storage Hierarchy", IEEE Transactions on Computers, c-25, 157--164, 1976.
[7]
J. S. Harper, D. J. Kerbyson and G. R. Nudd. "Efficient Analytical Modelling of Multi-Level Set-Associative Caches", Proceedings of the International Conference HPCN Europe '99' 1593, pp. 473--482, 1999.
[8]
S. Przybylski, M. Horowitz and J. Hennessy. "Performance Tradeoffs in Cache Design", Proceedings of the 15th Annual International Symposium on Computer Architecture, pp. 290--298, 1988.
[9]
S. Przybylski, M. Horowitz and J. Hennessy. "Characteristics of Performance-Optimal Multi-Level Cache Hierarchies", Proceedings of the 16th Annual International Symposium on Computer Architecture, pp. 114--121, 1989.
[10]
G. S. Rao. "Performance Analysis of Cache Memories", JACM 25, 378--395, 1978.
[11]
J. H. Saltzer. "A Simple Linear Model of Demand Paging Performance", CACM 17, 181--186, 1974.
[12]
A. J. Smith. "Cache Memories", Computing Surveys 14, 473--528, 1982.
[13]
M. H. Macdougall. "Instruction-level Program and Processor Modeling", Computer 7, 14--24, 1984.
[14]
J. P. Singh, H. S. Stone and D. F. Thiebaut. "A Model of Workloads and Its Use in Miss-Rate Prediction for Fully Associative Caches", IEEE Transactions on Computers 41, 811--825, 1992.
[15]
A. Hartstein and T. R. Puzak. "Optimum Power/Performance Pipeline Depth", Proc. of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, pp. 117--125, 2003.
[16]
D. Thiebaut, J. L. Wolf and H. S. Stone. "Synthetic Traces for Trace-Driven Simulation of Cache Memories", IEEE Transactions on Computers 41, 388--410, 1992.

Cited By

View all
  • (2023)Acceleration of Nuclear Reactor Simulation and Uncertainty Quantification Using Low-Precision ArithmeticApplied Sciences10.3390/app1302089613:2(896)Online publication date: 9-Jan-2023
  • (2023)SERIAL AND DIFFERENT PARALLEL IMPLEMENTATIONS OF LATTICE BOLTZMANN METHOD IN PYTHON: A COMPARATIVE ANALYSISComputational Thermal Sciences: An International Journal10.1615/ComputThermalScien.202304560015:5(55-70)Online publication date: 2023
  • (2022)Utility Optimal Thread Assignment and Resource Allocation in Multi-Server SystemsIEEE/ACM Transactions on Networking10.1109/TNET.2021.312381730:2(735-748)Online publication date: Apr-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CF '06: Proceedings of the 3rd conference on Computing frontiers
May 2006
430 pages
ISBN:1595933026
DOI:10.1145/1128022
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 May 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cache organization
  2. memory hierarchy
  3. performance

Qualifiers

  • Article

Conference

CF06
Sponsor:
CF06: Computing Frontiers Conference
May 3 - 5, 2006
Ischia, Italy

Acceptance Rates

Overall Acceptance Rate 273 of 785 submissions, 35%

Upcoming Conference

CF '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)2
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Acceleration of Nuclear Reactor Simulation and Uncertainty Quantification Using Low-Precision ArithmeticApplied Sciences10.3390/app1302089613:2(896)Online publication date: 9-Jan-2023
  • (2023)SERIAL AND DIFFERENT PARALLEL IMPLEMENTATIONS OF LATTICE BOLTZMANN METHOD IN PYTHON: A COMPARATIVE ANALYSISComputational Thermal Sciences: An International Journal10.1615/ComputThermalScien.202304560015:5(55-70)Online publication date: 2023
  • (2022)Utility Optimal Thread Assignment and Resource Allocation in Multi-Server SystemsIEEE/ACM Transactions on Networking10.1109/TNET.2021.312381730:2(735-748)Online publication date: Apr-2022
  • (2019)Empirical Discovery of Power-Law Distribution in MapReduce ScalabilityIEEE Transactions on Cloud Computing10.1109/TCC.2017.26693207:3(744-755)Online publication date: 1-Jul-2019
  • (2018)A novel power model for future heterogeneous 3D chip-multiprocessors in the dark silicon ageEURASIP Journal on Embedded Systems10.1186/s13639-018-0086-12018:1Online publication date: 27-Jul-2018
  • (2017)Exploring and analyzing the real impact of modern on-package memory on HPC scientific kernelsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3126908.3126931(1-14)Online publication date: 12-Nov-2017
  • (2016)A model-driven approach to warp/thread-block level GPU cache bypassingProceedings of the 53rd Annual Design Automation Conference10.1145/2897937.2897966(1-6)Online publication date: 5-Jun-2016
  • (2016)Sizing Cleancache Allocation for Virtual Machines’ Transcendent MemoryIEEE Transactions on Computers10.1109/TC.2015.245602565:6(1949-1963)Online publication date: 1-Jun-2016
  • (2016)An energy-efficient bandwidth allocation method for single-chip heterogeneous processor2016 Seventh International Green and Sustainable Computing Conference (IGSC)10.1109/IGCC.2016.7892617(1-7)Online publication date: 2016
  • (2015)ICCI: In-Cache Coherence InformationIEEE Transactions on Computers10.1109/TC.2014.230818564:4(995-1014)Online publication date: 1-Apr-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media