[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/320080.320100acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
Article
Free access

Hardware identification of cache conflict misses

Published: 16 November 1999 Publication History

Abstract

This paper describes the Miss Classification Table, a simple mechanism that enables the processor or memory controller to identify each cache miss as either a conflict miss or a capacity (non-conflict) miss. The miss classification table works by storing part of the tag of the most recently evicted line of a cache set. If the next miss to that cache set has a matching tag, it is identified as a conflict miss. This technique correctly identifies 87% of misses in the worst case.
Several applications of this information are demonstrated, including improvements to victim caching, next-line prefetching, cache exclusion, and a pseudo-associative cache. This paper also presents the Adaptive Miss Buffer (AMB), which combines several of these techniques, targeting each miss with the most appropriate optimization, all within a single small miss buffer. The AMB's combination of techniques achieves 16% better performance than any single technique alone.

References

[1]
A. Agarwal and S. D. Pudar. Column-associative caches: a technique for reducing the miss rate of direct-mapped caches. In 20th Annual International Symposium on Computer Architecture, pages 179-190, San Diego, CA, May 1993. ACM.
[2]
B. N. Bershad, D. Lee, T. H. Romer, and J. B. Chen. Avoiding conflict misses dynamically in large direct-mapped caches. In Six International Conference on Architectural Support for Programming Languages and Operating Systems, pages 158-170, Oct. 1994.
[3]
T-F. Chen and J.-L. Baer. Effective hardware-based data prefetching for high-performance processors. IEEE Transactions on Computers, 44(5):609--623, May 1995.
[4]
A. L. Cox and R. J. Fowler. Adaptive cache coherency for detecting migratory shared data. In 20th Annual International Symposium on Computer Architecture, pages 98-108, San Diego, CA, May 1993. ACM.
[5]
A. Gonzalez, C. Aliagas, and M. Valero. A data cache with multiple caching strategies tuned to different types of locality. In International Conference on Supercomputing, pages 338-347, June 1995.
[6]
M.D. Hill. Aspects of Cache Memory and Instruction Buffer Performance. PhD thesis, University of Califomia, Berkeley, 1987.
[7]
H. Hirata, K. Kimura, S. Nagamine, Y. Mochizuki, A. Nishimura, Y. Nakase, and T. Nishizawa. An elementary processor architecture with simultaneous instruction issuing from multiple threads. In 19th Annual International Symposium on Computer Architecture, pages 136-145, May 1992.
[8]
T. L. Johnson and W. W. Hwu. Run-time adaptive cache hierarchy management via reference analysis, in 24th Annual International Symposium on Computer Architecture, pages 364-373, May 1997.
[9]
D. Joseph and D. Grunwald. Prefetching using markov predictors. IEEE Transactions on Computers, 48(2):121-133, Feb. 1999.
[10]
N. P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In 17th Annual International Symposium on Computer Architecture, pages 364-373, May 1995.
[11]
V. Milutinovic, M. Tomasevic, B. Markovi, and M. Tremblay. A new cache architecture concept: the split temporal/spatial cache. In Proceedings of 8th Mediterranean Electrotechnical Conference, pages 1108-1111, May 1996.
[12]
B. A. Nayfeh, L. Hammond, and K. Olukotun. Evaluation of design altematives for a multiprocessor microprocessor. In 23rd Annual International Symposium on Computer Architecture, pages 67-77, May 1996.
[13]
T H. Romer, D. Lee, B. N. Bershad, and J. B. Chert. Dynamic page mapping policies for cache conflict resolution on standard hardware. In First Annual Symposium on Operating Systems Design and Implementation, pages 255-266, Nov. 1994.
[14]
A. J. Smith. Cache memories. ACM Computing Surveys, 14(3):473-530, Sept. 1982.
[15]
G.S. Sohi, S. E. Breach, and T N. Vijaykumar. Multiscalar processors. In 22rid Annual International Symposium on Computer Architecture, pages 414--425, June 1995.
[16]
P. Stenstrom, M. Brorsson, and L. Sandberg. An adaptive cache coherence protocol optimized for migratory sharing. In 20th Annual International Symposium on Computer Architecture, pages 109-118, San Diego, CA, May 1993. ACM.
[17]
H.S. Stone. High-Performance Computer Architecture. Addison Wesley, 1987.
[18]
D. M. Tullsen. Simulation and modeling of a simultaneous multithreading processor. In 22nd Annual Computer Measurement Group Conference, Dec. 1996.
[19]
D. M. Tullsen, S. J. Eggers, J. S. Emer, H. M. Levy, J. L. Lo, and R. L. Stamm. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. In 23rd Annual International Symposium on Computer Architecture, May 1996.
[20]
G. Tyson, M. Farrens, J. Matthews, and A. R. Pleszkun. A modified approach to data cache management. In 28th Annual International Symposium on Microarchitecture, pages 93-103, Dec. 1995.

Cited By

View all
  • (2019)Reducing Data Movement and Energy in Multilevel Cache Hierarchies without Losing Performance: Can you have it all?Proceedings of the International Conference on Parallel Architectures and Compilation Techniques10.1109/PACT.2019.00037(382-393)Online publication date: 23-Sep-2019
  • (2018)dCatProceedings of the Thirteenth EuroSys Conference10.1145/3190508.3190555(1-13)Online publication date: 23-Apr-2018
  • (2018)Lightweight detection of cache conflictsProceedings of the 2018 International Symposium on Code Generation and Optimization10.1145/3168819(200-213)Online publication date: 24-Feb-2018
  • Show More Cited By

Index Terms

  1. Hardware identification of cache conflict misses

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MICRO 32: Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
      November 1999
      299 pages
      ISBN:076950437X

      Sponsors

      Publisher

      IEEE Computer Society

      United States

      Publication History

      Published: 16 November 1999

      Check for updates

      Qualifiers

      • Article

      Conference

      MICRO99
      Sponsor:

      Acceptance Rates

      MICRO 32 Paper Acceptance Rate 27 of 131 submissions, 21%;
      Overall Acceptance Rate 484 of 2,242 submissions, 22%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)54
      • Downloads (Last 6 weeks)4
      Reflects downloads up to 20 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)Reducing Data Movement and Energy in Multilevel Cache Hierarchies without Losing Performance: Can you have it all?Proceedings of the International Conference on Parallel Architectures and Compilation Techniques10.1109/PACT.2019.00037(382-393)Online publication date: 23-Sep-2019
      • (2018)dCatProceedings of the Thirteenth EuroSys Conference10.1145/3190508.3190555(1-13)Online publication date: 23-Apr-2018
      • (2018)Lightweight detection of cache conflictsProceedings of the 2018 International Symposium on Code Generation and Optimization10.1145/3168819(200-213)Online publication date: 24-Feb-2018
      • (2017)Dynamic and discrete cache insertion policies for managing shared last level caches in large multicoresJournal of Parallel and Distributed Computing10.1016/j.jpdc.2017.02.004106:C(215-226)Online publication date: 1-Aug-2017
      • (2014)The dirty-block indexProceeding of the 41st annual international symposium on Computer architecuture10.5555/2665671.2665697(157-168)Online publication date: 14-Jun-2014
      • (2014)The dirty-block indexACM SIGARCH Computer Architecture News10.1145/2678373.266569742:3(157-168)Online publication date: 14-Jun-2014
      • (2012)The evicted-address filterProceedings of the 21st international conference on Parallel architectures and compilation techniques10.1145/2370816.2370868(355-366)Online publication date: 19-Sep-2012
      • (2011)Bypass and insertion algorithms for exclusive last-level cachesACM SIGARCH Computer Architecture News10.1145/2024723.200007539:3(81-92)Online publication date: 4-Jun-2011
      • (2011)Bypass and insertion algorithms for exclusive last-level cachesProceedings of the 38th annual international symposium on Computer architecture10.1145/2000064.2000075(81-92)Online publication date: 4-Jun-2011
      • (2006)Simple penalty-sensitive replacement policies for cachesProceedings of the 3rd conference on Computing frontiers10.1145/1128022.1128068(341-352)Online publication date: 3-May-2006
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media