Aspects of cache memory and instruction buffer performance

January 1987

Author:
Mark Donald Hill,
Chairman:
Alan Jay Smith

Publisher:

University of California, Berkeley

Order Number:AAI8813907

Pages:

297

Purchase on ProQuest

Bibliometrics

Abstract

Techniques are developed in this dissertation to efficiently evaluate direct-mapped and set-associative caches. These techniques are used to study associativity in CPU caches and examine instruction caches for single-chip RISC microprocessors. This research is motivated in general by the importance of cache memories to computer performance, and more specifically by work done to design the caches in SPUR, a multiprocessor workstation designed at U.C. Berkeley. The studies focus not only on abstract measures of performance such as miss ratios, but also include, when appropriate, detailed implementation factors, such as access times and gate delays.

The simulation algorithms developed compute miss ratios for numerous alternative caches with one pass through an address trace, provided all caches have the same block size, and use demand fetching and LRU replacement. One algorithm (forest simulation) simulates direct-mapped caches by relying on inclusion, a property that all larger caches contain a superset of the data in smaller caches. The other algorithm (all associativity simulation) simulates a broader class of direct-mapped and set-associative caches than could previously be studied with a one-pass algorithm, although somewhat less efficiently than forest simulation, since inclusion does not hold.

The analysis of set-associative caches yields two major results. First, constant factors are obtained which relate to the miss ratios for set-associative caches to miss ratios for other set-associative caches. Then those results are combined with sample cache implementations to show that above certain cache sizes, direct-mapped caches have lower effective access times than set-associative caches, despite having higher miss ratios.

Finally, instruction buffers and target instruction buffers are examined as organizations for instruction memory on single-chip microprocessors. The analysis focuses closely on implementation considerations, including the interaction between instruction fetches, instruction prefetches and data references, and uses the SPUR RISC design as the case study. Results show the effects of varying numerous design parameters, suggest some superior designs, and demonstrate that instruction buffers will be preferred to target instruction buffers in future RISC microprocessors implemented on single CMOS chips.

Cited By

Contributors

M. D. Hill
Microsoft Corporation
- Publication Years1983 - 2023
- Publication counts148
- Citation count21,502
- Available for Download150
- Downloads (cumulative)181,298
- Downloads (12 months)22,303
- Downloads (6 weeks)2,812
- Average Downloads per Article1,209
- Average Citation per Article145
View Full Profile
Alan Jay Smith
University of California, Berkeley
- Publication Years1974 - 2009
- Publication counts121
- Citation count5,592
- Available for Download35
- Downloads (cumulative)55,411
- Downloads (12 months)13,267
- Downloads (6 weeks)8,592
- Average Downloads per Article1,583
- Average Citation per Article46
View Full Profile

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Aspects of Cache Memory and Instruction
Multiprocessor cache memory performance: characterization and optimization
A Performance Study of Instruction Cache Prefetching Methods

Prefetching methods for instruction caches are studied via trace-driven simulation. The two primary methods are "fall-through" prefetch (sometimes referred to as "one block lookahead") and "target" prefetch. Fall-through prefetches are for sequential ...

Browse Theses

Sections

Cited By