
MemorIES: a programmable, real-time hardware emulation tool for multiprocessor server design

Published: 12 November 2000

Abstract

Modern system design often requires multiple levels of simulation for design validation and performance debugging. However, while machines have gotten faster and simulators have become more detailed, simulation speeds have not tracked machine speeds. As a result, it is difficult to simulate realistic problem sizes and hardware configurations for a target machine. Instead, researchers have focused on developing scaling methodologies and running smaller problem sizes and configurations that attempt to represent the behavior of the real problem. Given the increasing size of problems today, it is unclear whether such an approach yields accurate results. Moreover, although commercial workloads are prevalent and important in today's marketplace, many simulation tools are unable to adequately profile such applications, let alone at realistic sizes.

In this paper we present a hardware-based emulation tool that can be used to aid memory system designers. Our focus is on the memory system because the ever-widening gap between processor and memory speeds means that optimizing the memory subsystem is critical for performance. We present the design of the Memory Instrumentation and Emulation System (MemorIES). MemorIES is a programmable tool designed using FPGAs and SDRAMs. It plugs into an SMP bus to perform on-line emulation of several cache configurations, structures and protocols while the system is running real-life workloads in real time, without any slowdown in application execution speed. We demonstrate its usefulness in several case studies and find several important results. First, using traces to perform system evaluation can lead to incorrect results (off by 100% or more in some cases) if the trace size is not sufficiently large. Second, MemorIES is able to detect performance problems by profiling miss behavior over the entire course of a run, rather than relying on a small interval of time. Finally, we observe that previous studies of SPLASH2 applications using scaled application sizes can result in optimistic miss rates relative to real sizes on real machines, providing potentially misleading data when used for design evaluation.
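The first finding, that a trace which is too short can misreport miss behavior, can be illustrated with a small software sketch. This is not the MemorIES hardware design or its algorithm. It is a toy tag-only LRU cache model, and the class name, the cache parameters, and the synthetic sweep workload are all assumptions chosen for illustration. It shows one plausible mechanism, cold-start misses dominating a short trace, under those assumptions.

```python
from collections import OrderedDict


class SetAssociativeCache:
    """Toy LRU set-associative cache that tracks tags only (hypothetical model)."""

    def __init__(self, size_bytes, line_bytes, ways):
        self.line_bytes = line_bytes
        self.ways = ways
        self.num_sets = size_bytes // (line_bytes * ways)
        # One ordered dict per set; insertion order doubles as LRU order.
        self.sets = [OrderedDict() for _ in range(self.num_sets)]
        self.accesses = 0
        self.misses = 0

    def access(self, addr):
        """Record one memory reference; return True on a hit."""
        line = addr // self.line_bytes
        index = line % self.num_sets
        tag = line // self.num_sets
        entries = self.sets[index]
        self.accesses += 1
        if tag in entries:
            entries.move_to_end(tag)     # mark as most recently used
            return True
        self.misses += 1
        if len(entries) >= self.ways:
            entries.popitem(last=False)  # evict the least recently used tag
        entries[tag] = None
        return False

    def miss_rate(self):
        return self.misses / self.accesses if self.accesses else 0.0


def synthetic_trace(num_lines, passes, line_bytes):
    """Assumed workload: sweep the same working set `passes` times."""
    for _ in range(passes):
        for i in range(num_lines):
            yield i * line_bytes


def miss_rate_for(trace, **config):
    cache = SetAssociativeCache(**config)
    for addr in trace:
        cache.access(addr)
    return cache.miss_rate()


# Assumed configuration: 64 KiB, 64-byte lines, 4-way. The 32 KiB working set fits.
config = dict(size_bytes=64 * 1024, line_bytes=64, ways=4)
short_rate = miss_rate_for(synthetic_trace(512, 1, 64), **config)
long_rate = miss_rate_for(synthetic_trace(512, 100, 64), **config)
print(f"short trace miss rate: {short_rate:.2f}")
print(f"long trace miss rate:  {long_rate:.2f}")
```

In this sketch the one-pass trace sees only cold misses, so its miss rate is far higher than the rate observed over the full hundred-pass run. Real workloads are less regular than this sweep, so the size and direction of the error will differ.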



Published In

ACM SIGOPS Operating Systems Review, Volume 34, Issue 5
Dec. 2000
269 pages
ISSN: 0163-5980
DOI: 10.1145/384264

ASPLOS IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
November 2000
271 pages
ISBN: 1581133170
DOI: 10.1145/378993

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery
New York, NY, United States
