Changing with the times

Changing with the times: adaptive interconnects and coherence for future chip multiprocessors

January 2011

Author:
Mishali Pankajbhai Naik
University of California, Los Angeles
,
Adviser:
Glenn Reinman
University of California, Los Angeles

Publisher:

University of California at Los Angeles
Computer Science Department 405 Hilgard Avenue Los Angeles, CA
United States

ISBN:978-1-124-98895-5

Order Number:AAI3483231

Pages:

149

Purchase on ProQuest

Bibliometrics

Abstract

Instead of scaling up the frequency of a single core to increase performance, chip multiprocessors (CMPs) have emerged as the practical alternative to scale performance by leveraging parallelism as the means to meet the increasing demands of applications. As chip multiprocessors continue to scale to larger numbers of processing cores, the demands on the on-chip communication framework will grow to satisfy the data and communication requirements of future multithreaded applications. This problem is exacerbated by poor wire scaling, which increases the latency and power consumption of on-chip communication. In response, two alternative interconnects have emerged, both based on electromagnetic wave propagation and both with latency effectively limited by the speed of light: optical interconnect (OI) and RF interconnect (RF-I).

In the first part of this dissertation, we focus on the use of alternative interconnects in future many-core systems to provide performance and power benefit by reducing on-chip access latency. In most conventional NoCs, link bandwidths are allocated in a uniform way in order to provide sufficient bandwidth for varying traffic demands. By studying the communication demands in different applications, we observed that applications tend to exhibit diverse patterns of communication. We demonstrate the use of RF-I to adapt to these varying communication patterns by flexibly allocating RF-I bandwidth to the critical paths of communication. By allocating RF-I bandwidth between components that communicate frequently and using lower bandwidth in other parts of the NoC, we can provide NoC power savings without significant loss in performance.

In order to leverage the abundant processing resources available on-chip, future many-core systems will require an effective means of sharing data between the collaborati cores. Hence, a power-efficient, scalable, and coherent interconnect fabric is vital to scale application performance in the many-core era. We propose a scalable architecture to enable snooping-based coherence, by introducing a low-latency interconnect structure specialized for store traffic in addition to the regular baseline NoC for all other traffic. We see a need to separate store requests from the rest of the on-chip traffic to avoid the impact of stores on load latency and bandwidth. We demonstrate the performance and power advantage of our snooping-based cache coherence architecture.

As part of this dissertation, we also try to study the scalability of the two emerging alternative interconnect technologies, by providing a quantitative comparison of both OI and RF-I at the same technology generation. Ultimately, we will demonstrate where OI and RF-I will most likely be used for future designs. Our analysis will include on-chip communication, and chip-to-chip communication.

Contributors

Glenn D Reinman
University of California, Los Angeles
- Publication Years1998 - 2020
- Publication counts76
- Citation count1,856
- Available for Download55
- Downloads (cumulative)33,625
- Downloads (12 months)2,974
- Downloads (6 weeks)488
- Average Downloads per Article611
- Average Citation per Article24
View Full Profile
Mishali Pankajbhai Naik
University of California, Los Angeles
- Publication Years2008 - 2011
- Publication counts5
- Citation count200
- Available for Download3
- Downloads (cumulative)1,656
- Downloads (12 months)40
- Downloads (6 weeks)10
- Average Downloads per Article552
- Average Citation per Article40
View Full Profile

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Improving parallel system performance by changing the arrangement of the network links
ICS '00: Proceedings of the 14th international conference on Supercomputing

The Midimew network is an excellent contender for implementing the communication subsystem of a high performance computer. This network is an optimal 2D topology in the sense there are no other symmetric direct networks of degree 4 with a lower average ...
A Torus-Based Hierarchical Optical-Electronic Network-on-Chip for Multiprocessor System-on-Chip

Networks-on-chip (NoCs) are emerging as a key on-chip communication architecture for multiprocessor systems-on-chip (MPSoCs). Optical communication technologies are introduced to NoCs in order to empower ultra-high bandwidth with low power consumption. ...
A Simulation Times Model of Multi-core Simulation
WCSE '09: Proceedings of the 2009 WRI World Congress on Software Engineering - Volume 01

Chip multi-processor (CMP) increases processor throughput by duplicating resources for many threads. Due to the main frequency of a single processor approaching to limit, CMP is becoming more and more popular. However, it is not well studied how to ...

Browse Theses

Sections

Improving parallel system performance by changing the arrangement of the network links

A Torus-Based Hierarchical Optical-Electronic Network-on-Chip for Multiprocessor System-on-Chip

A Simulation Times Model of Multi-core Simulation

Sections

Save to Binder

Recommendations

Improving parallel system performance by changing the arrangement of the network links

A Torus-Based Hierarchical Optical-Electronic Network-on-Chip for Multiprocessor System-on-Chip

A Simulation Times Model of Multi-core Simulation