[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

The Synchronization Power of Coalesced Memory Accesses

Published: 01 July 2010 Publication History

Abstract

Multicore architectures have established themselves as the new generation of computer architectures. As part of the one core to many cores evolution, memory access mechanisms have advanced rapidly. Several new memory access mechanisms have been implemented in many modern commodity multicore architectures. By specifying how processing cores access shared memory, memory access mechanisms directly influence the synchronization capabilities of multicore architectures. Therefore, it is crucial to investigate the synchronization power of these new memory access mechanisms. This paper investigates the synchronization power of coalesced memory accesses, a family of memory access mechanisms introduced in recent large multicore architectures such as the Compute Unified Device Architecture (CUDA). We first define three memory access models to capture the fundamental features of the new memory access mechanisms. Subsequently, we prove the exact synchronization power of these models in terms of their consensus numbers. These tight results show that the coalesced memory access mechanisms can facilitate strong synchronization between the threads of multicore architectures, without the need of synchronization primitives other than reads and writes. In the case of the contemporary CUDA processors, our results imply that the coalesced memory access mechanisms have consensus numbers up to 64.

Cited By

View all
  • (2016)GreenBSTProceedings of the 22nd International Conference on Euro-Par 2016: Parallel Processing - Volume 983310.1007/978-3-319-43659-3_37(502-517)Online publication date: 24-Aug-2016
  • (2011)Performance evaluation of the three-dimensional finite-difference time-domain(FDTD) method on Fermi architecture GPUsProceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I10.5555/2075416.2075459(460-469)Online publication date: 24-Oct-2011

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems  Volume 21, Issue 7
July 2010
159 pages

Publisher

IEEE Press

Publication History

Published: 01 July 2010

Author Tags

  1. Memory access models
  2. Memory access models, consensus, multicore architectures, interprocess synchronization.
  3. consensus
  4. interprocess synchronization.
  5. multicore architectures

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2016)GreenBSTProceedings of the 22nd International Conference on Euro-Par 2016: Parallel Processing - Volume 983310.1007/978-3-319-43659-3_37(502-517)Online publication date: 24-Aug-2016
  • (2011)Performance evaluation of the three-dimensional finite-difference time-domain(FDTD) method on Fermi architecture GPUsProceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I10.5555/2075416.2075459(460-469)Online publication date: 24-Oct-2011

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media