[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/237090.237147acmconferencesArticle/Chapter ViewAbstractPublication PagesasplosConference Proceedingsconference-collections
Article
Free access

Evaluation of architectural support for global address-based communication in large-scale parallel machines

Published: 01 September 1996 Publication History

Abstract

Large-scale parallel machines are incorporating increasingly sophisticated architectural support for user-level messaging and global memory access. We provide a systematic evaluation of a broad spectrum of current design alternatives based on our implementations of a global address language on the Thinking Machines CM-5, Intel Paragon, Meiko CS-2, Cray T3D, and Berkeley NOW. This evaluation includes a range of compilation strategies that make varying use of the network processor; each is optimized for the target architecture and the particular strategy. We analyze a family of interacting issues that determine the performance trade-offs in each implementation, quantify the resulting latency, overhead, and bandwidth of the global access operations, and demonstrate the effects on application performance.

References

[1]
C. Amza, A. L. Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, W. Yu, and W. ZwaenepoeI. TreadMarks: Shared Memory Computing on Networks of Workstations. IEEE Computer, 29(2), 1996.]]
[2]
T. E. Anderson, D. E. Culler, and D. A. Patterson. A Case for NOW(Network of Workstations). IEEE Micro, 15(1), February 1995.]]
[3]
R. Arpaci, D. Culler, A. Krlshnamurthy, S. Steinberg, and K. Yelick. Empirical Evaluation of the CRAY-T3D: A Compiler Perspective. In International Symposium on Computer Architecture, June 1995.]]
[4]
E. Barton, J. Cownie, and M. McLaren. Message passing on the Meiko CS-2. Parallel Computing, 20(4), April 1994.]]
[5]
B. Bershad, S. Savage, P. Pardyak, E. G. Sirer, D. Becker, M. Fiuczynski, C. Chambers, and S. Eggers. Extensibility, Safety and Performance in the SPIN Operating System. In Fifteenth A CM Symposium on Operating System Principles, 1995.]]
[6]
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N. Seizovic, and W. Su. Myrinet: A Gigabitper-Second Local Area Network. IEEE Micro, 15(1), February 1995.]]
[7]
M. C. Carlisle, A. Rogers, J. H. Reppy, and L. J. Hendren. Early experiences with Olden (parallel programming). In Languages and Compilers for Parallel Computing. 6th International Workshop Proceedings. Sprlnger-Verlag, 1994.]]
[8]
J. B. Carter, J. K. Bennett, and W. Zwaenepoel. Implementation and Performance of Munin. Proceedings of the 13th A CM Symposium on Operating Systems Principles, 7(4), November 1989.]]
[9]
K. M. Chandy and C. Kesselman. Compositional C++: Compositional Parallel Programming. In 5th International Workshop on Languages and Compilers for Parallel Computing, New Haven, CT, August 1992.]]
[10]
Cray Research Incorporated. The CRAY T3D Hardware Reference Manual, 1993.]]
[11]
D. Culler, R. Karp, D. Patterson, A. Sahay, K. F,. Schauser, F~. Santos, R. Sumbramonian, and T. von Eicken. LogP: Towards a Realistic Model of Parallel Computation. In Proceedings of the 1993 Conference on Principles and Practice of Parallel Programming, San Diego, CA, May 1993.]]
[12]
D. Culler, L. T. Liu, R. P. Martin, and C. Yoshikawa. LogP Performance Assessment of Fast Network Interfaces. IEEE Micro, February 1996.]]
[13]
D.E. Culler, A. Dusseau, S. C. Goldstein, A. Krishnamurthy, S. Lumetta, T. von Eicken, and K. Yelick. Parallel Programming in Split-C. In Supercomputing '93, Portland, Oregon, November 1993.]]
[14]
W. Groscup. The Intel Paragon XP/S Supercomputer. in Proceedings of the Fifth ECMWF Workshop on the Use of Parallel Processors in Meteorology., Nov 1992.]]
[15]
High Performance Fortran Forum. High Performance Fortran Language Specification Version !.0, May 1993.]]
[16]
Kendall Square Research. KSR1 Technical Summary, 1992.]]
[17]
J. Kubiatowicz and A. Agarwal. Anatomy of a Message in the Alewife Multiprocessor. In 7th A CM International Conference on Supercomputing, July 1993.]]
[18]
J. Kuskin, D. Ofelt, M. Heinrich, J. Heinlein, R. Simoni, K. Gharachorloo, J. Chapin, D. Nakahira, J. Baxter, M. Horowitz, A. Gupta, M. Rosenblum, and J. Hennessy. The Stanford Flash Multiprocessor. In 21st International Symposium on Computer Architecture, April 1994.]]
[19]
C. E. Leiserson, Z. S. Abuhamdeh, D. C. Douglas, C. R. Feynman, M. N. Ganmukhi, J. V. Hill, W. D. Hillis, B. C. Kuszmaul, M. A. St. Pierre, D. S. Wells, M. C. Wong, S. 'fang, and R. Zak. The Network Architecture of the CM-5. In Symposium on Parallel and Di, tributcd Algorithm~ '82, June 1992.]]
[20]
D. Lenoski, J. Laundon, K. Gharachorloo, A. Gupta, and J. L. Hennessy. The Directory Based Cache Coherance Protocol for the DASH Multiprocessor. In Proceedings of the 17th international Symposium on Computer Architecture, 1990.]]
[21]
K. Li and P. Hudak. Memory Coherence in Shared Virtual Memory Systems. A CM Transactions on Computer Systems, November 1989.]]
[22]
L. T. Liu and D. E. Culler. Evaluation of the Intel Paragon on Active Message Communication. in Intel Supercomputer Users Group Conference, 1995.]]
[23]
R. S. Nikhil. Cid: A Parallel, "Shared Memory" C for Distributed Memory Machines. In Languages and Compilers for Parallel Computing. 7th International Workshop Proceedings. Springer-Verlag, 1995.]]
[24]
S. K. Reinhardt, J. R. Larus, and D. A. Wood. Typhoon and Tempest: User-Level Shared Memory. In international Symposium on Computer Architecture, April 1994.]]
[25]
K. E. Schauser and C. J. Scheiman. Experience with Active Messages on the Meiko CS-2. In 9th International Parallel Processing Symposium, April 1995.]]
[26]
K. E. Schauser, C. J. Scheiman, J. M. Ferguson, and P. Z. Kolano. Exploiting the Capabilities of Communications Coprocessors. In l Oth international Parallel Processing Symposium, April 1996.]]
[27]
R. L. Sites. Alpha Architecture Reference Manual. Digital Equipment Corporation, 1992.]]
[28]
T. von Eicken, A. Basu, V. Buch, and W. Vogels. U-Net: A User-Level Network Interface for Parallel and Distributed Computing. In Fifteenth A CM Symposium on Operating System Principles, December 1995.]]
[29]
T. von Eicken, D. E. Culler, S. C. Goldstein, and K. E. Schauser. Active Messages: a Mechanism for Integrated Communication and Computation. In International Symposium on Computer Architecture, 1992.]]
[30]
R. Wahbe, S. Lucco, T. Anderson, and S. Graham. Efficient Software-Based Fault Isolation. In Fourteenth A CM Symposium on Operating System Principles, 1993.]]
[31]
M. J. Zekauskas, W. A. Sawdon, and B. N. Bershad. Software Write Detection for a Distributed Shared Memory. In First Symposium on Operating Systems Design and Implementation, 1994.]]

Cited By

View all
  • (2012)An Out-of-Core Dataflow Middleware to Reduce the Cost of Large Scale Iterative SolversProceedings of the 2012 41st International Conference on Parallel Processing Workshops10.1109/ICPPW.2012.13(71-80)Online publication date: 10-Sep-2012
  • (2006)Network interface active messages for low overhead communication on SMP PC clustersHigh-Performance Computing and Networking10.1007/BFb0100637(764-773)Online publication date: 17-Nov-2006
  • (2002)An analysis of VI Architecture primitives in support of parallel and distributed communicationConcurrency and Computation: Practice and Experience10.1002/cpe.61614:1(55-76)Online publication date: 12-Mar-2002
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASPLOS VII: Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
October 1996
290 pages
ISBN:0897917677
DOI:10.1145/237090
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 1996

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ASPLOS96
Sponsor:

Acceptance Rates

ASPLOS VII Paper Acceptance Rate 25 of 109 submissions, 23%;
Overall Acceptance Rate 535 of 2,713 submissions, 20%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)120
  • Downloads (Last 6 weeks)17
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2012)An Out-of-Core Dataflow Middleware to Reduce the Cost of Large Scale Iterative SolversProceedings of the 2012 41st International Conference on Parallel Processing Workshops10.1109/ICPPW.2012.13(71-80)Online publication date: 10-Sep-2012
  • (2006)Network interface active messages for low overhead communication on SMP PC clustersHigh-Performance Computing and Networking10.1007/BFb0100637(764-773)Online publication date: 17-Nov-2006
  • (2002)An analysis of VI Architecture primitives in support of parallel and distributed communicationConcurrency and Computation: Practice and Experience10.1002/cpe.61614:1(55-76)Online publication date: 12-Mar-2002
  • (2000)Evaluating design alternatives for reliable communication on high-speed networksACM SIGOPS Operating Systems Review10.1145/384264.37900434:5(71-81)Online publication date: 12-Nov-2000
  • (2000)Evaluating design alternatives for reliable communication on high-speed networksACM SIGARCH Computer Architecture News10.1145/378995.37900428:5(71-81)Online publication date: 12-Nov-2000
  • (2000)Evaluating design alternatives for reliable communication on high-speed networksProceedings of the ninth international conference on Architectural support for programming languages and operating systems10.1145/378993.379004(71-81)Online publication date: 12-Nov-2000
  • (2000)Evaluating design alternatives for reliable communication on high-speed networksACM SIGPLAN Notices10.1145/356989.35699635:11(71-81)Online publication date: 1-Nov-2000
  • (2000)Network interface active messages for low overhead communication on SMP PC clustersFuture Generation Computer Systems10.1016/S0167-739X(99)00137-516:5(493-502)Online publication date: 1-Mar-2000
  • (1999)Shared Memory vs Message Passing on SCI: A Case Study Using Split-CSCI: Scalable Coherent Interface10.1007/10704208_21(267-280)Online publication date: 1999
  • (1999)Evaluating the Benefits of Communication CoprocessorsJournal of Parallel and Distributed Computing10.1006/jpdc.1998.152857:2(236-256)Online publication date: 1-May-1999
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media