[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/ICPP.2014.36guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Kylix: A Sparse Allreduce for Commodity Clusters

Published: 09 September 2014 Publication History

Abstract

Allreduce is a basic building block for parallel computing. Our target here is "Big Data" processing on commodity clusters (mostly sparse power-law data). Allreduce can be used to synchronize models, to maintain distributed datasets, and to perform operations on distributed data such as sparse matrix multiply. We first review a key constraint on cluster communication, the minimum efficient packet size, which hampers the use of direct all-to-all protocols on large networks. Our allreduce network is a nested, heterogeneous-degree butterfly. We show that communication volume in lower layers is typically much less than the top layer, and total communication across all layers a small constant larger than the top layer, which is close to optimal. A chart of network communication volume across layers has a characteristic "Kylix" shape, which gives the method its name. For optimum performance, the butterfly degrees also decrease down the layers. Furthermore, to efficiently route sparse updates to the nodes that need them, the network must be nested. While the approach is amenable to various kinds of sparse data, almost all "Big Data" sets show power-law statistics, and from the properties of these, we derive methods for optimal network design. Finally, we present experiments showing with Kylix on Amazon EC2 and demonstrating significant improvements over existing systems such as PowerGraph and Hadoop.

Cited By

View all
  • (2024)Scalability Limitations of Processing-in-Memory using Real System EvaluationsProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/36390468:1(1-28)Online publication date: 21-Feb-2024
  • (2023)HEAR: Homomorphically Encrypted AllreduceProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607099(1-17)Online publication date: 12-Nov-2023
  • (2021)FlareProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3458817.3476178(1-16)Online publication date: 14-Nov-2021
  • Show More Cited By

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
BRACIS '14: Proceedings of the 2014 Brazilian Conference on Intelligent Systems
October 2014
914 pages
ISBN:9781479956180

Publisher

IEEE Computer Society

United States

Publication History

Published: 09 September 2014

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Scalability Limitations of Processing-in-Memory using Real System EvaluationsProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/36390468:1(1-28)Online publication date: 21-Feb-2024
  • (2023)HEAR: Homomorphically Encrypted AllreduceProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607099(1-17)Online publication date: 12-Nov-2023
  • (2021)FlareProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3458817.3476178(1-16)Online publication date: 14-Nov-2021
  • (2019)Demystifying Parallel and Distributed Deep LearningACM Computing Surveys10.1145/332006052:4(1-43)Online publication date: 30-Aug-2019
  • (2019)SparCMLProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3295500.3356222(1-15)Online publication date: 17-Nov-2019
  • (2017)An Efficient Task-based All-Reduce for Machine Learning ApplicationsProceedings of the Machine Learning on HPC Environments10.1145/3146347.3146350(1-8)Online publication date: 12-Nov-2017
  • (2016)DISPProceedings of the First Workshop on Optimization of Communication in HPC10.5555/3018058.3018064(53-62)Online publication date: 13-Nov-2016

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media