More Web Proxy on the site http://driver.im/

research-article

Tarazu: optimizing MapReduce on heterogeneous clusters

Authors:

Srimat T. Chakradhar,

Anand Raghunathan,

T. N. VijaykumarAuthors Info & Claims

ASPLOS XVII: Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems

Pages 61 - 74

https://doi.org/10.1145/2150976.2150984

Published: 03 March 2012 Publication History

Abstract

Data center-scale clusters are evolving towards heterogeneous hardware for power, cost, differentiated price-performance, and other reasons. MapReduce is a well-known programming model to process large amount of data on data center-scale clusters. Most MapReduce implementations have been designed and optimized for homogeneous clusters. Unfortunately, these implementations perform poorly on heterogeneous clusters (e.g., on a 90-node cluster that contains 10 Xeon-based servers and 80 Atom-based servers, Hadoop performs worse than on 10-node Xeon-only or 80-node Atom-only homogeneous sub-clusters for many of our benchmarks). This poor performance remains despite previously proposed optimizations related to management of straggler tasks. In this paper, we address MapReduce's poor performance on heterogeneous clusters. Our first contribution is that the poor performance is due to two key factors: (1) the non-intuitive effect that MapReduce's built-in load balancing results in excessive and bursty network communication during the Map phase, and (2) the intuitive effect that the heterogeneity amplifies load imbalance in the Reduce computation. Our second contribution is Tarazu, a suite of optimizations to improve MapReduce performance on heterogeneous clusters. Tarazu consists of (1) Communication-Aware Load Balancing of Map computation (CALB) across the nodes, (2) Communication-Aware Scheduling of Map computation (CAS) to avoid bursty network traffic and (3) Predictive Load Balancing of Reduce computation (PLB) across the nodes. Using the above 90-node cluster, we show that Tarazu significantly improves performance over a baseline of Hadoop with straightforward tuning for hardware heterogeneity.

References

[1]

R. Agrawal and R. Srikant. Fast algorithms for mining association rules. Proceedings of 20th Intl. Conference on Very Large Data Bases, VLDB, 1994.

Digital Library

[2]

Amazon EC2. http://aws.amazon.com/ec2.

[3]

G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu, B. Saha, and E. Harris. Reining in the outliers in Map-Reduce clusters using Mantri. In Proceedings of the Usenix Symposium on Operating Systems Design and Implementation (OSDI), 2010.

Digital Library

[4]

D. G. Andersen, J. Franklin, M. Kaminsky, A. Phanishayee, L. Tan, and V. Vasudevan. FAWN: A Fast Array of Wimpy Nodes. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), 2009.

Digital Library

[5]

Apache Mahout: Scalable machine learning and data mining. http://mahout.apache.org.

[6]

A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, D. E. Culler, J. M. Hellerstein, and D. A. Patterson. High-performance sorting on networks of workstations. In Proceedings of the SIGMOD International Conference on Management of Data, pages 243--254, Tucson,Arizona, May 1997.

Digital Library

[7]

Calxeda, Inc. http://www.calxeda.com.

[8]

Q. Chen, D. Zhang, M. Guo, Q. Deng, and S. Guo. SAMR: A Self-adaptive MapReduce Scheduling Algorithm in Heterogeneous Environment. In Proceedings of the International Conference on Computer and Information Technology (CIT), 2010.

Digital Library

[9]

B.-G. Chun, G. Iannaccone, G. Iannaccone, R. Katz, G. Lee, and L. Niccolini. An Energy Case for Hybrid Datacenters. In SOSP Workshop on Power Aware Computing and Systems (HotPower), 2009.

[10]

J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. Commun. ACM, pages 107--113, Jan. 2008.

Digital Library

[11]

Facebook Hive. http://hadoop.apache.org/hive.

[12]

A. Ghodsi, M. Zaharia, B. Hindman, A. Konwinski, S. Shenker, and I. Stoica. Dominant Resource Fairness: Fair allocation of multiple resource types. In Proceedings of the 8th USENIX conference on Networked systems design and implementation(NSDI), 2011.

Digital Library

[13]

A. Greenberg, J. R. Hamilton, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. A. Maltz, P. Patel, and S. Sengupta. VL2: A scalable and flexible data center network. In Proceedings of the SIGCOMM conference on Data Communication, pages 51--62, 2009.

Digital Library

[14]

Hadoop. http://lucene.apache.org/hadoop/.

[15]

J. Hamilton. When Very Low-Power, Low-Cost Servers Don't Make Sense. In http://perspectives.mvdirona.com/2010/05/18/WhenVeryLowPowerLowCostServersDontMakeSense.aspx, 2010.

[16]

B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang. Mars: A MapReduce framework on graphics processors. In Proceedings of the 17th international conference on Parallel Architectures and Compilation Techniques, pages 260--269, 2008.

Digital Library

[17]

HP Labs. Project Moonshot. In http://www.hp.com/hpinfo/newsroom/press_kits/2011/MoonshotInfrastructure/index.html, 2011.

[18]

M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: Distributed data-parallel programs from sequential building blocks. In EuroSys '07: Proceedings of the 2nd SIGOPS/EuroSys European Conference on Computer Systems, 2007.

Digital Library

[19]

M. Isard, V. Prabhakaran, J. Currey, U. Wieder, K. Talwar, and A. Goldberg. Quincy: Fair scheduling for distributed computing clusters. In Proceedings of the 22nd symposium on Operating systems principles, SOSP, pages 261--276, USA, 2009.

Digital Library

[20]

V. Janapa Reddi, B. C. Lee, T. Chilimbi, and K. Vaid. Web search using mobile cores: Quantifying and mitigating the price of efficiency. In Proceedings of the International Symposium on Computer Architecture (ISCA), 2010.

Digital Library

[21]

J.Hartigan. Clustering Algorithms. Wiley, 1975.

Digital Library

[22]

A. Krioukov, P. Mohan, S. Alspaugh, L. Keys, D. E. Culler, and R. H. Katz. NapSAC: Design and implementation of a power-proportional web cluster. ACM Computer Communication Review, 41(1), 2011.

Digital Library

[23]

W. Lang, J. M. Patel, and S. Shankar. Wimpy node clusters: What about non-wimpy workloads? In Proceedings of the International Workshop on Data Management on New Hardware (DaMoN), 2010.

Digital Library

[24]

K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, and S. Reinhardt. Understanding and Designing New Server Architectures for Emerging Warehouse-Computing Environments. In Proceedings of the 35th Annual International Symposium on Computer Architecture (ISCA), pages 315--326, 2008.

Digital Library

[25]

M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng. Merge: A programming model for heterogeneous multi-core systems. In Proceedings of the 13th international conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2008.

Digital Library

[26]

Netflix movies data. http://www.netflixprize.com/download.

[27]

C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig latin: a not-so-foreign language for data processing. In Proceedings of the 2008 international conference on Management Of Data, SIGMOD, pages 1099--1110, 2008.

Digital Library

[28]

M. M. Rafique, N. Ravi, S. Cadambi, S. T. Chakradhar, and A. R. Butt. Power Management for Heterogeneous Clusters: An Experimental Study. In Proceedings of the IEEE International Conference on Green Computing, 2011.

Digital Library

[29]

C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis. Evaluating MapReduce for Multi-core and Multiprocessor Systems. In 13th International Symposium on High Performance Computer Architecture, HPCA, pages 13--24, 2007.

Digital Library

[30]

SeaMicro, Inc. http://www.seamicro.com.

[31]

A. Vahdat, M. Al-Fares, N. Farrington, R. N. Mysore, G. Porter, and S. Radhakrishnan. Scale-Out Networking in the Data Center. IEEE Micro, 30:29--41, July 2010.

Digital Library

[32]

D. Weld. Lecture notes on MapReduce (based on Jeff Dean's slides). http://rakaposhi.eas.asu.edu/cse494/notes/s07-map-reduce.ppt, 2007.

[33]

X-RIME: Hadoop based large scale social network analysis. http://xrime.sourceforge.net/.

[34]

H.-C. Yang, A. Dasdan, R.-L. Hsiao, and D. S. Parker. Map-reduce-merge: Simplified relational data processing on large clusters. In Proceedings of the SIGMOD international conference on Management Of Data, 2007.

Digital Library

[35]

Y. Yu, M. Isard, D. Fetterly, M. Budiu, U. Erlingsson, P. K. Gunda, and J. Currey. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language. In Proceedings of International Symposium on Operating System Design and Implementation (OSDI), 2008.

Digital Library

[36]

M. Zaharia, D. Borthakur, J. Sen Sarma, K. Elmeleegy, S. Shenker, and I. Stoica. Delay Scheduling: A simple technique for achieving locality and fairness in cluster scheduling. In Proceedings of the 5th European conference on Computer systems, EuroSys '10, pages 265--278, 2010.

Digital Library

[37]

M. Zaharia, A. Konwinski, A. Joseph, R. Katz, and I. Stoica. Improving MapReduce performance in heterogeneous environments. In Proceedings of the Usenix Symposium on Operating Systems Design and Implementation (OSDI), 2008.

Digital Library

[38]

Hadoop rebalancer. http://hadoop.apache.org/common/docs/r0.17.2/hdfs_user_guide.html#Rebalancer.

Cited By

Vijayakumar SMadhavapeddy AKalyvianaki E(2024)Scheduling for Reduced Tail Task Latencies in Highly Utilized DatacentersProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698522(302-321)Online publication date: 20-Nov-2024
https://dl.acm.org/doi/10.1145/3698038.3698522
Usman SMehmood RKatib IAlbeshri A(2022)Data Locality in High Performance Computing, Big Data, and Converged Systems: An Analysis of the Cutting Edge and a Future System ArchitectureElectronics10.3390/electronics1201005312:1(53)Online publication date: 23-Dec-2022
https://doi.org/10.3390/electronics12010053
Baruah NKraft PKazhamiaka FBailis PZaharia M(2022)Parallelism-Optimizing Data Placement for Faster Data-Parallel ComputationsProceedings of the VLDB Endowment10.14778/3574245.357426016:4(760-771)Online publication date: 1-Dec-2022
https://dl.acm.org/doi/10.14778/3574245.3574260
Show More Cited By

Index Terms

Tarazu: optimizing MapReduce on heterogeneous clusters
1. Computing methodologies
  1. Distributed computing methodologies
    1. Distributed programming languages
  2. Parallel computing methodologies
    1. Parallel programming languages
2. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language types
        Distributed programming languages
        Parallel programming languages

Recommendations

Tarazu: optimizing MapReduce on heterogeneous clusters
ASPLOS '12

Data center-scale clusters are evolving towards heterogeneous hardware for power, cost, differentiated price-performance, and other reasons. MapReduce is a well-known programming model to process large amount of data on data center-scale clusters. Most ...
Tarazu: optimizing MapReduce on heterogeneous clusters
ASPLOS '12

Data center-scale clusters are evolving towards heterogeneous hardware for power, cost, differentiated price-performance, and other reasons. MapReduce is a well-known programming model to process large amount of data on data center-scale clusters. Most ...
Performance Modeling of MapReduce Jobs in Heterogeneous Cloud Environments
CLOUD '13: Proceedings of the 2013 IEEE Sixth International Conference on Cloud Computing

Many companies start using Hadoop for advanced data analytics over large datasets. While a traditional Hadoop cluster deployment assumes a homogeneous cluster, many enterprise clusters are grown incrementally over time, and might have a variety of ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

ASPLOS XVII: Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems

March 2012

476 pages

ISBN:9781450307598

DOI:10.1145/2150976

General Chair:
Tim Harris
Microsoft Research
,
Program Chair:
Michael L. Scott
University of Rochester

ACM SIGARCH Computer Architecture News Volume 40, Issue 1
ASPLOS '12
March 2012
453 pages
ISSN:0163-5964
DOI:10.1145/2189750
Issue’s Table of Contents
ACM SIGPLAN Notices Volume 47, Issue 4
ASPLOS '12
April 2012
453 pages
ISSN:0362-1340
EISSN:1558-1160
DOI:10.1145/2248487
Issue’s Table of Contents

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 March 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

ASPLOS'12

Sponsor:

ASPLOS'12: Seventeenth International Conference on Architectural Support for Programming Languages and Operating Systems

March 3 - 7, 2012

London, England, UK

Acceptance Rates

Overall Acceptance Rate 535 of 2,713 submissions, 20%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

211
Total Citations
View Citations
2,169
Total Downloads

Downloads (Last 12 months)33
Downloads (Last 6 weeks)5

Reflects downloads up to 11 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Vijayakumar SMadhavapeddy AKalyvianaki E(2024)Scheduling for Reduced Tail Task Latencies in Highly Utilized DatacentersProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698522(302-321)Online publication date: 20-Nov-2024
https://dl.acm.org/doi/10.1145/3698038.3698522
Usman SMehmood RKatib IAlbeshri A(2022)Data Locality in High Performance Computing, Big Data, and Converged Systems: An Analysis of the Cutting Edge and a Future System ArchitectureElectronics10.3390/electronics1201005312:1(53)Online publication date: 23-Dec-2022
https://doi.org/10.3390/electronics12010053
Baruah NKraft PKazhamiaka FBailis PZaharia M(2022)Parallelism-Optimizing Data Placement for Faster Data-Parallel ComputationsProceedings of the VLDB Endowment10.14778/3574245.357426016:4(760-771)Online publication date: 1-Dec-2022
https://dl.acm.org/doi/10.14778/3574245.3574260
Yang DCheng DRang WWang Y(2022)Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical Data CentersIEEE Transactions on Cloud Computing10.1109/TCC.2019.296165310:1(461-473)Online publication date: 1-Jan-2022
https://doi.org/10.1109/TCC.2019.2961653
Wang XWang CBai MMa QLi G(2022)HTD: heterogeneous throughput-driven task scheduling algorithm in MapReduceDistributed and Parallel Databases10.1007/s10619-021-07375-640:1(135-163)Online publication date: 1-Mar-2022
https://dl.acm.org/doi/10.1007/s10619-021-07375-6
Kumar ANarayanan IZhu TSivasubramaniam A(2020)The Fast and The Frugal: Tail Latency Aware Provisioning for Coping with Load VariationsProceedings of The Web Conference 202010.1145/3366423.3380117(314-326)Online publication date: 20-Apr-2020
https://dl.acm.org/doi/10.1145/3366423.3380117
Loghin DCai SChen GDinh TFan FLin QNg JOoi BSun XTa QWang WXiao XYang YZhang MZhang Z(2020)The Disruptions of 5G on Data-Driven Technologies and ApplicationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.296767032:6(1179-1198)Online publication date: 1-Jun-2020
https://doi.org/10.1109/TKDE.2020.2967670
Dong YTang BYe BQu ZLu S(2020)Intermediate Value Size Aware Coded MapReduce2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS)10.1109/ICPADS51040.2020.00054(348-355)Online publication date: Dec-2020
https://doi.org/10.1109/ICPADS51040.2020.00054
Li BZhang ZZheng TZhong QHuang QCheng X(2020)Marabunta: Continuous Distributed Processing of Skewed Streams2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID)10.1109/CCGrid49817.2020.00-68(252-261)Online publication date: May-2020
https://doi.org/10.1109/CCGrid49817.2020.00-68
Liang YHu ZLi K(2020)Power consumption model based on feature selection and deep learning in cloud computing scenariosIET Communications10.1049/iet-com.2019.071714:10(1610-1618)Online publication date: Jun-2020
https://doi.org/10.1049/iet-com.2019.0717
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents