[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2663165.2666089acmconferencesArticle/Chapter ViewAbstractPublication PagesmiddlewareConference Proceedingsconference-collections
research-article

Improving MapReduce performance in heterogeneous environments with adaptive task tuning

Published: 08 December 2014 Publication History

Abstract

The deployment of MapReduce in datacenters and clouds present several challenges in achieving good job performance. Compared to in-house dedicated clusters, datacenters and clouds often exhibit significant hardware and performance heterogeneity due to continuous server replacement and multi-tenant interferences. As most Mapreduce implementations assume homogeneous clusters, heterogeneity can cause significant load imbalance in task execution, leading to poor performance and low cluster utilizations. Despite existing optimizations on task scheduling and load balancing, MapReduce still performs poorly on heterogeneous clusters.
In this paper, we find that the homogeneous configuration of tasks on heterogeneous nodes can be an important source of load imbalance and thus cause poor performance. Tasks should be customized with different settings to match the capabilities of heterogeneous nodes. To this end, we propose an adaptive task tuning approach, Ant, that automatically finds the optimal settings for individual tasks running on different nodes. Ant works best for large jobs with multiple rounds of map task execution. It first configures tasks with randomly selected configurations and gradually improves tasks settings by reproducing the settings from best performing tasks and discarding poor performing configurations. To accelerate task tuning and avoid trapping in local optimum, Ant uses genetic functions during task configuration. Experimental results on a heterogeneous cluster and a virtual cluster with varying hardware capabilities show that Ant improves the average job completion time by 23%, 11%, and 16% compared to stock Hadoop, customized Hadoop with industry recommendations, and a profiling-based configuration approach, respectively.

References

[1]
PUMA: Purdue mapreduce benchmark suite. http: //web.ics.purdue.edu/~fahmad/benchmarks.htm.
[2]
Ahmad, F., Chakradhar, S., Raghunathan, A., and Vijaykumar, T. N. Tarazu: optimizing mapreduce on heterogeneous clusters. In Proc. Int'l Conf. on Architecture Support for Programming Language and Operating System (ASPLOS) (2012).
[3]
Appuswamy, R., Gkantsidis, C., Narayanan, D., Hodson, O., and Rowstron, A. Scale-up vs scale-out for hadoop: Time to rethink? In Proc. ACM Symposium on Cloud Computing (SoCC) (2013).
[4]
Carrera, D., Steinder, M., Whalley, I., Torres, J., and Ayguadé, E. Enabling resource sharing between transactional and batch workloads using dynamic application placement. In Proc. ACM/IFIP/USENIX Int'l Conf. on Middleware (Middleware) (2008).
[5]
Chiang, R. C., and Huang, H. H. Interference-aware scheduling for data-intensive applications in virtualized environments. In Proc. Int'l Conference for High Performance Computing, Networking, Storage and Analysis (SC) (2011).
[6]
Cho, B., Rahman, M., Chajed, T., Gupta, I., Abad, C., Roberts, N., and Lin, P. Natjam: Eviction policies for supporting priorities and deadlines in mapreduce clusters. In Proc. ACM Symposium on Cloud Computing (SoCC) (2013).
[7]
Cloudera. Configuration parameters. http://blog.cloudera.com/blog/author/aaron/.
[8]
Deb, K., Pratap, A., Agarwal, S., and Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. on Evolutionary Computation 6 (2002), 182{197.
[9]
Dittrich, J., Quiané-Ruiz, J.-A., Jindal, A., Kargin, Y., Setty, V., and Schad, J. Hadoop++: making a yellow elephant run like a cheetah (without it even noticing). In Proc. Int'l Conf. on Very Large Data Bases (VLDB) (2010).
[10]
Guo, Y., Rao, J., Jiang, C., and Zhou, X. Moving mapreduce into the cloud with exible slot management. In Proc. Int'l Conference for High Performance Computing, Networking, Storage and Analysis (SC) (2014).
[11]
Guo, Y., Rao, J., and Zhou, X. ishuffle: Improving hadoop performance with shuffle-on-write. In Proc. Int'l Conference on Autonomic Computing (ICAC) (2013).
[12]
Herodotou, H., and Babu, S. Profiling, what-if analysis, and cost-based optimization of mapreduce programs. In Proc. Int' Conf. on Very Large Data Bases (VLDB) (2011).
[13]
Herodotou, H., Dong, F., and Babu, S. No one (cluster) size fits all: Automatic cluster sizing for data-intensive analytics. In Proc. ACM Symposium on Cloud Computing (SoCC) (2011).
[14]
Herodotou, H., Lim, H., Luo, G., Borisov, N., Dong, L., Cetin, F. B., and Babu, S. Starfish: A self-tuning system for big data analytics. In Proc. Conference on Innovative Data Systems Research (CIDR) (2011).
[15]
Jinda, A., Quian-Ruiz, J., and Dittrich, J. Trojan data layouts: Right shoes for a running elephant. In Proc. of ACM Symposium on Cloud Computing (SoCC) (2011).
[16]
Kambatla, K., Pathak, A., and Pucha, H. Towards optimizing hadoop provisioning in the cloud. In Proc. USENIX HotCloud Workshop (2009).
[17]
Lama, P., and Zhou, X. Aroma: Automated resource allocation and configuration of mapreduce environment in the cloud. In Proc. Int'l Conf. on Autonomic computing (ICAC) (2012).
[18]
Li, X., Wang, Y., Jiao, Y., Xu, C., and Yu, W. Coomr: Cross-task coordination for efficient data management in mapreduce programs. In Proc. Int'l Conference for High Performance Computing, Networking, Storage and Analysis (SC) (2013).
[19]
Li, Z., Cheng, Y., Liu, C., and Zhao, C. Minimum standard deviation difference-based thresholding. In Proc. Int'l Conference on Measuring Technology and Mechatronics Automation (ICMTMA) (2010).
[20]
MapR. The executive's guide to big data. http://www.mapr.com/resources/white-papers.
[21]
Pettijohn, E., Guo, Y., Lama, P., and Zhou, X. User-centric heterogeneity-aware mapreduce job provisioning in the public cloud. In Proc. Int'l Conference on Autonomic Computing (ICAC) (2014).
[22]
Rao, S., Ramakrishnan, R., Silberstein, A., Ovsiannikov, M., and Reeves, D. Sailfish: A framework for large scale data processing. In Proc. of ACM Symposium on Cloud Computing (SoCC) (2012).
[23]
Sharma, B., Wood, T., and Das, C. R. Hybridmr: A hierarchical mapreduce scheduler for hybrid data centers. In Proc. IEEE Int'l Conference on Distributed Computing Systems (ICDCS) (2013).
[24]
Vavilapalli, V. K., Murthy, A. C., Douglas, C., Agarwal, S., Konar, M., Evans, R., Graves, T., Lowe, J., Shah, H., Seth, S., Saha, B., Curino, C., O'Malley, O., Radia, S., Reed, B., and Baldeschwieler, E. Apache hadoop yarn: Yet another resource negotiator. In Proc. ACM Symposium on Cloud Computing (SoCC) (2013).
[25]
Verma, A., Cherkasova, L., and Campbell, R. H. Resource provisioning framework for mapreduce jobs with performance goals. In Proc. ACM/IFIP/USENIX Int'l Middleware Conference (Middleware) (2011).
[26]
Wang, C., Rayan, I. A., Eisenhauer, G., Schwan, K., Talwar, V., Wolf, M., and Huneycutt, C. Vscope: Middleware for troubleshooting time-sensitive data center applications. In Proc. ACM/IFIP/USENIX Int'l Middleware Conference (Middleware) (2012).
[27]
White, T. Hadoop: The Definitive Guide, 3rd ed. O'Reilly Media / Yahoo Press, 2012.
[28]
Wolf, J., Rajan, D., Hildrum, K., Khandekar, R., Kumar, V., Parekh, S., Wu, K., and Balmin, A. Flex: A slot allocation scheduling optimizer for mapreduce workloads. In Proc. ACM/IFIP/USENIX Int'l Middleware Conference (Middleware) (2010).
[29]
Yigitbasi, N., Willke, T., Liao, G., and Epema, D. Towards machine learning-based auto-tuning of mapreduce. In Proc. IEEE/ACM Int'l Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS) (2013).
[30]
Zaharia, M., Konwinski, A., Joseph, A. D., Katz, R., and Stoica, I. Improving mapreduce performance in heterogeneous environments. In Proc. USENIX Symposium on Operating System Design and Implementation (OSDI) (2008).
[31]
Zhang, Q., Zhani, M. F., Boutaba, R., and Hellerstein, J. L. Harmony: Dynamic heterogeneity-aware resource provisioning in the cloud. In Proc. IEEE Int'l Conference on Distributed Computing Systems (ICDCS) (2013).

Cited By

View all
  • (2023)Cost-based Data Prefetching and Scheduling in Big Data Platforms over Tiered Storage SystemsACM Transactions on Database Systems10.1145/362538948:4(1-40)Online publication date: 13-Nov-2023
  • (2023)Efficient Node Selection for Coding-based Timely Computation over Heterogeneous Systems2023 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom)10.1109/ISPA-BDCloud-SocialCom-SustainCom59178.2023.00065(246-253)Online publication date: 21-Dec-2023
  • (2023)Forseti: Dynamic chunk-level reshaping for data processing on heterogeneous clustersJournal of Parallel and Distributed Computing10.1016/j.jpdc.2022.09.003171(14-23)Online publication date: Jan-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
Middleware '14: Proceedings of the 15th International Middleware Conference
December 2014
334 pages
ISBN:9781450327855
DOI:10.1145/2663165
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Orange
  • Conseil Régional d'Aquitaine
  • LaBRI: LaBRI
  • Raytheon BBN Technologies: Raytheon BBN Technologies
  • ACM: Association for Computing Machinery
  • Red Hat JBoss Middleware: Red Hat JBoss Middleware
  • Bordeaux: City of Bordeaux
  • USENIX Assoc: USENIX Assoc
  • GDR ASR: GDR Architecture, Systèmes et Réseaux
  • IBM: IBM
  • HP: HP
  • IFIP

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 December 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. adaptive task tuning
  2. heterogeneous clusters
  3. improving MapReduce performance

Qualifiers

  • Research-article

Funding Sources

Conference

Middleware '14
Sponsor:
  • LaBRI
  • Raytheon BBN Technologies
  • ACM
  • Red Hat JBoss Middleware
  • Bordeaux
  • USENIX Assoc
  • GDR ASR
  • IBM
  • HP

Acceptance Rates

Middleware '14 Paper Acceptance Rate 27 of 144 submissions, 19%;
Overall Acceptance Rate 203 of 948 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)2
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Cost-based Data Prefetching and Scheduling in Big Data Platforms over Tiered Storage SystemsACM Transactions on Database Systems10.1145/362538948:4(1-40)Online publication date: 13-Nov-2023
  • (2023)Efficient Node Selection for Coding-based Timely Computation over Heterogeneous Systems2023 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom)10.1109/ISPA-BDCloud-SocialCom-SustainCom59178.2023.00065(246-253)Online publication date: 21-Dec-2023
  • (2023)Forseti: Dynamic chunk-level reshaping for data processing on heterogeneous clustersJournal of Parallel and Distributed Computing10.1016/j.jpdc.2022.09.003171(14-23)Online publication date: Jan-2023
  • (2022)SymTunerProceedings of the 44th International Conference on Software Engineering10.1145/3510003.3510185(2068-2079)Online publication date: 21-May-2022
  • (2022)OSC: An Online Self-Configuring Big Data Framework for Optimization of QoSIEEE Transactions on Computers10.1109/TC.2021.306327871:4(809-823)Online publication date: 1-Apr-2022
  • (2021)Latency-aware Straggler Mitigation Strategy in Hadoop MapReduce Framework: A ReviewSystematic Literature Review and Meta-Analysis Journal10.54480/slrm.v2i2.192:2(53-60)Online publication date: 19-Oct-2021
  • (2021)TridentProceedings of the VLDB Endowment10.14778/3461535.346154514:9(1570-1582)Online publication date: 22-Oct-2021
  • (2021)Survey on improving the performance of MapReduce in HadoopProceedings of the 4th International Conference on Networking, Information Systems & Security10.1145/3454127.3456617(1-5)Online publication date: 1-Apr-2021
  • (2021)GML: Efficiently Auto-Tuning Flink's Configurations Via Guided Machine LearningIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.308160032:12(2921-2935)Online publication date: 1-Dec-2021
  • (2021)swMR: A Framework for Accelerating MapReduce Applications on Sunway TaihulightIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2018.28812659:2(1020-1030)Online publication date: 1-Apr-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media