DOI: 10.1145/3503823.3503870
short-paper

On Exploring the Optimum Configuration of Apache Spark Framework in Heterogeneous Clusters

Published: 22 February 2022

Abstract

Over the past decade, both industry and academia have adopted the Big Data paradigm to explore the value of data. As the volume of collected data grows, the computational infrastructure that processes it must scale its capacity accordingly. This work proposes a model for estimating the optimal configuration parameters of a heterogeneous Spark cluster and validates it against two different use cases. The experiments show that the proposed model can successfully estimate the optimal Spark configuration parameters for both memory-intensive and CPU-intensive applications.

Published In

PCI '21: Proceedings of the 25th Pan-Hellenic Conference on Informatics
November 2021
499 pages

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Apache Spark
  2. Configuration Model
  3. Linear Regression
  4. PageRank

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Funding Sources

  • Operational Program Competitiveness, Entrepreneurship and Innovation, under the call RESEARCH - CREATE - INNOVATE

Conference

PCI 2021

Acceptance Rates

Overall Acceptance Rate 190 of 390 submissions, 49%
