[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3184066.3184083acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlscConference Proceedingsconference-collections
research-article

Improving the shortest path finding algorithm in apache spark graphX

Published: 02 February 2018 Publication History

Abstract

The shortest path finding problem is one of the most important and common problems on graphs. It is also a basic problem applied to solve other problems such as the betweenness centrality problem, the closeness centrality problem... Therefore, in all graph processing platforms, there is a way to solve this problem. Apache Spark GraphX is also. However, the shortest path finding algorithm in GraphX has some drawbacks to discuss more. Therefore, in this paper we analyze some issues in finding the shortest path in GraphX, then we propose two new algorithms to improve for better performance, and finally we compare the performance between the shortest path finding algorithm in GraphX and proposed algorithms.

References

[1]
Rindra R., Apache Spark Graph Processing, 1st ed., BIRMINGHAM - MUMBAI: Packt Publishing, 2015.
[2]
Karau H., Konwinski A., Wendell P., Zaharia M., Learning Spark, Sebastopol: O'Reilly Media, Inc, 2015.
[3]
Min C., Shiwen M., Yunhao L., "Big Data: A Survey," Springer Science, 2014.
[4]
Sabia J., Love A., "Technologies to Handle Big Data: A Survey," 2014.
[5]
Michael S.M., Robin E., Spark GraphX in Action, New York: Manning Publications Co., 2016.
[6]
Krishna S., Fast Data Processing with Spark 2 - Third Edition, Birmingham: Packt Publishing Ltd., 2016.
[7]
Apache Spark API, "Apache Spark API - Scala," 2017. {Online}. Available: http://spark.apache.org/docs/latest/api/scala/#package. {Accessed 20 8 2017}.
[8]
Ahmed O., Fatima-Zahra B., Ayoub A. L., Samir B., "Big Data technologies: A survey," Journal of King Saud University - Computer and Information Sciences, 2017.
[9]
Apache Spark Cluster, "Cluster Mode Overview," 2017. {Online}. Available: https://spark.apache.org/docs/latest/cluster-overview.html. {Accessed 20 08 2017}.

Cited By

View all
  • (2024)Benchmarking Big Data Systems: Performance and Decision-Making Implications in Emerging TechnologiesTechnologies10.3390/technologies1211021712:11(217)Online publication date: 3-Nov-2024
  • (2021)Best path in mountain environment based on parallel A* algorithm and Apache SparkThe Journal of Supercomputing10.1007/s11227-021-04072-078:4(5075-5094)Online publication date: 15-Sep-2021

Index Terms

  1. Improving the shortest path finding algorithm in apache spark graphX

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMLSC '18: Proceedings of the 2nd International Conference on Machine Learning and Soft Computing
    February 2018
    198 pages
    ISBN:9781450363365
    DOI:10.1145/3184066
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 February 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. apache spark
    2. bigdata
    3. graph
    4. graphX
    5. pregel API
    6. scala
    7. shortest path

    Qualifiers

    • Research-article

    Conference

    ICMLSC 2018

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 21 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Benchmarking Big Data Systems: Performance and Decision-Making Implications in Emerging TechnologiesTechnologies10.3390/technologies1211021712:11(217)Online publication date: 3-Nov-2024
    • (2021)Best path in mountain environment based on parallel A* algorithm and Apache SparkThe Journal of Supercomputing10.1007/s11227-021-04072-078:4(5075-5094)Online publication date: 15-Sep-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media