[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2833312.2833322acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdcnConference Proceedingsconference-collections
research-article

STIC-D: algorithmic techniques for efficient parallel pagerank computation on real-world graphs

Published: 04 January 2016 Publication History

Abstract

Computing metrics on nodes of a graph is an essential step in understanding the properties of the graph. Pagerank is one such metric that is popular and is being used to measure the importance of nodes in not only web graphs but also in social networks, biological networks, road networks, and the like. The core of the computation of pagerank can be seen as an iterative approach that updates the pageranks of nodes until the values converge.
However, as real-world graphs such as road networks and the web have a large size, one needs to design efficient techniques to address the challenges of scale. In addition to parallelism that can be exploited, it is important to also look for specific properties of graphs and their impact on the algorithm.
In this paper, we present four algorithmic techniques that optimize the pagerank computation on real-world graphs. The techniques are presented with the aim of exploiting the nature of the real-world graphs and eliminating redundancies in the pagerank computation. Our techniques also have the advantage that with little extra effort one can quickly identify which of the techniques will be suitable for a given input graph.
We implement our algorithm on an Intel i7 980x CPU running 12 threads using OpenMP Version 3.0. We study our techniques on four classes of real-world graphs: web graphs, social networks, citation and collaboration networks, and road networks. Our implementation achieves an average speedup of 32% compared to a baseline implementation.

References

[1]
Stanford network analysis platform dataset. http://www.cise.uïňĆ.edu/research/sparse/matrices/SNAP.
[2]
The University of Florida Sparse Matrix Collection. http://www.cise.ufl.edu/research/sparse/matrices/.
[3]
Albert, R., Jeong, H., and Barabási, A.-L. Internet: Diameter of the world-wide web. Nature 401, 6749 (1999), 130--131.
[4]
Arasu, A., Novak, J., Tomkins, A., and Tomlin, J. Pagerank computation and the structure of the web: Experiments and algorithms. In Proceedings of the Eleventh International World Wide Web Conference, Poster Track (2002), pp. 107--117.
[5]
Backstrom, L., Huttenlocher, D., Kleinberg, J., and Lan, X. Group formation in large social networks: membership, growth, and evolution. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining (2006), ACM, pp. 44--54.
[6]
Banerjee, D. S., Kumar, A., Chaitanya, M., Sharma, S., and Kothapalli, K. Work efficient parallel algorithms for large graph exploration on emerging heterogeneous architectures. JPDC 76 (2015), 81--93.
[7]
Banerjee, D. S., Sharma, S., and Kothapalli, K. Work efficient parallel algorithms for large graph exploration. In HiPC (2013).
[8]
Boldi, P., Rosa, M., Santini, M., and Vigna, S. Layered label propagation: A multiresolution coordinate-free ordering for compressing social networks. In Proceedings of the 20th international conference on World Wide Web (2011), ACM Press.
[9]
Boldi, P., and Vigna, S. The WebGraph framework I: Compression techniques. In Proc. of the Thirteenth International World Wide Web Conference (WWW 2004) (Manhattan, USA, 2004), ACM Press, pp. 595--601.
[10]
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., and Wiener, J. Graph structure in the web. Computer Networks 33, 1--6 (2000), 309--320.
[11]
Broder, A. Z., Lempel, R., Maghoul, F., and Pedersen, J. O. Efficient pagerank approximation via graph aggregation. In Proceedings of the 13th international conference on World Wide Web (2004), pp. 484--485.
[12]
Cong, G., and Bader, D. A. An experimental study of parallel biconnected components algorithms on symmetric multiprocessors (smps). In Proc. IEEE IPDPS (2005).
[13]
Cormen, Leiserson, Rivest, and Stein. Introduction To Algorithms. Prentice Hall.
[14]
Duong, N. T., Nguyen, Q. A. P., Nguyen, A. T., and Nguyen, H.-D. Parallel pagerank computation using gpus. In Proceedings of the Third Symposium on Information and Communication Technology (2012), SoICT '12, pp. 223--230.
[15]
Eiron, N., McCurley, K. S., and Tomlin, J. A. Ranking the web frontier. In Proceedings of the 13th international conference on World Wide Web (2004), pp. 309--318.
[16]
Geisberger, R., Sanders, P., and Schultes, D. Better approximation of betweenness centrality. In ALENEX (2008), SIAM, pp. 90--100.
[17]
Gleich, D. F. PageRank beyond the web. arXiv cs.SI (2014), 1407.5107. Accepted for publication in SIAM Review.
[18]
Hong, S., Rodia, N. C., and Olukotun, K. On Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs. In Proc. of SC'13 (2013).
[19]
Kamvar, S., Haveliwala, T., Manning, C., and Golub, G. Exploiting the block structure of the web for computing pagerank. Tech. Rep. 2003-17, Stanford University, 2003.
[20]
Kohlschutter, C., Chirita, P.-A., and Nejdl, W. Efficient parallel computation of pagerank. In Proc. of the 28th European Conference on IR Research (2006), pp. 241--252.
[21]
Leskovec, J., Lang, K., Dasgupta, A., and Mahoney, M. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters.
[22]
Page, L., Brin, S., Motwani, R., and Winograd, T. The pagerank citation ranking: bringing order to the web.
[23]
Richardson, M., Agrawal, R., and Domingos, P. Trust management for the semantic web. In The Semantic Web-ISWC 2003. Springer, 2003, pp. 351--368.
[24]
Sarma, A. D., Molla, A. R., Pandurangan, G., and Upfal, E. Fast distributed pagerank computation. In Proc. of ICDCN (2013), pp. 11--26.
[25]
SarÄŚyuce, A. E., Saüle, E., Kaya, K., and Çatalyürek, U. V. Shattering and compressing networks for betweenness centrality. In SIAM Data Mining (2013).
[26]
Slota, K. G., and Madduri, K. Simple parallel biconnectivity algorithms for multicore platforms. In HiPC (2014), pp. 1--10.
[27]
Wang, Y., and DeWitt, D. J. Computing pagerank in a distributed internet search engine system. In Proceedings of the Thirtieth International Conference on Very Large Data Bases (2004), pp. 420--431.
[28]
Wu, J., and Aberer, K. Using siterank for decentralized computation of web document ranking. In Proc. of the Third Workshop on Adaptive Hypermedia and Adaptive Web-Based Systems (2004), pp. 265--274.

Cited By

View all
  • (2024)Lock-free Computation of PageRank in Dynamic Graphs2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW63119.2024.00148(825-834)Online publication date: 27-May-2024
  • (2024)DF* PageRank: Incrementally Expanding Approaches for Updating PageRank on Dynamic GraphsEuro-Par 2024: Parallel Processing10.1007/978-3-031-69583-4_22(312-326)Online publication date: 26-Aug-2024
  • (2022)An Improved/Optimized Practical Non-Blocking PageRank Algorithm for Massive Graphs*International Journal of Parallel Programming10.1007/s10766-022-00725-650:3-4(381-404)Online publication date: 26-Mar-2022
  • Show More Cited By

Index Terms

  1. STIC-D: algorithmic techniques for efficient parallel pagerank computation on real-world graphs

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICDCN '16: Proceedings of the 17th International Conference on Distributed Computing and Networking
    January 2016
    370 pages
    ISBN:9781450340328
    DOI:10.1145/2833312
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 January 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. algorithmic optimizations
    2. pagerank
    3. parallel computing
    4. real-world graphs
    5. strongly connected components

    Qualifiers

    • Research-article

    Conference

    ICDCN '16

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Lock-free Computation of PageRank in Dynamic Graphs2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW63119.2024.00148(825-834)Online publication date: 27-May-2024
    • (2024)DF* PageRank: Incrementally Expanding Approaches for Updating PageRank on Dynamic GraphsEuro-Par 2024: Parallel Processing10.1007/978-3-031-69583-4_22(312-326)Online publication date: 26-Aug-2024
    • (2022)An Improved/Optimized Practical Non-Blocking PageRank Algorithm for Massive Graphs*International Journal of Parallel Programming10.1007/s10766-022-00725-650:3-4(381-404)Online publication date: 26-Mar-2022
    • (2021)An Efficient Practical Non-Blocking PageRank Algorithm for Large Scale Graphs2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)10.1109/PDP52278.2021.00015(35-43)Online publication date: Mar-2021
    • (2021)Efficient Parallel Algorithms for Computing Percolation Centrality2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC)10.1109/HiPC53243.2021.00025(111-120)Online publication date: Dec-2021
    • (2021)Efficient parallel algorithms for dynamic closeness‐ and betweenness centralityConcurrency and Computation: Practice and Experience10.1002/cpe.665035:17Online publication date: 29-Sep-2021
    • (2020)HyPR: Hybrid Page Ranking on Evolving Graphs2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC)10.1109/HiPC50609.2020.00020(62-71)Online publication date: Dec-2020
    • (2020)Parallel PageRank Algorithm Using MapReduceICDSMLA 201910.1007/978-981-15-1420-3_85(787-795)Online publication date: 19-May-2020
    • (2019)BRICS – Efficient Techniques for Estimating the Farness-Centrality in Parallel2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW.2019.00110(645-654)Online publication date: May-2019
    • (2017)Approximate Computing Techniques for Iterative Graph Algorithms2017 IEEE 24th International Conference on High Performance Computing (HiPC)10.1109/HiPC.2017.00013(23-32)Online publication date: Dec-2017

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media