[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2350716.2350751acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

Parallel PageRank computation using GPUs

Published: 23 August 2012 Publication History

Abstract

Fast & efficient computing of web rank scores is a necessary issue of search engines today. Because of the enormous size of data and the dynamic nature of World Wide Web, this computation is generally executed on large web graphs (to billions webpages) and requires refreshing quite often, so it becomes a challenging task. In this paper, we propose an efficient method for computing PageRank score -- a Google ranking method based on analyzing the link structure of the Web on graphics processing units (GPUs). We have employed a slightly modification of a storage data format called binary 'link structure file' which inspirited from [2] for storing the web graph data. We then divided the PageRank calculating phases into parallel operations for exploiting the computing power of the graphics cards. Our program was written in CUDA language to experiment on a system equipped two double NVIDIA GeForce GTX 295 graphics cards, using two real datasets which were crawled from Vietnamese sites containing 7 million pages, 132 million links and 15 million pages, 200 million links, respectively. The experimental results showed that the computation speed increase from 10 to 20 times when compared to a CPU Intel Q8400 at 2.67 GHz based version, on both datasets. Our method can also scale up well for larger web graphs.

References

[1]
S. Brin and L. Page. 1998. The anatomy of a large-scale hypertextual web search engine. In Proceedings of the 7th WWW Conference.
[2]
A. Rungsawang and B. Manaskasemsak. 2004. Parallel PageRank Computation on a Gigabit PC Cluster. In Proceedings of the 18th International Conference on Advance Information Networking and Application.
[3]
A. Rungsawang and B. Manaskasemsak. 2003. PageRank computation using PC cluster. In Proceedings of the 10th European PVM/MPI User's Group Meeting.
[4]
A. Rungsawang and B. Manaskasemsak. 2004. An Efficient Partition-Based Parallel PageRank Algorithm. In Proceedings of the 11th International Conference Parallel and Distributed Computing.
[5]
K. Sankaralingam, S. Sethumadhavan and J. C. Browne. 2003. Distributed PageRank for P2P system. In Proceedings of the 11th IEEE HPD'03 Conference.
[6]
Amy N. Langville and Carl D. Meyer. 2006. Google's PageRank and Beyond: The Science of Search Engine Rankings. Princeton University Press, 41 William Street, Princeton, New Jersey, 2006, p. 31--46.
[7]
Nathan Bell and Michael Garland. 2008. Ecient Sparse Matrix-Vector Multiplication on CUDA. NVIDIA Technical Report.
[8]
Xintian Yang, Srinivasan Parthasarathy, P. Sadayappan. 2011. Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining. Proceedings of the VLDB Endowment, Vol. 4, No. 4. Seattle, Washington.
[9]
Praveen K., Vamshi Krishna K., Anil Sri Harsha B., S. Balasubramanian, P. K. Baruah. 2011. Cost Efficient PageRank Computation using GPU. IEEE International Conference on High Performance Computing (HiPC), Student Research Symposium
[10]
Tianji WU, Bo WANG, Yi SHAN, Feng YAN, Yu WANG and Ningyi XU. 2010. Efficient PageRank and SpMV Computation on AMD GPUs. 39th International Conference on Parallel Processing, DOI 10.1109, p. 81--89
[11]
Ali Cevahir, Cevdet Aykanat, Ata Turk, B. Barla Cambazoglu, Akira Nukada and Satoshi Matsuoka. 2010. Efficient PageRank on GPU Clusters. IPSJ SIG Technical Report, Vol. 2010-HPC-128.
[12]
Chebyshev distance. http://en.wikipedia.org/wiki/Chebyshev_distance
[13]
M. Harris. 2007. Parallel Prefix Sum (Scan) with CUDA. NVIDIA Corporation.
[14]
CUDA zone, http://www.NVIDIA.com/object/cuda_home_new.html
[15]
NVIDIA, 2009 "NVIDIA CUDA Programming Guide 3.0".

Cited By

View all
  • (2024)Performance Analysis of Parallelized PageRank Algorithm using OpenMP, MPI and CUDA2024 International Conference on Smart Systems for Electrical, Electronics, Communication and Computer Engineering (ICSSEECC)10.1109/ICSSEECC61126.2024.10649542(44-49)Online publication date: 28-Jun-2024
  • (2023)Choosing the Best Parallelization and Implementation Styles for Graph Analytics Codes: Lessons Learned from 1106 ProgramsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607038(1-14)Online publication date: 12-Nov-2023
  • (2020)HyPR: Hybrid Page Ranking on Evolving Graphs2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC)10.1109/HiPC50609.2020.00020(62-71)Online publication date: Dec-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
SoICT '12: Proceedings of the 3rd Symposium on Information and Communication Technology
August 2012
290 pages
ISBN:9781450312325
DOI:10.1145/2350716
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 August 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CUDA
  2. GPU
  3. PageRank
  4. parallel algorithm

Qualifiers

  • Research-article

Conference

SoICT '12

Acceptance Rates

Overall Acceptance Rate 147 of 318 submissions, 46%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)46
  • Downloads (Last 6 weeks)6
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Performance Analysis of Parallelized PageRank Algorithm using OpenMP, MPI and CUDA2024 International Conference on Smart Systems for Electrical, Electronics, Communication and Computer Engineering (ICSSEECC)10.1109/ICSSEECC61126.2024.10649542(44-49)Online publication date: 28-Jun-2024
  • (2023)Choosing the Best Parallelization and Implementation Styles for Graph Analytics Codes: Lessons Learned from 1106 ProgramsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607038(1-14)Online publication date: 12-Nov-2023
  • (2020)HyPR: Hybrid Page Ranking on Evolving Graphs2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC)10.1109/HiPC50609.2020.00020(62-71)Online publication date: Dec-2020
  • (2019)Which Graph Representation to Select for Static Graph-Algorithms on a CUDA-capable GPUProceedings of the 12th Workshop on General Purpose Processing Using GPUs10.1145/3300053.3319416(22-31)Online publication date: 13-Apr-2019
  • (2019)GPU-based Graph Traversal on Compressed GraphsProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3319871(775-792)Online publication date: 25-Jun-2019
  • (2019)Solving write conflicts in GPU-accelerated graph computation: A PageRank case-study2019 IEEE 5th International forum on Research and Technology for Society and Industry (RTSI)10.1109/RTSI.2019.8895572(144-148)Online publication date: Sep-2019
  • (2016)A Parallel Data Mining Algorithm for PageRank ComputationProceedings of the International Conference on Big Data and Advanced Wireless Technologies10.1145/3010089.3010118(1-5)Online publication date: 10-Nov-2016
  • (2016)STIC-DProceedings of the 17th International Conference on Distributed Computing and Networking10.1145/2833312.2833322(1-10)Online publication date: 4-Jan-2016
  • (2016)Compiler-Assisted Workload Consolidation for Efficient Dynamic Parallelism on GPU2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2016.98(534-543)Online publication date: May-2016
  • (2015)GraphReduceProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/2807591.2807655(1-12)Online publication date: 15-Nov-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media