[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/775152.775190acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Extrapolation methods for accelerating PageRank computations

Published: 20 May 2003 Publication History

Abstract

We present a novel algorithm for the fast computation of PageRank, a hyperlink-based estimate of the ''importance'' of Web pages. The original PageRank algorithm uses the Power Method to compute successive iterates that converge to the principal eigenvector of the Markov matrix representing the Web link graph. The algorithm presented here, called Quadratic Extrapolation, accelerates the convergence of the Power Method by periodically subtracting off estimates of the nonprincipal eigenvectors from the current iterate of the Power Method. In Quadratic Extrapolation, we take advantage of the fact that the first eigenvalue of a Markov matrix is known to be 1 to compute the nonprincipal eigenvectors using successive iterates of the Power Method. Empirically, we show that using Quadratic Extrapolation speeds up PageRank computation by 25-300% on a Web graph of 80 million nodes, with minimal overhead. Our contribution is useful to the PageRank community and the numerical linear algebra community in general, as it is a fast method for determining the dominant eigenvector of a matrix that is too large for standard fast methods to be practical.

References

[1]
A. Aitken. On Bernoulli's numerical solution of algebraic equations. Proc. Roy. Soc. Edinburgh, 46:289--305, 1926.
[2]
A. Arasu, J. Novak, A. Tomkins, and J. Tomlin. PageRank computation and the structure of the web: Experiments and algorithms. In Proceedings of the Eleventh International World Wide Web Conference, Poster Track, 2002.
[3]
K. Bharat and M. R. Henzinger. Improved algorithms for topic distillation in a hyperlinked environment. In Proceedings of the ACM-SIGIR, 1998.
[4]
S. Chakrabarti, B. Dom, D. Gibson, J. Kleinberg, P. Raghavan, and S. Rajagopalan. Automatic resource compilation by analyzing hyperlink structure and associated text. In Proceedings of the Seventh International World Wide Web Conference, 1998.
[5]
S. Chakrabarti, M. M. Joshi, K. Punera, and D. M. Pennock. The structure of broad topics on the web. In Proceedings of the Eleventh International World Wide Web Conference, 2002.
[6]
S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: A new approach to topic-specific web resource discovery. In Proceedings of the Eighth International World Wide Web Conference, 1999.
[7]
R. Fagin, R. Kumar, and D. Sivakumar. Comparing top k lists. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms, 2003.
[8]
G. H. Golub and C. F. V. Loan. Matrix Computations. The Johns Hopkins University Press, Baltimore, 1996.
[9]
G. Grimmett and D. Stirzaker. Probability and Random Processes. Oxford University Press, 1989.
[10]
T. H. Haveliwala. Efficient computation of PageRank. Stanford University Technical Report, 1999.
[11]
T. H. Haveliwala. Topic-sensitive PageRank. In Proceedings of the Eleventh International World Wide Web Conference, 2002.
[12]
T. H. Haveliwala and S. D. Kamvar. The second eigenvalue of the Google matrix. Stanford University Technical Report, 2003.
[13]
J. Hirai, S. Raghavan, H. Garcia-Molina, and A. Paepcke. WebBase: A repository of web pages. In Proceedings of the Ninth International World Wide Web Conference, 2000.
[14]
G. Jeh and J. Widom. Scaling personalized web search. In Proceedings of the Twelfth International World Wide Web Conference, 2003.
[15]
J. Kleinberg. Authoritative sources in a hyperlinked environment. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms, 1998.
[16]
J. Kleinberg, S. R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. The web as a graph: Measurements, models, and methods. In Proceedings of the International Conference on Combinatorics and Computing, 1999.
[17]
U. Krieger. Numerical solution of large finite markov chains by algebraic multigrid techniques. In Proceedings of the 2nd International Workshop on the Numerical Solution of Markov Chains, 1995.
[18]
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. Stanford Digital Libraries Working Paper, 1998.
[19]
D. Rafiei and A. O. Mendelzon. What is this page known for? Computing web page reputations. In Proceedings of the Ninth International World Wide Web Conference, 2000.
[20]
M. Richardson and P. Domingos. The intelligent surfer: Probabilistic combination of link and content information in pagerank. In Advances in Neural Information Processing Systems, volume 14. MIT Press, Cambridge, MA, 2002.
[21]
L. N. Trefethen and D. Bau. Numerical Linear Algebra. SIAM Press, Philadelphia, 1997.
[22]
P. Wynn. On the convergence and stability of the epsilon algorithm. SIAM Journal of Numerical Analysis, 33:91--122, 1966.

Cited By

View all
  • (2025)Weak dangling block reordering and multi-step block compression for efficiently computing and updating PageRank solutionsJournal of Computational and Applied Mathematics10.1016/j.cam.2024.116332458(116332)Online publication date: Apr-2025
  • (2024)Efficient and Accurate PageRank Approximation on Large GraphsProceedings of the ACM on Management of Data10.1145/36771322:4(1-26)Online publication date: 30-Sep-2024
  • (2024)Efficient Algorithms for Personalized PageRank Computation: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.337600036:9(4582-4602)Online publication date: Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '03: Proceedings of the 12th international conference on World Wide Web
May 2003
772 pages
ISBN:1581136803
DOI:10.1145/775152
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. PageRank
  2. eigenvector computation
  3. link analysis

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)6
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Weak dangling block reordering and multi-step block compression for efficiently computing and updating PageRank solutionsJournal of Computational and Applied Mathematics10.1016/j.cam.2024.116332458(116332)Online publication date: Apr-2025
  • (2024)Efficient and Accurate PageRank Approximation on Large GraphsProceedings of the ACM on Management of Data10.1145/36771322:4(1-26)Online publication date: 30-Sep-2024
  • (2024)Efficient Algorithms for Personalized PageRank Computation: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.337600036:9(4582-4602)Online publication date: Sep-2024
  • (2024)Personalized PageRanks over Dynamic Graphs - The Case for Optimizing Quality of Service2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00038(409-422)Online publication date: 13-May-2024
  • (2024)Application of an extrapolation method in the Hessenberg algorithm for computing PageRankThe Journal of Supercomputing10.1007/s11227-024-06327-y80:15(22836-22859)Online publication date: 1-Oct-2024
  • (2023)Recursive reordering and elimination method for efficient computation of PageRank problemsAIMS Mathematics10.3934/math.202312828:10(25104-25130)Online publication date: 2023
  • (2023)Social Networks for COVID-19 Across AfricaJournal of Asian and African Studies10.1177/00219096231200593Online publication date: 25-Sep-2023
  • (2023)Accelerating Personalized PageRank Vector ComputationProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599251(262-273)Online publication date: 6-Aug-2023
  • (2023)Axiomatic characterization of PageRankArtificial Intelligence10.1016/j.artint.2023.103900318(103900)Online publication date: May-2023
  • (2023)A survey on dynamic graph processing on GPUs: concepts, terminologies and systemsFrontiers of Computer Science10.1007/s11704-023-2656-118:4Online publication date: 16-Dec-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media