[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1150402.1150443acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Using structure indices for efficient approximation of network properties

Published: 20 August 2006 Publication History

Abstract

Statistics on networks have become vital to the study of relational data drawn from areas such as bibliometrics, fraud detection, bioinformatics, and the Internet. Calculating many of the most important measures - such as betweenness centrality, closeness centrality, and graph diameter-requires identifying short paths in these networks. However, finding these short paths can be intractable for even moderate-size networks. We introduce the concept of a network structure index (NSI), a composition of (1) a set of annotations on every node in the network and (2) a function that uses the annotations to estimate graph distance between pairs of nodes. We present several varieties of NSIs, examine their time and space complexity, and analyze their performance on synthetic and real data sets. We show that creating an NSI for a given network enables extremely efficient and accurate estimation of a wide variety of network statistics on that network.

References

[1]
L. A. Adamic, R. M. Lukose, A. R. Puniyani, and B. A. Huberman. Search in power-law networks. Physical Review E, 64, 2001.
[2]
Y. Bartal. Probabilistic approximation of metric spaces and its algorithmic applications. In Proceedings of the 37th Annual IEEE Symposium on Foundations of Computer Science, 1996.
[3]
U. Brandes. A faster algorithm for betweenness centrality. Journal of Mathematical Sociology 25:163--177, 2001.
[4]
E. Chow. A graph search heuristic for shortest distance paths. Technical Report UCRL-JRNL-202894, Lawrence Livermore National Laboratory, 2004.
[5]
D. Coppersmith and S. Winograd. Matrix muliplication via arithmetic progressions. Journal of Symbolic Computing. 9:251--280, 1990.
[6]
P. Erdös and A. Rényi. On the evolution of random graphs. Publ. Math. Inst. Hungar. Acad. Sci. 5, 17--61. 1960.
[7]
C. Faloutsos, K. S. McCurley, and A. Tomkins. Fast discovery of connection subgraphs. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004.
[8]
R. W. Floyd. Algorithm 97 (shortest path). Communications of the ACM, 5(6):345, 1962.
[9]
L. C. Freeman. Centrality in social networks: Conceptual clarification. Social Networks 1:215--239, 1979.
[10]
N. Friedman, L. Getoor, D. Koller, and A. Pfeffer. Learning probabilistic relational models. In Proceedings of the International Joint Conference on Artificial Intelligence, 1999.
[11]
A. V. Goldberg and C. Harrelson. Computing the shortest path: A* search meets graph theory. In ACM-SIAM Symposium on Discrete Algorithms, 2005.
[12]
J. Kleinberg. Navigation in a small world. Nature, 406:845, 2000.
[13]
J. Kleinberg. The small-world phenomenon: An algorithmic perspective. In Proceedings of the 32nd ACM Symposium on Theory of Computing, 2000.
[14]
J. Kleinberg, A. Slivkins, and T. Wexler. Triangulation and embedding using small sets of beacons. In Proceedings of the 45th Annual IEEE Symposium on Foundations of Computer Science, 2004.
[15]
R. Krauthgamer and J. R. Lee. Navigating nets: Simple algorithms for proximity search. In ACM-SIAM Symposium on Discrete Algorithms, 2004.
[16]
J. Leskovec, J. Kleinberg, and C. Faloutsos. Graphs over time: Densification laws, shrinking diameters and possible explanations. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005.
[17]
J. Neville and D. Jensen. Dependency networks for relational data. In Proceedings of the 4th IEEE International Conference on Data Mining, 2004.
[18]
T. S. E. Ng and H. Zhang. Predicting Internet network distance with coordinates-based approaches. In Proceedings of the 21st Annual Joint Conference of the IEEE Computer and Communications Societies 1:170--179, 2002.
[19]
C. R. Palmer, P. B. Gibbons, and C. Faloutsos. ANF: A fast and scalable tool for data mining in massive graphs. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002.
[20]
M. Pias, J. Crowcroft, S. Wilbur, T. Harris, and S. Bhatti. Lighthouses for scalable distributed location. In Proceedings of the 2nd International Workshop on Peer-To-Peer Systems, 2003.
[21]
Y. Shavitt and T. Tankel. Big-Bang simulation for embedding network distances in Euclidean space. In Proceedings of IEEE Infocom, 2003.
[22]
A. Slivkins. Distance estimation and object location via rings of neighbors. In Proceedings of the ACM Symposium on Principles of Distributed Computing, 2005.
[23]
Ö. Şimşek and D. Jensen. Decentralized search in networks using homophily and degree disparity. In Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, 2005.
[24]
L. Tang and M. Crovella. Virtual landmarks for the Internet. In Proceedings of the Internet Measurement Conference, 2003.
[25]
D. J. Watts, P. S. Dodds, and M. E. J. Newman. Identity and search in social networks. Science, 296:1302--1305, 2002.
[26]
D. J. Watts and S. H. Strogatz. Collective dynamics of small-world networks. Nature, 393:440--442, 1998.
[27]
Wikipedia contributors (2006). Six Degrees of Kevin Bacon. Wikipedia, The Free Encyclopedia. Retrieved 15:10, April 15, 2006 from http://en.wikipedia.org/w/index.php? title=Six_Degrees_of_Kevin_Bacon&oldid=48040027.
[28]
B. Wong, A. Slivkins, and E. G. Sirer. Meridian: A lightweight network location service without virtual coordinates. In Proceedings of SIGCOMM, 2005.

Cited By

View all
  • (2024)Clustering graph data: the roadmap to spectral techniquesDiscover Artificial Intelligence10.1007/s44163-024-00102-x4:1Online publication date: 22-Jan-2024
  • (2024)Quickcent: a fast and frugal heuristic for harmonic centrality estimation on scale-free networksComputing10.1007/s00607-024-01303-z106:8(2675-2705)Online publication date: 8-Jun-2024
  • (2023)Estimating feature importance in circuit network using machine learningMultimedia Tools and Applications10.1007/s11042-023-16814-8Online publication date: 15-Sep-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2006
986 pages
ISBN:1595933395
DOI:10.1145/1150402
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 August 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. centrality
  2. knowledge discovery in graphs
  3. network structure index
  4. social network analysis

Qualifiers

  • Article

Conference

KDD06

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Clustering graph data: the roadmap to spectral techniquesDiscover Artificial Intelligence10.1007/s44163-024-00102-x4:1Online publication date: 22-Jan-2024
  • (2024)Quickcent: a fast and frugal heuristic for harmonic centrality estimation on scale-free networksComputing10.1007/s00607-024-01303-z106:8(2675-2705)Online publication date: 8-Jun-2024
  • (2023)Estimating feature importance in circuit network using machine learningMultimedia Tools and Applications10.1007/s11042-023-16814-8Online publication date: 15-Sep-2023
  • (2022)The Approximate Shortest Path Algorithm for Complex Networks Based on EIN Overlay NetworkComputer Science and Application10.12677/CSA.2022.12408212:04(806-816)Online publication date: 2022
  • (2022)GraphWalks: Efficient Shape Agnostic Geodesic Shortest Path Estimation2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW56347.2022.00335(2967-2976)Online publication date: Jun-2022
  • (2021)Centrality Measures: A Tool to Identify Key Actors in Social NetworksPrinciples of Social Networking10.1007/978-981-16-3398-0_1(1-27)Online publication date: 19-Aug-2021
  • (2021)Shortest Path Distance Prediction Based on CatBoostWeb Information Systems and Applications10.1007/978-3-030-87571-8_12(133-143)Online publication date: 17-Sep-2021
  • (2020)Approximation Algorithm for Shortest Path in Large Social NetworksAlgorithms10.3390/a1302003613:2(36)Online publication date: 6-Feb-2020
  • (2020)Scaling up network centrality computations – A brief overviewit - Information Technology10.1515/itit-2019-003262:3-4(189-204)Online publication date: 11-Mar-2020
  • (2019)Social Community Detection Scheme Based on Social-Aware in Mobile Social NetworksIEEE Access10.1109/ACCESS.2019.29561497(173407-173418)Online publication date: 2019
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media