[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3132847.3132960acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

GPU-Accelerated Graph Clustering via Parallel Label Propagation

Published: 06 November 2017 Publication History

Abstract

Graph clustering has recently attracted much attention as a technique to extract community structures from various kinds of graph data. Since available graph data becomes increasingly large, the acceleration of graph clustering is an important issue for handling large-scale graphs. To this end, this paper proposes a fast graph clustering method using GPUs. The proposed method is based on parallelization of label propagation, one of the fastest graph clustering algorithms. Our method has the following three characteristics: (1) efficient parallelization: the algorithm of label propagation is transformed into a sequence of data-parallel primitives; (2) load balance: the method takes into account load balancing by adopting the primitives that make the load among threads and blocks well balanced; and (3) out-of-core processing: we also develop algorithms to efficiently deal with large-scale datasets that do not fit into GPU memory. Moreover, this GPU out-of-core algorithm is extended to simultaneously exploit both CPUs and GPUs for further performance gain. Extensive experiments with real-world and synthetic datasets show that our proposed method outperforms an existing parallel CPU implementation by a factor of up to 14.3 without sacrificing accuracy.

References

[1]
J. Arai, H. Shiokawa, T. Yamamuro, M. Onizuka, and I. Sotetsu. Rabbit Order: Just-in-Time Parallel Reordering for Fast Graph Analysis. In Proc. IPDPS, pp. 22--31, 2016.
[2]
A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, vol. 286, no. 5439, pp. 509--512, Oct. 1999.
[3]
P. Boldi, M. Santini, and S. Vigna. A Large Time-Aware Graph. SIGIR Forum, vol. 42, no. 2, pp. 33--38, Dec. 2008.
[4]
S. Baxter. Modern GPU, ver. 1.0. https://github.com/moderngpu/moderngpu.
[5]
N. Bell and M. Garland. Implementing Sparse Matrix-Vector Multiplication on Throughput-Oriented Processors. In Proc. SC, pp. 18:1--18:11, 2009.
[6]
V. D. Blondel, J.-L. Guillaume, R. Lambiotte, and E. Lefebvre. Fast unfolding of communities in large networks. J. Stat. Mech. Theor. Exp., vol. 2008, no. 10, p. P10008, Oct. 2008.
[7]
M. Chen, K. Kuzmin, and B. K. Szymanski. Community Detection via Maximization of Modularity and Its Variants. IEEE Trans. Computational Soc. Syst., vol. 1, no. 1, pp. 46--65, Mar. 2014.
[8]
G. Cordasco and L. Gargano. Label Propagation Algorithm: A Semi-synchronous Approach. Int. J. Soc. Netw. Min., vol. 1, no. 1, pp. 3--26, 2012.
[9]
H. N. Djidjev and M. Onus. Scalable and Accurate Graph Clustering and Community Structure Detection. IEEE Trans. Parallel Distrib. Syst., vol. 24, no. 5, pp. 1022--1029, May 2013.
[10]
E. Duriakova, N. Hurley, D. Ajwani, and A. Sala. Analysis of the Semi-synchronous Approach to Large-scale Parallel Community Finding. In Proc. COSN, pp. 51--62, 2014.
[11]
S. Fortunato and M. Barthélemy. Resolution limit in community detection. Proc. Natl. Acad. Sci., vol. 104, no. 1, pp. 36--41, Dec. 2007.
[12]
S. Fortunato. Community detection in graphs. Phys. Rep., vol. 486, no. 3--5, pp. 75--174, Feb. 2010.
[13]
B. He, M. Lu, K. Yang, R. Fang, N. K. Govindaraju, Q. Luo, and P. V. Sander. Relational Query Coprocessing on Graphics Processors. ACM Trans. Database Syst., vol. 34, pp. 21:1--21:39, Dec. 2009.
[14]
A. Lancichinetti, S. Fortunato, and F. Radicchi. Benchmark graphs for testing community detection algorithms. Phys. Rev. E, vol. 78, no. 4, p. 046110, Oct. 2008.
[15]
J. Leskovec and A. Krevl. SNAP Datasets: Stanford large network dataset collection. http://snap.stanford.edu/data, Jun. 2014.
[16]
I. X. Y. Leung, P. Hui, P. Liò, and J. Crowcroft. Towards real-time community detection in large networks. Phys. Rev. E, vol. 79, p. 066107, Jun 2009.
[17]
D. Merrill. CUB, ver. 1.6.4. http://nvlabs.github.io/cub/.
[18]
M. E. J. Newman and M. Girvan. Finding and evaluating community structure in networks. Phys. Rev. E, vol. 69, no. 2, p. 026113, Feb. 2004.
[19]
NVIDIA. CUDA C Programming Guide. http://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf.
[20]
J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone, and J. C. Phillips. GPU Computing. Proc. IEEE, vol. 96, no. 5, pp. 879--899, May 2008.
[21]
S. Papadopoulos, Y. Kompatsiaris, A. Vakali, and P. Spyridonos. Community Detection in Social Media. Data Min. Knowl. Discov., vol. 24, no. 3, pp. 515--554, May 2012.
[22]
U. N. Raghavan, R. Albert, and S. Kumara. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E, vol. 76, p. 036106, Sep. 2007.
[23]
J. Soman and A. Narang. Fast Community Detection Algorithm with GPUs and Multicore Architectures. In Proc. IPDPS, pp. 568--579, 2011.
[24]
C. L. Staudt and H. Meyerhenke. Engineering Parallel Algorithms for Community Detection in Massive Networks. IEEE Trans. Parallel Distrib. Syst., vol. 27, no. 1, pp. 171--184, Jan. 2016.
[25]
T. R. Stovall, S. Kockara, and R. Avci. GPUSCAN: GPU-Based Parallel Structural Clustering Algorithm for Networks. IEEE Trans. Parallel Distrib. Syst., vol. 26, no. 12, pp. 3381--3393, Dec. 2015.
[26]
M. Wang, C. Wang, J. X. Yu, and J. Zhang. Community Detection in Social Networks: An In-depth Benchmarking Study with a Procedure-oriented Framework. Proc. VLDB Endow., vol. 8, no. 10, pp. 998--1009, Jun. 2015.
[27]
X. Xu, N. Yuruk, Z. Feng, and T. A. J. Schweiger. SCAN: A Structural Clustering Algorithm for Networks. In Proc. KDD, pp. 824--833, 2007.

Cited By

View all
  • (2024)Large-Scale Graph Label Propagation on GPUsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.333632936:10(5234-5248)Online publication date: Oct-2024
  • (2022)Direction-optimizing Label Propagation Framework for Structure Detection in Graphs: Design, Implementation, and Experimental AnalysisACM Journal of Experimental Algorithmics10.1145/356459327(1-31)Online publication date: 13-Dec-2022
  • (2022)DPISCAN: Distributed and parallel architecture with indexing for structural clustering of massive dynamic graphsInternational Journal of Data Science and Analytics10.1007/s41060-021-00303-y13:3(199-223)Online publication date: 12-Jan-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
November 2017
2604 pages
ISBN:9781450349185
DOI:10.1145/3132847
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. community detection
  2. gpu
  3. graph clustering
  4. label propagation

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM '17
Sponsor:

Acceptance Rates

CIKM '17 Paper Acceptance Rate 171 of 855 submissions, 20%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)24
  • Downloads (Last 6 weeks)2
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Large-Scale Graph Label Propagation on GPUsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.333632936:10(5234-5248)Online publication date: Oct-2024
  • (2022)Direction-optimizing Label Propagation Framework for Structure Detection in Graphs: Design, Implementation, and Experimental AnalysisACM Journal of Experimental Algorithmics10.1145/356459327(1-31)Online publication date: 13-Dec-2022
  • (2022)DPISCAN: Distributed and parallel architecture with indexing for structural clustering of massive dynamic graphsInternational Journal of Data Science and Analytics10.1007/s41060-021-00303-y13:3(199-223)Online publication date: 12-Jan-2022
  • (2022)IISCAN: Index-Based Incremental Structural Clustering Technique for Billion-Edge GraphsProceedings of the 13th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2021)10.1007/978-3-030-96302-6_4(43-54)Online publication date: 22-Feb-2022
  • (2021)GPU-Accelerated Graph Label Propagation for Real-Time Fraud DetectionProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452774(2348-2356)Online publication date: 9-Jun-2021
  • (2021)Detecting Communities in Online Learning RepositoryAnalysing Users' Interactions with Khan Academy Repositories10.1007/978-3-030-89166-4_7(57-64)Online publication date: 16-Nov-2021
  • (2020)ScaphProceedings of the 2020 USENIX Conference on Usenix Annual Technical Conference10.5555/3489146.3489185(573-588)Online publication date: 15-Jul-2020
  • (2020)V-CombinerProceedings of the 34th ACM International Conference on Supercomputing10.1145/3392717.3392739(1-13)Online publication date: 29-Jun-2020
  • (2020)Direction-optimizing label propagation and its application to community detectionProceedings of the 17th ACM International Conference on Computing Frontiers10.1145/3387902.3392634(192-201)Online publication date: 11-May-2020
  • (2020)RnR: A Software-Assisted Record-and-Replay Hardware Prefetcher2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)10.1109/MICRO50266.2020.00057(609-621)Online publication date: Oct-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media