[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3061639.3062194acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

Accelerating Graph Community Detection with Approximate Updates via an Energy-Efficient NoC

Published: 18 June 2017 Publication History

Abstract

Community detection is an advanced graph operation that is used to reveal tightly-knit groups of vertices (aka. communities) in real-world networks. Given the intractability of the problem, efficient heuristics are used in practice. Yet, even the best of these state-of-the-art heuristics can become computationally demanding over large inputs and can generate workloads that exhibit inherent irregularity in data movement on manycore platforms. In this paper, we posit that effective acceleration of the graph community detection operation can be achieved by reducing the cost of data movement through a combined innovation at both software and hardware levels. More specifically, we first propose an efficient software-level parallelization of community detection that uses approximate updates to cleverly exploit a diminishing returns property of the algorithm. Secondly, as a way to augment this innovation at the software layer, we design an efficient Wireless Network on Chip (WiNoC) architecture that is suited to handle the irregular on-chip data movements exhibited by the community detection algorithm under both unicast- and broadcast-heavy cache coherence protocols. Experimental results show that our resulting WiNoC-enabled manycore platform achieves on average 52% savings in execution time, without compromising on the quality of the outputs, when compared to a traditional manycore platform designed with a wireline mesh NoC and running community detection without employing approximate updates.

References

[1]
S. Fortunato "Community detection in graphs." Physics Reports, 486(3): 75--174.
[2]
K. Duraisamy, at al., "High-Performance and Energy-Efficient Network-on-Chip Architectures for Graph Analytics". ACM Transactions on Embedded. Computing Systems, 15(4), Article-66, 26 pages.
[3]
P. Wettin, et al. "Design Space Exploration for Wireless NoCs Incorporating Irregular Network Routing," IEEE TCAD, 33(11).
[4]
Graph500: http://www.graph500.org/.
[5]
D. Ediger, "Analyzing Hybrid Architectures for Massively Parallel Graph Analysis", Ph. D. Dissertation, Georgia Institute of Technology, May-2013.
[6]
D. A. Bader, et al. "On the architectural requirements for efficient execution of graph algorithms". In Proc. of ICPP, 2005, pp. 547--556.
[7]
M. Castro, et al. "Analysis of computing and energy perfor- ' mance of multicore, NUMA, and manycore platforms for an irregular application". In Proc. of 3rd Workshop on Irregular Applications: Architectures and Algorithms.
[8]
E. Francesquini, et al. "On the energy efficiency and performance of irregular application executions on multicore, NUMA and manycore platforms". Journal of Parallel and Distributed Computing, 76, pp. 32--48.
[9]
M. Frasca, et al. "NUMA-aware graph mining techniques for performance and energy efficiency". In Proc. of IEEE Intl. Conf. on High Performance Computing. Networking, Storage and Analysis (SC), pp. 1--11.
[10]
H. Lu, et al. "Balanced coloring for parallel computing applications", Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015.
[11]
D. Chavarrıa-Miranda, et al., "Scaling graph community detection on the Tilera manycore architecture," in HiPC 2014, 11 pages.
[12]
E. J. Riedy, et al. "Parallel community detection for massive graphs." in Parallel Processing and Applied Mathematics, pp. 286--296. Springer, 2012.
[13]
J. Soman and A. Narang, "Fast Community Detection Algorithm with GPUs and Multicore Architectures," In Proc. of IPDPS, 2011, pp. 568--579.
[14]
H. Lu, M. Halappanavar, A. Kalyanaraman, "Parallel Heuristics for Scalable Community Detection", Parallel Computing, vol. 47, pp. 597--606, 2015.
[15]
M. E. J. Newman, "Modularity and community structure in networks." Proceedings of the National Academy of Sciences, 103(23), pp. 8577--8582, 2006.
[16]
DIMACS10, The 10th DIMACS implementation challenge -- graph partioning and clustering. URL: http://www.cc.gatech.edu/dimacs10/
[17]
A. Ros, et al. "Dealing with Traffic-Area Trade-Off in Direct Coherence Protocols for Many-Core CMPs." Advanced Parallel Processing Technologies, Springer, pp. 11--27.
[18]
T. Petermann, P. Los Rios, "Spatial Small-World Networks: A Wiring Cost Perspective", arXiv: condmat/0501420v2
[19]
K. Duraisamy, R. Kim, P. Pande, "Enhancing Performance of Wireless NoCs with Distributed MAC Protocols", in Proceedings of ISQED, 2015.
[20]
M. P. Malumbres, and Duato, J. "An efficient implementation of tree-based multicast routing for distributed shared-memory multiprocessors". Jour. of Sys. Arch., 46(11), pp. 1019--1032.
[21]
T. Krishna, et al. "Towards the ideal on-chip fabric for 1-to-many and many-to-1 communication," In Proceedings of the MICRO, 2011, pp. 71--82.
[22]
N. Binkert, et al., "The GEM5 Simulator", ACM SIGARCH Computer Architecture News, 39(2):1--7, 2011.
[23]
S. Li, et al., "McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures", In Proc., of the 42nd Annual IEEE/ACM International Symp. on Microarchitecture, pp. 469--480, 2009.

Cited By

View all
  • (2020)Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge PruningProceedings of the 49th International Conference on Parallel Processing10.1145/3404397.3404455(1-11)Online publication date: 17-Aug-2020
  • (2019)A Brief Survey of Algorithms, Architectures, and Challenges toward Extreme-scale Graph Analytics2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)10.23919/DATE.2019.8715024(1307-1312)Online publication date: Mar-2019
  • (2019)ReplicaProceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3297858.3304033(849-863)Online publication date: 4-Apr-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DAC '17: Proceedings of the 54th Annual Design Automation Conference 2017
June 2017
533 pages
ISBN:9781450349277
DOI:10.1145/3061639
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 June 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Graph Community Detection
  2. Wireless Network-on-Chip

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

DAC '17
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Upcoming Conference

DAC '25
62nd ACM/IEEE Design Automation Conference
June 22 - 26, 2025
San Francisco , CA , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge PruningProceedings of the 49th International Conference on Parallel Processing10.1145/3404397.3404455(1-11)Online publication date: 17-Aug-2020
  • (2019)A Brief Survey of Algorithms, Architectures, and Challenges toward Extreme-scale Graph Analytics2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)10.23919/DATE.2019.8715024(1307-1312)Online publication date: Mar-2019
  • (2019)ReplicaProceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3297858.3304033(849-863)Online publication date: 4-Apr-2019
  • (2017)Approximate Computing Techniques for Iterative Graph Algorithms2017 IEEE 24th International Conference on High Performance Computing (HiPC)10.1109/HiPC.2017.00013(23-32)Online publication date: Dec-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media