[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3571885.3571892acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Exaflops biomedical knowledge graph analytics

Published: 18 November 2022 Publication History

Abstract

We are motivated by newly proposed methods for mining large-scale corpora of scholarly publications (e.g., full biomedical literature), which consists of tens of millions of papers spanning decades of research. In this setting, analysts seek to discover relationships among concepts. They construct graph representations from annotated text databases and then formulate the relationship-mining problem as an all-pairs shortest paths (APSP) and validate connective paths against curated biomedical knowledge graphs (e.g., Spoke). In this context, we present Coast (Exascale Communication-Optimized All-Pairs Shortest Path) and demonstrate 1.004 EF/s on 9,200 Frontier nodes (73,600 GCDs). We develop hyperbolic performance models (HyPerMod), which guide optimizations and parametric tuning. The proposed Coast algorithm achieved the memory constant parallel efficiency of 99% in the single-precision tropical semiring. Looking forward, Coast will enable the integration of scholarly corpora like PubMed into the Spoke biomedical knowledge graph.

Supplementary Material

MP4 File (SC22_Presentation_Kannan.mp4)
Presentation at SC '22

References

[1]
S. E. Baranzini, K. Börner, J. Morris, C. A. Nelson, K. Soman, E. Schleimer, M. Keiser, M. Musen, R. Pearce et al., "A biomedical open knowledge network harnesses the power of AI to understand deep human biology," AI Magazine, vol. 43, no. 1, pp. 46--58, 2022.
[2]
T. A. Tummino, V. V. Rezelj, B. Fischer, A. Fischer, M. J. O'Meara, B. Monel, T. Vallet, K. M. White, Z. Zhang, A. Alon et al., "Drug-induced phospholipidosis confounds drug repurposing for SARS-CoV-2," Science, vol. 373, no. 6554, pp. 541--547, 2021.
[3]
D. E. Gordon, G. M. Jang, M. Bouhaddou, J. Xu, K. Obernier, K. M. White, M. J. O'Meara, V. V. Rezelj, J. Z. Guo, D. L. Swaney et al., "A sars-cov-2 protein interaction map reveals targets for drug repurposing," Nature, vol. 583, no. 7816, pp. 459--468, 2020.
[4]
M. R. Garvin, C. Alvarez, J. I. Miller, E. T. Prates, A. M. Walker, B. K. Amos, A. E. Mast, A. Justice, B. Aronow, and D. Jacobson, "A mechanistic model and therapeutic interventions for covid-19 involving a ras-mediated bradykinin storm," elife, vol. 9, p. e59177, 2020.
[5]
Y. H. Kim, S. H. Beak, A. Charidimou, and M. Song, "Discovering new genes in the pathways of common sporadic neurodegenerative diseases: a bioinformatics approach," Journal of Alzheimer's Disease, vol. 51, no. 1, pp. 293--312, 2016.
[6]
S. H. Baek, D. Lee, M. Kim, J. H. Lee, and M. Song, "Enriching plausible new hypothesis generation in," PloS one, vol. 12, no. 7, 2017.
[7]
J. Sybrandt, M. Shtutman, and I. Safro, "Moliere: Automatic biomedical hypothesis generation system," in Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017, pp. 1633--1642.
[8]
E. Solomonik, A. Buluç, and J. Demmel, "Minimizing communication in all-pairs shortest paths," in Proceedings of the 27th IPDPS, 5 2013.
[9]
A. Buluç, J. R. Gilbert, and C. Budak, "Solving path problems on the GPU," Parallel Computing, vol. 36, no. 5--6, pp. 241--253, 2010.
[10]
R. Kannan, P. Sao, H. Lu, D. Herrmannova, V. Thakkar, R. Patton, R. Vuduc, and T. Potok, "Scalable knowledge graph analytics at 136 petaflop/s," in SC20. IEEE, 2020, pp. 1--13.
[11]
L. L. Wang, K. Lo, Y. Chandrasekhar, R. Reas, J. Yang, D. Eide, K. Funk, R. Kinney, Z. Liu, W. Merrill et al., "CORD-19: The Covid-19 Open Research Dataset," arXiv preprint arXiv:2004.10706, 2020. [Online]. Available: https://www.semanticscholar.org/cord19
[12]
T. C. Rindflesch, H. Kilicoglu, M. Fiszman, G. Rosemblat, and D. Shin, "Semantic medline: An advanced information management application for biomedicine," Information Services & Use, vol. 31, no. 1--2, pp. 15--21, 2011.
[13]
J. T. Fineman and E. Robinson, "Fundamental graph algorithms," in Graph Algorithms in the Language of Linear Algebra, J. Kepner and J. Gilbert, Eds. Philadelphia, PA, USA: Society of Industrial and Applied Mathematics, 2011, ch. 5, pp. 45--58.
[14]
AMD. AMD CDNA 2 white paper. AMD. [Online]. Available: https://www.amd.com/system/files/documents/amd-cdna2-white-paper.pdf
[15]
J. Dongarra, M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, and I. Yamazaki, "Accelerating numerical dense linear algebra calculations with gpus," Numerical Computations with GPUs, pp. 1--26, 2014.
[16]
T. A. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM TOMS, vol. 38, no. 1, p. 1, 2011.
[17]
K.-H. Huang and J. A. Abraham, "Algorithm-based fault tolerance for matrix operations," IEEE Transactions on Computers, vol. 100, no. 6, pp. 518--528, 1984.
[18]
S. Huang, J. Morris, and S. E. Baranzini, "Neighborhood explorer." [Online]. Available: https://spoke.rbvi.ucsf.edu/
[19]
O. Bodenreider, "The Unified Medical Language System (UMLS): integrating biomedical terminology," Nucleic Acids Research, vol. 32, pp. D267--D270, 2004.

Cited By

View all
  • (2023)Optimizing Communication in 2D Grid-Based MPI Applications at ExascaleProceedings of the 30th European MPI Users' Group Meeting10.1145/3615318.3615327(1-11)Online publication date: 11-Sep-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '22: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
November 2022
1277 pages
ISBN:9784665454445

Sponsors

In-Cooperation

  • IEEE CS

Publisher

IEEE Press

Publication History

Published: 18 November 2022

Check for updates

Author Tags

  1. high-performance computing
  2. parallel algorithms
  3. shortest path problem

Qualifiers

  • Research-article

Conference

SC '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)26
  • Downloads (Last 6 weeks)3
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Optimizing Communication in 2D Grid-Based MPI Applications at ExascaleProceedings of the 30th European MPI Users' Group Meeting10.1145/3615318.3615327(1-11)Online publication date: 11-Sep-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media