[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2382936.2383057acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Parallelization of the spectral deconvolution stage of the proteomic discovery process

Published: 07 October 2012 Publication History

Abstract

Mass spectrometry is one of the primary analytical tools for the quantification and identification of proteins in human body. Scientists analyze the data generated from mass spectrometry instruments to detect protein patterns. The operations performed on mass spectrometric output to detect such patterns include spectrum visualization, deconvolution, alignment, normalization, statistical significance tests, and pattern recognition. Bindley Bioscience center in Purdue University's Discovery Park hosts a Omics Discovery Pipeline (ODP). The ODP is available as a web-based analysis platform that performs the aforementioned operations.
The time taken to generate output in the spectrum deconvolution stage is a major concern. Much of this concern results from the fact that this stage has to deal with a sizable number of large input files that are present in potentially many different input formats. Due to slow operation speed of this stage, overall performance of the process for protein quantification and identification is hampered. Our research work concentrates on discovering and implementing ways to increase the performance of the spectral deconvolution stage of the process for protein quantification used in the ODP by leveraging multi-core and multi-node architectures. By leveraging OpenMP and MPI, we were able to obtain an order of magnitude speedup of the spectrum deconvolution stage of the ODP.

References

[1]
Purdue Steele User Guide at https://www.xsede.org/purdue-steele, 2012.
[2]
R. D. Bjornson, N. J. Carriero, C. Colangelo, M. Shifman, K.-H. Cheung, P. L. Miller, and K. Williams. X!!Tandem, an Improved Method for Running X!Tandem in Parallel on Collections of Commodity Computers. Journal of Proteome Research, 7(1):293--299, 2008.
[3]
E. H.-H. Chi, E. Shoop, J. Carlis, E. Retzel, and J. Riedl. Efficiency of Shared-Memory Multiprocessors for a Genetic Sequence Similarity Search Algorithm, 1997.
[4]
D. T. Duncan, R. Craig, and A. J. Link. Parallel Tandem:âĂL' A Program for Parallel Processing of Tandem Mass Spectra Using PVM or MPI and X!Tandem. Journal of Proteome Research, 4(5):1842--1847, Aug. 2005.
[5]
J. Lichtenberg, K. Kurz, X. Liang, R. Al-ouran, L. Neiman, L. J. Nau, J. D. Welch, E. Jacox, T. Bitterman, K. Ecker, L. Elnitski, F. Drews, S. S. Lee, and L. R. Welch. WordSeeker: concurrent bioinformatics software for discovering genome-wide patterns and word-based genomic signatures. BMC bioinformatics, 11 Suppl 1(Suppl 12):S6, Jan. 2010.
[6]
H. N. B. Moseley, A. N. Lane, A. C. Belshoff, R. M. Higashi, and T. W. M. Fan. A novel deconvolution method for modeling UDP-N-acetyl-D-glucosamine biosynthetic pathways based on (13)C mass isotopologue profiles under non-steady-state conditions. BMC biology, 9(1):37, Jan. 2011.
[7]
B. Pratt, J. J. Howbert, N. I. Tasman, and E. J. Nilsson. MR-Tandem: Parallel X!Tandem using Hadoop MapReduce on Amazon Web Services. Bioinformatics, Nov. 2011.
[8]
R. G. Sadygov, J. Eng, E. Durr, A. Saraf, H. McDonald, M. J. MacCoss, and J. R. Yates. Code Developments to Improve the Efficiency of Automated MS/MS Spectra Interpretation. Journal of Proteome Research, 1(3):211--215, Mar. 2002.
[9]
S. E. Stein. An Integrated Method for Spectrum Extraction and Compound Identification from Gas Chromatography/Mass Spectrometry Data. J. Am. Soc. Mass Spectrom., 10:770--781, 1999.
[10]
O. Thorsen, B. E. Smith, C. P. Sosa, K. Jiang, H. Lin, A. E. Peters, and W.-c. Feng. Parallel genomic sequence-search on a massively parallel system. In U. Banerjee, J. Moreira, M. Dubois, and P. Stenström, editors, Conf. Computing Frontiers, pages 59--68. ACM, 2007.
[11]
X. Zhang, W. Hines, J. Adamec, J. M. Asara, S. Naylor, and F. E. Regnier. An automated method for the analysis of stable isotope labeling data in proteomics. Journal of the American Society for Mass Spectrometry, 16(7):1181--91, July 2005.
[12]
H. Zheng, P. C. Ojha, S. McClean, N. D. Black, J. G. Hughes, and C. Shaw. Heuristic charge assignment for deconvolution of electrospray ionization mass spectra. Rapid communications in mass spectrometry: RCM, 17(5):429--36, Jan. 2003.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '12: Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
October 2012
725 pages
ISBN:9781450316705
DOI:10.1145/2382936
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MPI
  2. OpenMP
  3. parallel computing
  4. proteomics

Qualifiers

  • Research-article

Funding Sources

Conference

BCB' 12
Sponsor:

Acceptance Rates

BCB '12 Paper Acceptance Rate 33 of 159 submissions, 21%;
Overall Acceptance Rate 254 of 885 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 91
    Total Downloads
  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media