[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3233547.3233703acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article
Public Access

Community Detection for Decoy Selection in Template-free Protein Structure Prediction

Published: 15 August 2018 Publication History

Abstract

Significant efforts are devoted to resolving biologically-active structures in wet and dry laboratories. In particular, due to hardware and algorithmic innovations, computational methods can now obtain thousands of structures that populate the structure space of a protein of interest. With such advances, attention turns to organizing computed structures to extract the underlying organization of the structure space in service of highlighting biologically-active structural states. In this paper we report on the promise of leveraging community detection methods, designed originally to detect communities in social networks, to organize protein structure spaces probed in silico. We report on a principled comparison of such methods along several metrics and on proteins of diverse folds and lengths. More importantly, we present a rigorous evaluation in the context of decoy selection in template-free protein structure prediction. The presented results make the case that network-based community detection methods warrant further investigation to advance analysis of protein structure spaces for automated selection of biologically-active structures.

References

[1]
N. Akhter and A. Shehu . 2017. From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-free Protein Structure Prediction. Molecules, Vol. 23, 1 (2017), 216.
[2]
Arieh Ben-Naim . 1997. Statistical potentials extracted from protein structures: are these meaningful potentials? The Journal of Chemical Physics Vol. 107, 9 (1997), 3698--3706.
[3]
H. M. Berman, K. Henrick, and H. Nakamura . 2003. Announcing the worldwide Protein Data Bank. Nat. Struct. Biol., Vol. 10, 12 (2003), 980--980.
[4]
Vincent D Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre . 2008. Fast unfolding of communities in large networks. Journal of Statistical Mechanics Vol. 2008 (2008), P10008.
[5]
D. D. Boehr and P. E. Wright . 2008. How do proteins interact? Science, Vol. 320, 5882 (2008), 1429--1430.
[6]
Renzhi Cao, Zheng Wang, Yiheng Wang, and Jianlin Cheng . 2014. SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines. BMC bioinformatics, Vol. 15, 1 (2014), 120.
[7]
R. Clausen, B. Ma, R. Nussinov, and A. Shehu . 2015. Mapping the Conformation Space of Wildtype and Mutant H-Ras with a Memetic, Cellular, and Multiscale Evolutionary Algorithm. PLoS Comput Biol, Vol. 11, 9 (2015), e1004470.
[8]
R. Clausen and A. Shehu . 2014. A Multiscale Hybrid Evolutionary Algorithm to Obtain Sample-based Representations of Multi-basin Protein Energy Landscapes ACM Conf on Bioinf and Comp Biol (BCB). Newport Beach, CA, 269--278.
[9]
Aaron Clauset, M. E. J. Newman, and Cristopher Moore . 2004. Finding community structure in very large networks. Phys. Rev. E Vol. 70 (2004), 066111.
[10]
M E J Newman . 2006. Finding Community Structure in Networks Using the Eigenvectors of Matrices. Physical review. E Vol. 74 (2006), 036104.
[11]
Anthony K Felts, Emilio Gallicchio, Anders Wallqvist, and Ronald M Levy . 2002. Distinguishing native conformations of proteins from decoys with an effective free energy estimator based on the opls all-atom force field and the surface generalized Born solvent model. Proteins: Structure, Function, and Bioinformatics, Vol. 48, 2 (2002), 404--422.
[12]
M. Girvan and M. E. J. Newman . 2002. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA Vol. 99, 12 (2002), 7821--7826.
[13]
Zhiquan He, Meshari Alazmi, Jingfen Zhang, and Dong Xu . 2013. Protein structural model selection by combining consensus and single scoring methods. PloS one, Vol. 8, 9 (2013), e74006.
[14]
Xiaoyang Jing, Kai Wang, Ruqian Lu, and Qiwen Dong . 2016. Sorting protein decoys by machine-learning-to-rank. Scientific Reports Vol. 6 (2016), 31571.
[15]
A. Kryshtafovych, A. Barbato, K. Fidelis, B. Monastyrskyy, T. Schwede, and A. Tramontano . 2014. Assessment of the assessment: evaluation of the model quality estimates in CASP10. Proteins, Vol. 82, Suppl 2 (2014), 112--126.
[16]
A. Leaver-Fay and others . 2011. ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol Vol. 487 (2011), 545--574.
[17]
Tong Liu, Yiheng Wang, Jesse Eickholt, and Zheng Wang . 2016. Benchmarking deep networks for predicting residue-specific quality of individual protein models in CASP11. Scientific reports Vol. 6 (2016), 19301.
[18]
Stephan Lorenzen and Yang Zhang . 2007. Identification of near-native structures by clustering protein docking conformations. PROTEINS: Structure, Function, and Bioinformatics, Vol. 68, 1 (2007), 187--194.
[19]
Balachandran Manavalan, Juyong Lee, and Jooyoung Lee . 2014. Random forest-based protein model quality assessment (RFMQA) using structural features and potential energy terms. PloS one, Vol. 9, 9 (2014), e106542.
[20]
T. Maximova, D. Carr, E. Plaku, and A. Shehu . 2016 a. Sample-based Models of Protein Structural Transitions ACM Conf Bioinf & Comp Biol (BCB). Seattle, WA, 128--137.
[21]
T. Maximova, R. Moffatt, B. Ma, R. Nussinov, and A. Shehu . 2016 b. Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics. PLoS Comp. Biol., Vol. 12, 4 (2016), e1004619.
[22]
T. Maximova, ZQi. Zhao, D. B. Carr, E. Plaku, and A. Shehu . 2017. Sample-based Models of Protein Energy Landscapes and Slow Structural Rearrangements. J Comput Biol, Vol. 25, 1 (2017), 33--50.
[23]
Shokoufeh Mirzaei, Tomer Sidi, Chen Keasar, and Silvia Crivelli . 2016. Purely structural protein scoring functions using support vector machine and ensemble learning. IEEE/ACM transactions on computational biology and bioinformatics (2016).
[24]
J. Moult, K. Fidelis, A. Kryshtafovych, T. Schwede, and A. Tramontano . 2014. Critical assessment of methods of protein structure prediction (CASP) -- round X. Proteins: Struct. Funct. Bioinf. Vol. 82 (2014), 109--115.
[25]
Son P Nguyen, Yi Shang, and Dong Xu . 2014. DL-PRO: A novel deep learning method for protein model quality assessment Neural Networks (IJCNN), 2014 International Joint Conference on. IEEE, 2071--2078.
[26]
B. Olson and A. Shehu . 2013. Multi-Objective Stochastic Search for Sampling Local Minima in the Protein Energy Surface ACM Conf on Bioinf and Comp Biol (BCB). Washington, D. C., 430--439.
[27]
Marcin Pawlowski, Lukasz Kozlowski, and Andrzej Kloczkowski . 2016. MQAPsingle: A quasi single-model approach for estimation of the quality of individual protein structure models. Proteins: Structure, Function, and Bioinformatics, Vol. 84, 8 (2016), 1021--1028.
[28]
Usha Nandini Raghavan, Réka Albert, and Soundar Kumara . 2007. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E Vol. 76 (2007), 036106.
[29]
M. Rosvall, D. Axelsson, and C. T. Bergstrom . 2009. The map equation. Eur. Phys. J. Special Topics Vol. 178 (2009), 13--23.
[30]
E. Sapin, D. B. Carr, K. A. De Jong, and A. Shehu . 2016. Computing energy landscape maps and structural excursions of proteins. BMC Genomics, Vol. 17, Suppl 4 (2016), 456.
[31]
Karolis Uziela and Björn Wallner . 2016. ProQ2: estimation of model accuracy implemented in Rosetta. Bioinformatics, Vol. 32, 9 (2016), 1411--1413.
[32]
D. Xu and Y. Zhang . 2012. Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins: Struct. Funct. Bioinf. Vol. 80, 7 (2012), 1715--1735.
[33]
J. Yang and J. Leskovec . 2012. Defining and Evaluating Network Communities based on Ground-truth Intl Conf on Data Mining (ICDM). 745--754.
[34]
Yang Zhang and Jeffrey Skolnick . 2004. SPICKER: A clustering approach to identify near-native protein folds. Journal of computational chemistry Vol. 25, 6 (2004), 865--871.

Cited By

View all
  • (2020)From molecular energy landscapes to equilibrium dynamics via landscape analysis and markov state modelsJournal of Bioinformatics and Computational Biology10.1142/S021972001940014617:06(1940014)Online publication date: 31-Jan-2020
  • (2019)Graph-Based Community Detection for Decoy Selection in Template-Free Protein Structure PredictionMolecules10.3390/molecules2405085424:5(854)Online publication date: 28-Feb-2019

Index Terms

  1. Community Detection for Decoy Selection in Template-free Protein Structure Prediction

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      BCB '18: Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
      August 2018
      727 pages
      ISBN:9781450357944
      DOI:10.1145/3233547
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 15 August 2018

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. community detection
      2. decoy selection
      3. nearest-neighbor graph
      4. protein structure space

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      BCB '18
      Sponsor:

      Acceptance Rates

      BCB '18 Paper Acceptance Rate 46 of 148 submissions, 31%;
      Overall Acceptance Rate 254 of 885 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)64
      • Downloads (Last 6 weeks)8
      Reflects downloads up to 19 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2020)From molecular energy landscapes to equilibrium dynamics via landscape analysis and markov state modelsJournal of Bioinformatics and Computational Biology10.1142/S021972001940014617:06(1940014)Online publication date: 31-Jan-2020
      • (2019)Graph-Based Community Detection for Decoy Selection in Template-Free Protein Structure PredictionMolecules10.3390/molecules2405085424:5(854)Online publication date: 28-Feb-2019

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media