DOI: 10.1007/978-3-642-13078-6_4
Article

Touring protein space with Matt

Published: 23 May 2010 Publication History

Abstract

Using the Matt structure alignment program, we take a tour of protein space, producing a hierarchical clustering scheme that divides protein structural domains into clusters based on geometric dissimilarity. While purely structural, geometric, distance-based measures of structural similarity, such as Dali/FSSP, were known to largely replicate hand-curated schemes such as SCOP at the family level, it remained an open question whether any such scheme could approximate SCOP at the more distant superfamily and fold levels. We partially answer this question in the affirmative by designing a clustering scheme based on Matt that approximately matches SCOP at the superfamily level. Implications for the debate over the organization of protein fold space are discussed.
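
To make the idea of clustering domains by geometric dissimilarity concrete, here is a minimal sketch in Python. It is not the authors' procedure: the abstract does not say which clustering algorithm or distance cutoffs the paper uses, so plain average-linkage agglomerative clustering stands in for it, and the function name `average_linkage`, the toy matrix, and the thresholds are all hypothetical. In practice the matrix entries would come from pairwise structure alignments such as those Matt produces.

```python
from itertools import combinations


def average_linkage(dist, threshold):
    """Agglomerative clustering on a symmetric dissimilarity matrix.

    Repeatedly merges the two closest clusters (average linkage) and stops
    once the closest pair is farther apart than `threshold`.
    Assumption: `dist` is a list of lists of pairwise dissimilarities.
    """
    clusters = [[i] for i in range(len(dist))]

    def linkage(a, b):
        # Mean dissimilarity over all cross-cluster pairs.
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > 1:
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[x], clusters[y]) > threshold:
            break
        merged = clusters[x] + clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)]
        clusters.append(merged)
    return sorted(sorted(c) for c in clusters)


# Hypothetical toy data: domains 0 and 1 are close, 2 and 3 are close,
# and the two pairs are far from each other.
toy = [
    [0.0, 1.0, 8.0, 9.0],
    [1.0, 0.0, 7.0, 8.0],
    [8.0, 7.0, 0.0, 2.0],
    [9.0, 8.0, 2.0, 0.0],
]

fine = average_linkage(toy, threshold=3.0)     # tight cutoff
coarse = average_linkage(toy, threshold=10.0)  # loose cutoff
print(fine)
print(coarse)
```

Cutting the same hierarchy at a tight and a loose threshold gives nested groupings, which is loosely analogous to comparing clusters against SCOP at the family level versus the superfamily level.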

Cited By

  • (2015) MRFy. IEEE/ACM Transactions on Computational Biology and Bioinformatics 12:1 (4-16). DOI: 10.1109/TCBB.2014.2344682. Online publication date: 1-Jan-2015
  • (2012) Touring Protein Space with Matt. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9:1 (286-293). DOI: 10.1109/TCBB.2011.70. Online publication date: 1-Jan-2012
  • (2011) Formatt. Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine (315-319). DOI: 10.1145/2147805.2147842. Online publication date: 1-Aug-2011

Published In

ISBRA'10: Proceedings of the 6th international conference on Bioinformatics Research and Applications
May 2010
253 pages
ISBN: 3642130771
Editors: Mark Borodovsky, Johann Peter Gogarten, Teresa M. Przytycka, Sanguthevar Rajasekaran

Publisher

Springer-Verlag

Berlin, Heidelberg
