[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2147805.2147842acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
short-paper

Formatt: correcting protein multiple structural alignments by sequence peeking

Published: 01 August 2011 Publication History

Abstract

We present Formatt, a multiple structure alignment program based on the Matt purely geometric multiple structural alignment program, that also takes into account sequence similarity when constructing alignments. We show that Formatt is superior to Matt in alignment quality based on objective measures (most notably Staccato sequence and structure scores) while preserving the same advantages in core length and RMSD that Matt has as a flexible structure aligner, as compared to other multiple structure alignment programs on popular benchmark datasets. Applications include producing better training data for threading methods.

References

[1]
N. Daniels, A. Kumar, L. Cowen, and M. Menke. Touring protein space with Matt. Bioinformatics Research and Applications, 6053/2010:18--28, Jan 2010.
[2]
C. Do, M. Mahabhashyam, M. Brudno, and S. Batzoglou. Probcons: probabilistic consistency-based multiple sequence alignment. Genome Research, 15:220--240, 2005.
[3]
R. Edgar. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res., 32:1792--1797, 2004.
[4]
H. Hasegawa and L. Holm. Advances and pitfalls of protein structural alignment. Current Opinion in Structural Biology, 19(3):341--348, 2009.
[5]
K. Katoh and H. Toh. Recent developments in the MAFFT multiple sequence alignment program. Briefings in Bioinformatics, 9:286--298, 2008.
[6]
C. Kim and B. Lee. Accuracy of structure-based sequence alignment of automatic methods. BMC Bioinformatics, 8:355, 2007.
[7]
A. Konagurthu, J. Whisstock, P. Stuckey, and A. Lesk. MUSTANG: A multiple structural alignment algorithm. Proteins: Structure, Function, and Bioinformatics, 64:559--574, 2006.
[8]
A. Kumar and L. Cowen. Recognition of beta structural motifs using hidden Markov models trained with simulated evolution. Bioinformatics, 26:i287--i293, 2010.
[9]
M. Madhusudhan, B. M. Webb, M. A. Marti-Renom, N. Eswar, and A. Sali. Alignment of multiple protein structures based on sequence and structure features. Protein Engineering, Design and Selection, pages 1--6, 2009.
[10]
M. Menke, B. Berger, and L. Cowen. Matt: Local flexibility aids protein multiple structure alignment. PLoS Computational Biology, 4(1):e10, 2008.
[11]
M. Menke, B. Berger, and L. Cowen. Markov random fields reveal an N-terminal double propeller motif as part of a bacterial hybrid two-component sensor system. PNAS, 107:4069--4074, 2010.
[12]
K. Mizuguchi, C. Deane, T. L. Blundell, and J. Overington. HOMSTRAD: a database of protein structure alignments for homologous families. Protein Sci., 11:2469--2471, 1998.
[13]
R. Mosca, B. Brannetti, and T. R. Schneider. Alignment of protein structures in the presence of domain motions. BMC Bioinformatics, 9:352, 2008.
[14]
C. Notredame, D. Higgins, and J. Heringa. T-coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology, 302(1):205--217, 2000.
[15]
O. O'Sullivan, K. Suhre, C. Abergel, D. Higgins, and C. Notredame. 3DCoffee: combining protein sequences and structures with multiple sequence alignments. Journal of Molecular Biology, 340:385--395, 2004.
[16]
J. Pei and N. Grishin. MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structure information. Nucleic Acids Res., 34:4364--4374, 2006.
[17]
J. Pei, B. H. Kim, and N. V. Grishin. PROMALS3D: a tool for multiple protein sequence and structure alignments. Nucleic Acids Res., 36:2295--2300, 2008.
[18]
S. Salem, M. J. Zaki, and C. Bystroff. FlexSnap: Flexible non-sequential protein structure alignment. Algorithms in Molecular Biology, 12(5), 2010.
[19]
M. Shatsky, R. Nussinov, and H. Wolfson. Flexible protein alignment and hinge detection. Proteins, 48:242--256, 2002.
[20]
M. Shatsky, R. Nussinov, and H. Wolfson. Optimization of multiple-sequence alignment based on multiple-structure alignment. Proteins: Structure, Function and Bioinformatics, 62:209--217, 2006.
[21]
J. D. Thompson, D. G. Higgins, and T. J. Gibson. CLUSTAL-W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22:4673--4680, 1994.
[22]
I. VanWalle, I. Lasters, and L. Wyns. SABmark-a benchmark for sequence alignment that covers the entire known fold space. Bioinformatics, 21:1267--1268, 2005.
[23]
Y. Ye and A. Godzik. Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics, Suppl 2:II246--II255, 2003.
[24]
Y. Ye and A. Godzik. Multiple flexible structure alignment using partial order graphs. Bioinformatics, 21:2362--2369, 2005.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '11: Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine
August 2011
688 pages
ISBN:9781450307963
DOI:10.1145/2147805
  • General Chairs:
  • Robert Grossman,
  • Andrey Rzhetsky,
  • Program Chairs:
  • Sun Kim,
  • Wei Wang
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 2011

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Short-paper

Funding Sources

Conference

BCB' 11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 885 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 75
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media