Goldman et al., 1996 - Google Patents

Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses

Goldman et al., 1996

Document ID: 2710988542949552545
Author: Goldman N; Thorne J; Jones D
Publication year: 1996
Publication venue: Journal of molecular biology

External Links

Cited by

Snippet

Previously proposed methods for protein secondary structure prediction from multiple sequence alignments do not efficiently extract the evolutionary information that these alignments contain. The predictions of these methods are less accurate than they could be …

Continue reading at www.sciencedirect.com (other versions)

108090000623 proteins and genes 0 title abstract description 27

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/22—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/18—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for functional genomics or proteomics, e.g. genotype-phenotype associations, linkage disequilibrium, population genetics, binding site identification, mutagenesis, genotyping or genome annotation, protein-protein interactions or protein-nucleic acid interactions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/28—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/16—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for molecular structure, e.g. structure alignment, structural or functional relations, protein folding, domain topologies, drug targeting using structure data, involving two-dimensional or three-dimensional structures
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/24—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/20—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for hybridisation or gene expression, e.g. microarrays, sequencing by hybridisation, normalisation, profiling, noise correction models, expression ratio estimation, probe design or probe optimisation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30533—Other types of queries
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F17/30 and subgroups
- G06F2216/03—Data mining
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system

Similar Documents

Publication	Publication Date	Title
Goldman et al.	1996	Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses
Fischer et al.	1999	CAFASP‐1: Critical assessment of fully automated structure prediction methods
Shi et al.	2001	FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties
Kawabata et al.	2000	Protein structure comparison using the markov transition model of evolution
Salzberg	1997	A method for identifying splice sites and translational start sites in eukaryotic mRNA
US6128587A (en)	2000-10-03	Method and apparatus using Bayesian subfamily identification for sequence analysis
Chandonia et al.	1999	New methods for accurate prediction of protein secondary structure
US7974788B2 (en)	2011-07-05	Gene discovery through comparisons of networks of structural and functional relationships among known genes and proteins
Wiehe et al.	2001	SGP-1: prediction and validation of homologous genes based on sequence alignments
Dunbrack Jr	1999	Comparative modeling of CASP3 targets using PSI‐BLAST and SCWRL
Li et al.	2000	Saturated BLAST: an automated multiple intermediate sequence search used to detect distant homology
Geourjon et al.	2001	Identification of related proteins with weak sequence identity using secondary structure information
JP2005512015A (en)	2005-04-28	System and method for validating, aligning and reordering one or more gene sequence maps using at least one ordered restriction enzyme map
CN113555062B (en)	2022-07-12	Data analysis system and analysis method for genome base variation detection
Holm	1998	Unification of protein families
Drott et al.	1978	An empirical examination of Bradford's law and the scattering of scientific literature
Cline et al.	2019	Assessment of blind predictions of the clinical significance of BRCA1 and BRCA2 variants
Rodin et al.	2005	Mining genetic epidemiology data with Bayesian networks application to APOE gene variation and plasma lipid levels
Qian et al.	2003	Detecting distant homologs using phylogenetic tree‐based HMMs
Panchenko et al.	2002	A comparison of position‐specific score matrices based on sequence and structure alignments
Homaeian et al.	2007	Prediction of protein secondary structure content for the twilight zone sequences
von Ohsen et al.	2001	Improving profile-profile alignments via log average scoring
Dietmann et al.	2002	Automated detection of remote homology
Yin et al.	2004	GeneScout: a data mining system for predicting vertebrate genes in genomic DNA sequences
Çamoğlu et al.	2005	Decision tree based information integration for automated protein classification