McGuffin, L J; Bryson, K; Jones, D T

doi:10.1093/bioinformatics/17.1.63

Title: What are the baselines for protein fold recognition?
Authors: McGuffin, L J; Bryson, K; Jones, D T
Abstract: What constitutes a baseline level of success for protein fold recognition methods? As fold recognition benchmarks are often presented without any thought to the results that might be expected from a purely random set of predictions, an analysis of fold recognition baselines is long overdue. Given varying amounts of basic information about a protein, ranging from the length of the sequence to a knowledge of its secondary structure, to what extent can the fold be determined by intelligent guesswork? Can simple methods that make use of secondary structure information assign folds more accurately than purely random methods, and could these methods be used to construct viable hierarchical classifications? EXPERIMENTS PERFORMED: A number of rapid automatic methods which score similarities between protein domains were devised and tested. These methods ranged from those that incorporated no secondary structure information, such as measuring absolute differences in sequence lengths, to more complex alignments of secondary structure elements. Each method was assessed for accuracy by comparison with the Class Architecture Topology Homology (CATH) classification. Methods were rated against both a random baseline fold assignment method as a lower control and FSSP as an upper control. Similarity trees were constructed in order to evaluate the accuracy of optimum methods at producing a classification of structure.
Publication: Bioinformatics (Oxford, England), 2001, Vol 17, Issue 1, p63
ISSN: 1367-4803
Publication type: Journal Article
DOI: 10.1093/bioinformatics/17.1.63
