We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Towards completion of the Earth's proteome.
- Authors
Perez-Iratxeta, Carolina; Palidwor, Gareth; Andrade-Navarro, Miguel A
- Abstract
New protein sequences are deposited in databases at an accelerating pace; however, many of these are homologous to known proteins and could be considered redundant. If all historical releases of the protein database are analysed using the original sequence-clustering procedure described here, the fraction of newly sequenced proteins that are redundant is increasing. We interpret this as an indication that the sequencing of the Earth's proteome--the complete set of proteins on Earth--is approaching completion. We estimate the approximate size of the Earth's proteome to be 5 million sequences, most of which will be identified during the next 5 years. As the Earth's proteome nears completion, cluster analysis of the protein database will become essential to identify under-explored taxa to which future sequencing efforts should be directed and to focus research on protein families without experimental characterization.
- Publication
EMBO reports, 2007, Vol 8, Issue 12, p1135
- ISSN
1469-221X
- Publication type
Journal Article
- DOI
10.1038/sj.embor.7401117