We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
GENETAG: a tagged corpus for gene/protein named entity recognition.
- Authors
Tanabe, Lorraine; Xie, Natalie; Thom, Lynne H; Matten, Wayne; Wilbur, W John
- Abstract
Named entity recognition (NER) is an important first step for text mining the biomedical literature. Evaluating the performance of biomedical NER systems is impossible without a standardized test corpus. The annotation of such a corpus for gene/protein name NER is a difficult process due to the complexity of gene/protein names. We describe the construction and annotation of GENETAG, a corpus of 20K MEDLINE sentences for gene/protein NER. 15K GENETAG sentences were used for the BioCreAtIvE Task 1A Competition.
- Publication
BMC bioinformatics, 2005, Vol 6 Suppl 1, pS3
- ISSN
1471-2105
- Publication type
Journal Article
- DOI
10.1186/1471-2105-6-S1-S3