- Title
Evaluating and automating the annotation of a learner corpus.
- Authors
Rosen, Alexandr; Hana, Jirka; Štindlová, Barbora; Feldman, Anna
- Abstract
The paper describes a corpus of texts produced by non-native speakers of Czech. We discuss its annotation scheme, consisting of three interlinked tiers, designed to handle a wide range of error types present in the input. Each tier corrects different types of errors; links between the tiers allow capturing errors in word order and complex discontinuous expressions. Errors are not only corrected, but also classified. The annotation scheme is tested on a data set of approximately 175,000 words, with fair inter-annotator agreement. We also explore the possibility of applying automated linguistic annotation tools (taggers, spell checkers and grammar checkers) to the learner text to support or even replace manual annotation.
- Subjects
ANNOTATIONS; SECOND language acquisition; GRAMMAR checkers (Computer software); SPELL checkers (Computer programs); CZECH language
- Publication
Language Resources & Evaluation, 2014, Vol 48, Issue 1, p65
- ISSN
1574-020X
- Publication type
Article
- DOI
10.1007/s10579-013-9226-3