We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Building a learner corpus.
- Authors
Hana, Jirka; Rosen, Alexandr; Štindlová, Barbora; Štěpánek, Jan
- Abstract
The need for data about the acquisition of Czech by non-native learners prompted the compilation of the first learner corpus of Czech. After introducing its basic design and parameters, including a multi-tier manual annotation scheme and error taxonomy, we focus on the more technical aspects: the transcription of hand-written source texts, process of annotation, and options for exploiting the result, together with tools used for these tasks and decisions behind the choices. To support or even substitute manual annotation we assign some error tags automatically and use automatic annotation tools (tagger, spell checker).
- Subjects
CZECH language; CORPORA; ANNOTATIONS; SPELL checkers (Computer programs); NATIVE language; GRAMMAR
- Publication
Language Resources & Evaluation, 2014, Vol 48, Issue 4, p741
- ISSN
1574-020X
- Publication type
Article
- DOI
10.1007/s10579-014-9278-z