- Title
Design and development of Iberia: a corpus of scientific Spanish.
- Authors
Porta Zamorano, Jordi; del Rosal García, Emilio; Ahumada Lara, Ignacio
- Abstract
Iberia is a synchronic corpus of scientific Spanish designed mainly for terminological studies. In this paper, we describe its design and the infrastructure for its acquisition, processing and exploitation, including mark-up, linguistic annotation, indexing and the user interface. Two pre-processing tasks affecting a large number of words are described in detail: de-hyphenation and identification of text fragments in other languages. We also show how some of the reported statistics, namely, dispersion and association, are used for research on lexis.
- Subjects
IBERIAN Peninsula; TECHNICAL Spanish; CORPORA; LINGUISTICS; LEXICOGRAPHY
- Publication
Corpora, 2011, Vol 6, Issue 2, p145
- ISSN
1749-5032
- Publication type
Article
- DOI
10.3366/cor.2011.0010