We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
DISCOVERING AUTOMATED LEXICOGRAPHY: THE CASE OF THE SLOVENE LEXICAL DATABASE.
- Authors
Gantar, Polona; Kosem, Iztok; Krek, Simon
- Abstract
In this paper, we describe the compilation of the Slovene Lexical Database; main focus being on developing the methodology to improve the tools used for lexicographic analysis and to introduce automatic data extraction in the lexicographic process. The semiautomated approach, which was devised in the last stages of database compilation, involved extracting corpus data, i.e. grammatical relations, collocations, examples, and grammatical labels, and conducting lexicographic analysis in the dictionary-writing system rather than in the corpus tool. An evaluation that compared the manual approach with the semi-automatic approach showed that the semi-automatic approach is much quicker and presents the lexicographers with almost all the information they identified as relevant during the manual analysis, as well as additional potentially relevant information for the dictionary entry. The final section of the paper proposes a few avenues for improvement of the semi-automated approach, including the implementation of crowdsourcing and additional post-processing of automatically extracted data.
- Subjects
LEXICOGRAPHY; ENCYCLOPEDIAS &; dictionaries; LEXICON; ONLINE databases; DATA extraction; CROWDSOURCING
- Publication
International Journal of Lexicography, 2016, Vol 29, Issue 2, p200
- ISSN
0950-3846
- Publication type
Article
- DOI
10.1093/ijl/ecw014