We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Automatic building of an ontology on the basis of text corpora in Thai.
- Authors
Imsombut, Aurawan; Kawtrakul, Asanee
- Abstract
This paper presents a methodology for automatic learning of ontologies from Thai text corpora, by extraction of terms and relations. A shallow parser is used to chunk texts on which we identify taxonomic relations with the help of cues: lexico-syntactic patterns and item lists. The main advantage of the approach is that it simplify the task of concept and relation labeling since cues help for identifying the ontological concept and hinting their relation. However, these techniques pose certain problems, i.e. cue word ambiguity, item list identification, and numerous candidate terms. We also propose the methodology to solve these problems by using lexicon and co-occurrence features and weighting them with information gain. The precision, recall and F-measure of the system are 0.74, 0.78 and 0.76, respectively.
- Subjects
ONTOLOGY; THAI language; TAI languages; LEXICON; SEMANTIC Web; SEMANTIC networks (Information theory); SEMANTIC integration (Computer systems)
- Publication
Language Resources & Evaluation, 2008, Vol 42, Issue 2, p137
- ISSN
1574-020X
- Publication type
Article
- DOI
10.1007/s10579-007-9045-5