- Title
LINGUISTIC FEATURE CLASSIFYING AND TRACING.
- Authors
Moohebat, Mohammadreza; Raj, Ram Gopal; Thorleuchter, Dirk; Kareem, Sameem Binti Abdul
- Abstract
We investigate the identification and analysis of linguistic (lexico-grammatical) features that are characteristic of articles from a specific year of publication. Linguistic features differ from shallow features because they represent authors' lexico-grammatical writing styles and do not rely on the well-known bag-of-words model. The current literature focusses on shallow features rather than linguistic features, and existing methods for identifying linguistic features use well-known knowledge-structure-based approaches. In contrast, we advance these existing methods by applying semantic clustering instead of knowledge-structure-based approaches. For evaluation purposes, a linguistic feature-based prediction model is built to enable the automated assignment of articles to their years of publication. In a case study, the proposed methodology is applied to articles of the Springer book series 'Communications in Computer and Information Science' published from 2009 to 2013. The case study results show the feasibility of the proposed approach compared with a frequently used baseline.
- Subjects
LANGUAGE classification; LATENT semantic analysis; TEXT mining; LEXICON; GRAMMATICALITY (Linguistics); LEXICAL grammar
- Publication
Malaysian Journal of Computer Science, 2017, Vol 30, Issue 2, p77
- ISSN
0127-9084
- Publication type
Article
- DOI
10.22452/mjcs.vol30no2.1