- Title
Feature extraction based on information gain and sequential pattern for English question classification.
- Authors
Liu, Yaqing; Yi, Xiaokai; Chen, Rong; Zhai, Zhengguo; Gu, Jingxuan
- Abstract
The purpose of question classification (QC) is to assign a question to an appropriate category from the set of predefined categories that constitute a question taxonomy. Carefully selected question features can significantly improve QC performance. However, feature extraction, particularly syntactic feature extraction, has a high computational cost. To maintain or enhance performance without syntactic features, this study presents a hybrid approach that combines semantic feature extraction with lexical feature extraction. These features are generated by an improved information gain method and by sequential pattern mining, respectively. The selected features are then fed into classifiers for question classification. Benchmark testing is performed on the public UIUC data set. The results show that the proposed approach achieves a coarse-grained accuracy of 96% and a fine-grained accuracy of 90.4%, which is superior to existing methods.
- Publication
IET Software (Wiley-Blackwell), 2018, Vol 12, Issue 6, p520
- ISSN
1751-8806
- Publication type
Article
- DOI
10.1049/iet-sen.2018.0006
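
The abstract names information gain as the basis for semantic feature selection but does not describe the paper's "improved" variant. As a point of reference only, the following is a minimal sketch of standard (textbook) information gain for a binary "term present in question" feature. The function names, the whitespace tokenization, and the four toy questions are illustrative assumptions, not taken from the paper or from the UIUC data set itself.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(questions, labels, term):
    """Standard information gain of the binary feature 'term occurs in question'.

    Assumption: questions are tokenized by lowercasing and splitting on
    whitespace; the paper's own preprocessing is not given in the abstract.
    """
    with_term, without_term = [], []
    for question, label in zip(questions, labels):
        tokens = question.lower().split()
        (with_term if term in tokens else without_term).append(label)

    total = len(labels)
    # Conditional entropy H(class | feature), weighted by subset size.
    conditional = sum(
        len(subset) / total * entropy(subset)
        for subset in (with_term, without_term)
        if subset
    )
    return entropy(labels) - conditional


# Hypothetical toy data using two coarse category names (HUM, LOC).
questions = [
    "Who wrote Hamlet ?",
    "Who discovered penicillin ?",
    "Where is Paris ?",
    "Where is the Nile ?",
]
labels = ["HUM", "HUM", "LOC", "LOC"]

for term in ("who", "?"):
    print(term, round(information_gain(questions, labels, term), 3))
```

On this toy set, "who" separates the two classes perfectly and scores 1 bit, while "?" appears in every question and scores 0. A feature selector would keep the highest-scoring terms and discard the rest.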