We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
A new approach for textual feature selection based on N-composite isolated labels.
- Authors
Elloumi, Samir
- Abstract
Textual Feature Selection (TFS) aims to extract relevant parts or segments from text as being the most relevant ones w.r.t. the information it expresses. The selected features are useful for automatic indexing, summarization, document categorization, knowledge discovery, so on. Regarding the huge amount of electronic textual data daily published, many challenges related to the semantic aspect as well as the processing efficiency are addressed. In this paper, we propose a new approach for TFS based on Formal Concept Analysis background. Mainly, we propose to extract textual features by exploring the regularities in a formal context where isolated points exist. We introduce the notion of N -composite isolated points as a set of N words to be considered as a unique textual feature. We show that a reduced value of N (between 1 and 3) allows extracting significant textual features compared with existing approaches even for non-completely covering an initial formal context.
- Subjects
FEATURE selection; POINT set theory; LABELS
- Publication
Natural Language Engineering, 2020, Vol 26, Issue 2, p221
- ISSN
1351-3249
- Publication type
Article
- DOI
10.1017/S1351324919000160