We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Borderline Over-sampling in Feature Space for Learning Algorithms in Imbalanced Data Environments.
- Authors
Savetratanakaree, Kittipat; Sookhanaphibarn, Kingkarn; Intakosum, Sarun; Thawonmas, Ruck
- Abstract
In this paper, we propose a new approach to over-sample new minority-class instances along the borderline using the Euclidean distance in the feature space to improve support vector machine (SVM) performance in imbalanced data environments. SVM has been an outstandingly successful classifier in a wide variety of applications where balanced class data distribution is assumed. However, SVM is ineffective when coping with imbalanced datasets whereby the majorityclass instances far outnumber the minority-class instances. Our new approach, called Borderline Over-sampling in the Feature Space, can deal with imbalanced data to effectively recognize new minority-class instances for better classification with SVM. The results of our class prediction experiments using the proposed approach demonstrate better performance than the existing SMOTE, Borderline-SMOTE and borderline over-sampling methods in terms of the g-mean and F-measure.
- Subjects
FEATURE selection; MACHINE learning; SUPPORT vector machines; EUCLIDEAN distance; DATA distribution
- Publication
IAENG International Journal of Computer Science, 2016, Vol 43, Issue 3, p66
- ISSN
1819-656X
- Publication type
Article