We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Feature Selection: A Preprocess for Data Perturbation.
- Authors
Pengpeng Lin; Thapa, Nirmal; St. Omer, Ingrid; Lian Liu; Zhang, Jun
- Abstract
As a major concern in designing various data mining applications, privacy preservation has become a critical component seeking a trade-off between mining performances and protecting sensitive information. Data perturbation or distortion is a widely used approach for privacy protection. Many privacy preservation approaches were developed, either by adding noises or by matrix decomposition methods. In this paper, we intensively studied Singular Value Decomposition (SVD) based data distortion strategy and feature selection techniques, and conducted experiments to explore how feature selection technique could be used and better serve for privacy preservation purpose. Sparsified Singular Value Decomposition (SSVD) and filter based feature selection are used for data distortion and reducing feature space. We design a modified version of Exponential Threshold Strategy (ETS) as our threshold function for matrix sparsification process, and implement several metrics to measure data perturbation level. We also propose a novel algorithm to compute rank and analyze its lower running time bound. The mining utility of distorted data is tested with a well known Classifier, Support Vector Machine (SVM).
- Subjects
DATA mining; PERTURBATION theory; SUPPORT vector machines; DATA protection; COMPUTER security; ALGORITHMS; MATHEMATICAL decomposition
- Publication
IAENG International Journal of Computer Science, 2011, Vol 38, Issue 2, p168
- ISSN
1819-656X
- Publication type
Article