A Cosine-Similarity Mutual-Information Approach for Feature Selection on High Dimensional Datasets
A novel hybrid method based on Cosine Similarity and Mutual Information is presented to find out relevant feature subset. Initially, the supervised Cosine Similarity of each feature is calculated with respect to the class vector and then features are grouped based on the obtained cosine similarity values. From each group the best mutual informative feature is selected. The selected features subset is tested using the three classifiers namely Naïve Bayes (NB), K-Nearest Neighbor and Classification and Regression trees (CART) for getting classification accuracy. The proposed method is applied to various high dimensional datasets. Obtained results showed that the proposed method is capable of eliminating the redundant and irrelevant features.
Year of publication: |
2017
|
---|---|
Authors: | Dubey, Vimal Kumar ; Saxena, Amit Kumar |
Published in: |
Journal of Information Technology Research (JITR). - IGI Global, ISSN 1938-7865, ZDB-ID 2403406-X. - Vol. 10.2017, 1 (01.01.), p. 15-28
|
Publisher: |
IGI Global |
Subject: | Classification | Cosine Similarity | Feature Selection | Gram-Schmidt Orthogonalisation | Mutual Information |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Feature selection for fault level diagnosis of planetary gearboxes
Liu, Zhiliang, (2014)
-
Fuzzy Mutual Information Feature Selection Based on Representative Samples
Salem, Omar A. M., (2018)
-
Estimating mutual information for feature selection in the presence of label noise
Frénay, Benoît, (2014)
- More ...