Department of Information Management and Business Analytics Faculty Scholarship and Creative Works

Dimensionality Reduction With Unsupervised Feature Selection and Applying Non-Euclidean Norms for Classification Accuracy

Amit Saxena, Guru Ghasidas University
John Wang, Montclair State UniversityFollow

Document Type

Article

Publication Date

4-1-2010

Abstract

This article presents a two-phase scheme to select reduced number of features from a dataset using Genetic Algorithm (GA) and testing the classification accuracy (CA) of the dataset with the reduced feature set. In the first phase of the proposed work, an unsupervised approach to select a subset of features is applied. GA is used to select stochastically reduced number of features with Sammon Error as the fitness function. Different subsets of features are obtained. In the second phase, each of the reduced features set is applied to test the CA of the dataset. The CA of a data set is validated using supervised k-nearest neighbor (k-nn) algorithm. The novelty of the proposed scheme is that each reduced feature set obtained in the first phase is investigated for CA using the k-nn classification with different Minkowski metric i.e. non-Euclidean norms instead of conventional Euclidean norm (L2). Final results are presented in the article with extensive simulations on seven real and one synthetic, data sets. It is revealed from the proposed investigation that taking different norms produces better CA and hence a scope for better feature subset selection.

DOI

10.4018/jdwm.2010040102

MSU Digital Commons Citation

Saxena, Amit and Wang, John, "Dimensionality Reduction With Unsupervised Feature Selection and Applying Non-Euclidean Norms for Classification Accuracy" (2010). Department of Information Management and Business Analytics Faculty Scholarship and Creative Works. 61.
https://digitalcommons.montclair.edu/infomgmt-busanalytics-facpubs/61

This document is currently not available here.

COinS

Department of Information Management and Business Analytics Faculty Scholarship and Creative Works

Dimensionality Reduction With Unsupervised Feature Selection and Applying Non-Euclidean Norms for Classification Accuracy

Document Type

Publication Date

Abstract

DOI

MSU Digital Commons Citation

Search

Browse

Author Corner

Links

Contact Us

Department of Information Management and Business Analytics Faculty Scholarship and Creative Works

Dimensionality Reduction With Unsupervised Feature Selection and Applying Non-Euclidean Norms for Classification Accuracy

Authors

Document Type

Publication Date

Abstract

DOI

MSU Digital Commons Citation

Share

Search

Browse

Author Corner

Links

//<![CDATA[ document.write("<a href='mailto:" + "digitalcommons" + "@" + "mail.montclair.edu" + "'>" + "Contact Us" + "<\/a>") //]]> Contact Us

Contact Us