"Optimal Feature Selection for Nearest Centroid Classifiers, With Appli" by Alan R. Dabney and John D. Storey

UW Biostatistics Working Paper Series

Title

Optimal Feature Selection for Nearest Centroid Classifiers, With Applications to Gene Expression Microarrays

Authors

Alan R. Dabney, University of Washington
John D. Storey, University of Washington

Abstract

Nearest centroid classifiers have recently been successfully employed in high-dimensional applications. A necessary step when building a classifier for high-dimensional data is feature selection. Feature selection is typically carried out by computing univariate statistics for each feature individually, without consideration for how a subset of features performs as a whole. For subsets of a given size, we characterize the optimal choice of features, corresponding to those yielding the smallest misclassification rate. Furthermore, we propose an algorithm for estimating this optimal subset in practice. Finally, we investigate the applicability of shrinkage ideas to nearest centroid classifiers. We use gene-expression microarrays for our illustrative examples, demonstrating that our proposed algorithms can improve the performance of a nearest centroid classifier.
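
For readers unfamiliar with the setup, the following is a minimal sketch of the conventional approach the abstract contrasts against: a nearest centroid classifier whose features are ranked one at a time by a univariate statistic. It is not the optimal-subset algorithm or the shrinkage method proposed in the paper. The function names (`fit_centroids`, `univariate_scores`, `predict`), the F-like score, and the diagonal pooled-variance distance are all assumptions chosen for illustration.

```python
import numpy as np

def fit_centroids(X, y):
    """Return sorted class labels, per-class centroids, and pooled within-class variances."""
    classes = np.unique(y)
    centroids = np.vstack([X[y == c].mean(axis=0) for c in classes])
    # Residuals of each sample from its own class centroid.
    resid = X - centroids[np.searchsorted(classes, y)]
    # Pooled per-feature variance (assumes a diagonal covariance; a constant
    # feature would give zero variance and would need special handling).
    pooled_var = (resid ** 2).sum(axis=0) / (len(y) - len(classes))
    return classes, centroids, pooled_var

def univariate_scores(X, y):
    """Score each feature individually: between-class sum of squares over pooled variance."""
    classes, centroids, pooled_var = fit_centroids(X, y)
    counts = np.array([(y == c).sum() for c in classes])
    overall = X.mean(axis=0)
    between = (counts[:, None] * (centroids - overall) ** 2).sum(axis=0)
    return between / pooled_var

def predict(X_new, classes, centroids, pooled_var, features):
    """Assign each row to the class with the nearest centroid, using only `features`."""
    Xf = X_new[:, features][:, None, :]      # shape (n_samples, 1, n_features)
    Cf = centroids[:, features][None, :, :]  # shape (1, n_classes, n_features)
    dist = ((Xf - Cf) ** 2 / pooled_var[features]).sum(axis=2)
    return classes[dist.argmin(axis=1)]
```

A typical use would rank features with `np.argsort(univariate_scores(X, y))[::-1]`, keep the top few indices, and pass them as `features` to `predict`. Because each feature is scored on its own, this ranking ignores how the selected subset performs jointly, which is the limitation the abstract points to.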

Disciplines

Bioinformatics | Computational Biology | Microarrays | Statistical Methodology | Statistical Theory

Suggested Citation

Dabney, Alan R. and Storey, John D., "Optimal Feature Selection for Nearest Centroid Classifiers, With Applications to Gene Expression Microarrays" (November 2005). UW Biostatistics Working Paper Series. Working Paper 267.
https://biostats.bepress.com/uwbiostat/paper267
