"Control-Group Feature Normalization for Multivariate Pattern Analysis " by Kristin A. Linn, Bilwaj Gaonkar et al.

UPenn Biostatistics Working Papers

Title

Control-Group Feature Normalization for Multivariate Pattern Analysis Using the Support Vector Machine

Authors

Kristin A. Linn, Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of PennsylvaniaFollow
Bilwaj Gaonkar, Department of Neurosurgery, UCLA
Jimit Doshi, Department of Radiology, University of Pennsylvania
Christos Davatzikos, Department of Radiology, Perelman School of Medicine, University of Pennsylvania
Russell T. Shinohara, Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of PennsylvaniaFollow

Abstract

Normalization of feature vector values is a common practice in machine learning. Generally, each feature value is standardized to the unit hypercube or by normalizing to zero mean and unit variance. Classification decisions based on support vector machines (SVMs) or by other methods are sensitive to the specific normalization used on the features. In the context of multivariate pattern analysis using neuroimaging data, standardization effectively up- and down-weights features based on their individual variability. Since the standard approach uses the entire data set to guide the normalization it utilizes the total variability of these features. This total variation is inevitably dependent on the amount of marginal separation between groups. Thus, such a normalization may attenuate the separability of the data in high dimensional space. In this work we propose an alternate approach that uses an estimate of the control-group standard deviation to normalize features before training. We also show that control-based normalization provides better interpretation with respect to the estimated multivariate disease pattern and improves the classifier performance in many cases.

Disciplines

Biostatistics

Suggested Citation

Linn, Kristin A.; Gaonkar, Bilwaj; Doshi, Jimit; Davatzikos, Christos; and Shinohara, Russell T., "Control-Group Feature Normalization for Multivariate Pattern Analysis Using the Support Vector Machine" (September 2015). UPenn Biostatistics Working Papers. Working Paper 42.
https://biostats.bepress.com/upennbiostat/art42

Download

Included in

Biostatistics Commons

COinS

Collection of Biostatistics Research Archive

UPenn Biostatistics Working Papers

Title

Authors

Abstract

Disciplines

Suggested Citation

Included in

Browse

Search

Author Corner

UPenn Biostatistics

Collection of Biostatistics Research Archive

UPenn Biostatistics Working Papers

Title

Authors

Abstract

Disciplines

Suggested Citation

Included in

Share

Browse

Search

Author Corner

UPenn Biostatistics