"Latent Supervised Learning" by Susan Wei and Michael R. Kosorok

The University of North Carolina at Chapel Hill Department of Biostatistics Technical Report Series

Authors

Susan Wei, University of North Carolina at Chapel Hill
Michael R. Kosorok, University of North Carolina at Chapel Hill

Abstract

A new machine learning task is introduced, called latent supervised learning, where the goal is to learn a binary classifier from continuous training labels which serve as surrogates for the unobserved class labels. A specific model is investigated where the surrogate variable arises from a two-component Gaussian mixture with unknown means and variances, and the component membership is determined by a hyperplane in the covariate space. The estimation of the separating hyperplane and the Gaussian mixture parameters forms what shall be referred to as the change-line classification problem. A data-driven sieve maximum likelihood estimator for the hyperplane is proposed, which in turn can be used to estimate the parameters of the Gaussian mixture. The estimator is shown to be consistent. Simulations as well as empirical data show the estimator has high classification accuracy.
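
The model described in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' sieve maximum likelihood estimator: the function names `simulate` and `profile_loglik`, the standard normal covariates, and all parameter values are hypothetical choices made for illustration. It draws a continuous surrogate from one of two Gaussians according to which side of a hyperplane the covariate falls on, then compares the plug-in Gaussian log-likelihood of the true hyperplane with that of an orthogonal one.

```python
import numpy as np

def simulate(n, w, gamma, mu, sigma, rng):
    """Draw covariates X and surrogate y; the class label z is latent."""
    X = rng.normal(size=(n, len(w)))           # assumed covariate distribution
    z = (X @ w - gamma >= 0).astype(int)       # side of the hyperplane (unobserved)
    y = rng.normal(loc=mu[z], scale=sigma[z])  # continuous surrogate label
    return X, y, z

def profile_loglik(X, y, w, gamma):
    """Gaussian log-likelihood of y with per-side MLE mean and variance plugged in."""
    side = X @ w - gamma >= 0
    total = 0.0
    for mask in (side, ~side):
        m = int(mask.sum())
        if m < 2:
            return -np.inf                     # degenerate split
        var = y[mask].var()                    # MLE variance (ddof=0)
        if var <= 0:
            return -np.inf
        total += -0.5 * m * (np.log(2 * np.pi * var) + 1.0)
    return total

rng = np.random.default_rng(0)
w_true = np.array([1.0, -1.0]) / np.sqrt(2)
w_wrong = np.array([1.0, 1.0]) / np.sqrt(2)   # orthogonal to the true normal vector
X, y, z = simulate(500, w_true, 0.0,
                   mu=np.array([0.0, 3.0]), sigma=np.array([1.0, 1.0]), rng=rng)

ll_true = profile_loglik(X, y, w_true, 0.0)
ll_wrong = profile_loglik(X, y, w_wrong, 0.0)
print(ll_true > ll_wrong)
```

In this sketch the latent label `z` is returned only so that classification accuracy could be checked in a simulation; in the latent supervised learning setting it is never observed, and only `X` and `y` would be available for estimation.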

Disciplines

Biostatistics

Suggested Citation

Wei, Susan and Kosorok, Michael R., "Latent Supervised Learning" (January 2013). The University of North Carolina at Chapel Hill Department of Biostatistics Technical Report Series. Working Paper 36.
http://biostats.bepress.com/uncbiostat/art36

Included in

Biostatistics Commons

Collection of Biostatistics Research Archive