"A Method to Increase the Power of Multiple Testing Procedures Through " by Daniel Rubin, Sandrine Dudoit et al.

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

A Method to Increase the Power of Multiple Testing Procedures Through Sample Splitting

Authors

Daniel Rubin, Division of Biostatistics, School of Public Health, University of California, BerkeleyFollow
Sandrine Dudoit, Division of Biostatistics, School of Public Health, University of California, BerkeleyFollow
Mark J. van der Laan, Division of Biostatistics, School of Public Health, University of California, BerkeleyFollow

Comments

Published 2006 in Statistical Applications in Genetics and Molecular Biology 5, article 19.

Abstract

Consider the standard multiple testing problem where many hypotheses are to be tested, each hypothesis is associated with a test statistic, and large test statistics provide evidence against the null hypotheses. One proposal to provide probabilistic control of Type-I errors is the use of procedures ensuring that the expected number of false positives does not exceed a user-supplied threshold. Among such multiple testing procedures, we derive the ``most powerful'' method, meaning the test statistic cutoffs that maximize the expected number of true positives. Unfortunately, these optimal cutoffs depend on the true unknown data generating distribution, so could never be used in a practical setting. We instead consider splitting the sample so that the optimal cutoffs are estimated from a portion of the data, and then testing on the remaining data using these estimated cutoffs. When the null distributions for all test statistics are the same, the obvious way to control the expected number of false positives would be to use a common cutoff for all tests. In this work, we consider the common cutoff method as a benchmark multiple testing procedure. We show that in certain circumstances the use of estimated optimal cutoffs via sample splitting can dramatically outperform this benchmark method, resulting in increased true discoveries, while retaining Type-I error control. This paper is an updated version of the work presented in Rubin et al. (2005), later expanded upon by Wasserman and Roeder (2006).

Disciplines

Statistical Methodology | Statistical Theory

Suggested Citation

Rubin, Daniel; Dudoit, Sandrine ; and van der Laan, Mark J., "A Method to Increase the Power of Multiple Testing Procedures Through Sample Splitting" (June 2006). U.C. Berkeley Division of Biostatistics Working Paper Series. Working Paper 171.
https://biostats.bepress.com/ucbbiostat/paper171

Previous Versions

March 15, 2005

Download

Included in

Statistical Methodology Commons, Statistical Theory Commons

COinS

Collection of Biostatistics Research Archive

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Authors

Comments

Abstract

Disciplines

Suggested Citation

Previous Versions

Included in

Browse

Search

Author Corner

UCB Biostatistics

Collection of Biostatistics Research Archive

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Authors

Comments

Abstract

Disciplines

Suggested Citation

Previous Versions

Included in

Share

Browse

Search

Author Corner

UCB Biostatistics