The Division of Biostatistics is one of five divisions of the School of Public Health at the University of California, Berkeley. Its mission is the promotion of teaching and research of biostatistical methods by faculty and graduate students. Graduate students are admitted to M.A. and Ph.D. programs through the Group in Biostatistics which is a joint program of the School of Public Health and the Department of Statistics.
The Biostatistics Working Paper series includes articles on statistical methods and applications developed by faculty and visitors of the Division of Biostatistics. In general, articles dated 2001 and later are downloadable from this site. For earlier articles that have appeared in print, we have included an abstract with a citation. Articles that are not downloadable or are unavailable in print may be requested from
Nicholas P. Jewell
Chair, Group in Biostatistics
University of California, Berkeley
140 Warren Hall
Berkeley, CA 94720-7360
Papers from 2004
Regulatory Motif Finding by Logic Regression, Sunduz Keles, Mark J. van der Laan, and Chris Vulpe
Deletion/Substitution/Addition Algorithm for Partitioning the Covariate Space in Prediction, Annette Molinaro and Mark J. van der Laan
Multiple Testing Procedures: R multtest Package and Applications to Genomics, Katherine S. Pollard, Sandrine Dudoit, and Mark J. van der Laan
GLLAMM Manual, Sophia Rabe-Hesketh, Anders Skrondal, and Andrew Pickles
Loss-Based Cross-Validated Deletion/Substitution/Addition Algorithms in Estimation, Sandra E. Sinisi and Mark J. van der Laan
Multiple Testing. Part III. Procedures for Control of the Generalized Family-Wise Error Rate and Proportion of False Positives, Mark J. van der Laan, Sandrine Dudoit, and Katherine S. Pollard
The Cross-Validated Adaptive Epsilon-Net Estimator, Mark J. van der Laan, Sandrine Dudoit, and Aad W. van der Vaart
Estimation of Treatment Effects in Randomized Trials with Noncompliance and a Dichotomous Outcome , Mark J. van der Laan, Alan E. Hubbard, and Nicholas P. Jewell
Estimation of Direct and Indirect Causal Effects in Longitudinal Studies, Mark J. van der Laan and Maya L. Petersen
History-Adjusted Marginal Structural Models and Statically-Optimal Dynamic Treatment Regimes, Mark J. van der Laan and Maya L. Petersen
Estimating a Survival Distribution with Current Status Data and High-Dimensional Covariates, Mark J. van der Laan and Aad van der Vaart
Quantification and Visualization of LD Patterns and Identification of Haplotype Blocks, Yan Wang and Sandrine Dudoit
Data Adaptive Estimation of the Treatment Specific Mean, Yue Wang, Oliver Bembom, and Mark J. van der Laan
A Statistical Method for Constructing Transcriptional Regulatory Networks Using Gene Expression and Sequence Data , Biao Xing and Mark J. van der Laan
Papers from 2003
A Semiparametric Model Selection Criterion with Applications to the Marginal Structural Model, M. Alan Brookhart and Mark J. van der Laan
Rank Regression in Stability Analysis, Ying Qing Chen, Annpey Pong, and Biao Xing
IBD Configuration Transition Matrices and Linkage Score Tests for Unilineal Relative Pairs, Sandrine Dudoit
Asymptotics of Cross-Validated Risk Estimation in Estimator Selection and Performance Assessment, Sandrine Dudoit and Mark J. van der Laan
Loss-Based Estimation with Cross-Validation: Applications to Microarray Data Analysis and Motif Finding, Sandrine Dudoit, Mark J. van der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, and Siew Leng Teng
Multiple Testing. Part I. Single-Step Procedures for Control of General Type I Error Rates, Sandrine Dudoit, Mark J. van der Laan, and Katherine S. Pollard
Temporal Stability and Geographic Variation in Cumulative Case Fatality Rates and Average Doubling Times of SARS Epidemics, Alison P. Galvani, Xiudong Lei, and Nicholas P. Jewell
Comparison of the Inverse Probability of Treatment Weighted (IPTW) Estimator With a Naïve Estimator in the Analysis of Longitudinal Data With Time-Dependent Confounding: A Simulation Study, Thaddeus Haight, Romain Neugebauer, Ira B. Tager, and Mark J. van der Laan
Asymptotically Optimal Model Selection Method with Right Censored Outcomes, Sunduz Keles, Mark J. van der Laan, and Sandrine Dudoit
Supervised Detection of Regulatory Motifs in DNA Sequences, Sunduz Keles, Mark J. van der Laan, Sandrine Dudoit, Biao Xing, and Michael B. Eisen
Tree-based Multivariate Regression and Density Estimation with Right-Censored Data , Annette M. Molinaro, Sandrine Dudoit, and Mark J. van der Laan
Locally Efficient Estimation of Nonparametric Causal Effects on Mean Outcomes in Longitudinal Studies, Romain Neugebauer and Mark J. van der Laan
Resampling-based Multiple Testing: Asymptotic Control of Type I Error and Applications to Gene Expression Data, Katherine S. Pollard and Mark J. van der Laan
Unified Cross-Validation Methodology For Selection Among Estimators and a General Cross-Validated Adaptive Epsilon-Net Estimator: Finite Sample Oracle Inequalities and Examples, Mark J. van der Laan and Sandrine Dudoit
Asymptotic Optimality of Likelihood Based Cross-Validation, Mark J. van der Laan, Sandrine Dudoit, and Sunduz Keles
Multiple Testing. Part II. Step-Down Procedures for Control of the Family-Wise Error Rate, Mark J. van der Laan, Sandrine Dudoit, and Katherine S. Pollard
Double Robust Estimation in Longitudinal Marginal Structural Models, Zhuo Yu and Mark J. van der Laan
Measuring Treatment Effects Using Semiparametric Models, Zhuo Yu and Mark J. van der Laan
Papers from 2002
Locally Efficient Estimation of Regression Parameters Using Current Status Data, Chris Andrews, Mark J. van der Laan, and James M. Robins
Analysis of Longitudinal Marginal Structural Models , Jennifer F. Bryan, Zhuo Yu, and Mark J. van der Laan
Accelerated Hazards Model: Method, Theory and Applications, Ying Qing Chen, Nicholas P. Jewell, and Jingrong Yang
Regression Analysis of Recurrent Gap Times with Time-Dependent Covariates, Ying Qing Chen, Mei-Cheng Wang, and Yijian Huang
Semiparametric Regression Analysis on Longitudinal Pattern of Recurrent Gap Times, Ying Qing Chen, Mei-Cheng Wang, and Yijian Huang
Inference for Proportional Mean Residual Life Model in the Presence of Censoring, Ying Q. Chen and Nicholas P. Jewell
Multiple Hypothesis Testing in Microarray Experiments, Sandrine Dudoit, Juliet Popper Shaffer, and Jennifer C. Boldrick
An Empirical Study of Marginal Structural Models for Time-Independent Treatment, Tanya A. Henneman and Mark J. van der Laan
Estimating Causal Parameters in Marginal Structural Models with Unmeasured Confounders Using Instrumental Variables, Tanya A. Henneman, Mark Johannes van der Laan, and Alan E. Hubbard
Case-Control Current Status Data, Nicholas P. Jewell and Mark J. van der Laan
Current Status Data: Review, Recent Developments and Open Problems, Nicholas P. Jewell and Mark J. van der Laan
Estimation of the Bivariate Survival Function with Generalized Bivariate Right Censored Data Structures, Sunduz Keles, Mark J. van der Laan, and James M. Robins
Recurrent Events Analysis in the Presence of Time Dependent Covariates and Dependent Censoring, Maja Miloslavsky, Sunduz Keles, Mark J. van der Laan, and Steve Butler
Comparative Genomic Hybridization Array Analysis, Annette M. Molinaro, Mark J. van der Laan, and Dan H. Moore
Why Prefer Double Robust Estimates? Illustration with Causal Point Treatment Studies, Romain Neugebauer and Mark J. van der Laan
A Method to Identify Significant Clusters in Gene Expression Data, Katherine S. Pollard and Mark J. van der Laan
Locally Efficient Estimation with Bivariate Right Censored Data , Christopher M. Quale, Mark J. van der Laan, and James M. Robins
Bivariate Current Status Data, Mark J. van der Laan and Nicholas P. Jewell
A New Partitioning Around Medoids Algorithm, Mark J. van der Laan, Katherine S. Pollard, and Jennifer Bryan
Construction of Counterfactuals and the G-computation Formula, Zhuo Yu and Mark J. van der Laan
Papers from 2001
Paired and Unpaired Comparisons and Clustering with Gene Expression Data, Jennifer F. Bryan, Katherine S. Pollard, and Mark J. van der Laan
Mixture Hazards Models with Additive Random Effects Accounting for Treatment Effectiveness Lag Time, Ying Qing Chen, C. A. Rohde, and M.-C. Wang
Detection of Progressive Deterioration in Early Onset Schizophrenia with a New Statistical Method, Ying Qing Chen, Mei-Cheng Wang, and William W. Eaton
Marginal Regression of Gaps Between Recurrent Events, Yijian Huang and Ying Qing Chen
Maximum Likelihood Estimation of Ordered Multinomial Parameters, Nicholas P. Jewell and John D. Kalbfleisch
Nonparametric Estimation from Current Status Data with Competing Risks, Nicholas P. Jewell, Mark J. van der Laan, and Tanya Henneman
Identification of Regulatory Elements Using A Feature Selection Method, Sunduz Keles, Mark J. van der Laan, and Michael B. Eisen
Fitting of Mixtures with Unspecified Number of Components Using Cross Validation Distance Estimate, Maja Miloslavsky and Mark J. van der Laan
Statistical Inference for Simultaneous Clustering of Gene Expression Data, Katherine S. Pollard and Mark J. van der Laan
Smooth Hazard Function Estimation for Interval Censored Data with Time Varying Covariates, Christopher Quale and Peter Bacchetti
Locally Efficient Estimation with Bivariate Right Censored Data, Christopher M. Quale, Mark J. van der Laan, and James M. Robins
Current Status and Right-Censored Data Structures When Observing a Marker at the Censoring Time, Mark J. van der Laan and Nicholas P. Jewell
Hybrid Clustering of Gene Expression Data with Visualization and the Bootstrap, Mark J. van der Laan and Katherine S. Pollard
Smooth Estimation of a Monotone Density, Aad W. van der Vaart and Mark J. van der Laan
Papers from 2000
A Class of Semiparametric Scale-Change Hazards Regression Models and Its Adequacy for Censored Survival Data, Ying Qing Chen
On a General Class of Semiparametric Hazards Regression Models, Ying Qing Chen and Nicholas P. Jewell
Gene Expression Analysis with the Parametric Bootstrap, Mark J. van der Laan and Jennifer F. Bryan
Locally Efficient Estimation in Censored Data Models: Theory and Examples, Mark J. van der Laan, Richard D. Gill, and James M. Robins
Locally Efficient Estimation of a Multivariate Survival Function in Longitudinal Studies, Mark J. van der Laan, Alan E. Hubbard, and James M. Robins
Papers from 1999
Assessing Adequacy of the Semiparametric Scale Change Hazards Regression Models, Ying Qing Chen
Analysis of Accelerated Hazards Models, Ying Qing Chen and Mei-Cheng Wang
Estimating a Treatment Effect by the Accelerated Hazards Model, Ying Qing Chen and Mei-Cheng Wang
The NPMLE in the Uniform Doubly Censored Current Status Data Model, Mark J. van der Laan and Nicholas P. Jewell
Papers from 1998
Automating Data Entry and Validation in Clinical Research, Andrew D. Graham, Robert E. Fusaro, and Kenneth A. Polse
Nonparametric Locally Efficient Estimation of the Treatment Specific Survival Distribution with Right Censored Data and Covariates in Observational Studies, Alan E. Hubbard, Mark J. van der Laan, and James M. Robins
Subset Selection in Explanatory Regression Analyses, Derick R. Peterson and Richard J. Brand
Inference with Bivariate Truncated Data, Christopher M. Quale and Mark J. van der Laan
Estimation with Interval Censored Data in Longitudinal Studies, Mark J. van der Laan
The Nonparametric Maximum Likelihood Estimator in a Class of Doubly Censored Current Status Data Models with Application to Partner Studies, Mark J. van der Laan and Chris Andrews
Locally Efficient Estimation of the Quality Adjusted Lifetime Distribution with Right-Censored Data and Covariates, Mark J. van der Laan and Alan E. Hubbard
Nonparametric Efficient Estimation with Current Status Data and Right-Censored Data Structures When Observing a Marker at the Censoring Time, Mark J. van der Laan and Nicholas P. Jewell
Subset Selection Based on Order Statistics from Logistic Populations, Mark J. van der Laan and Paul van der Laan
Papers from 1997
Locally Efficient Estimation of the Survival Distribution with Right Censored Data and Covariates When Collection of Data is Delayed, Mark J. van der Laan and Alan E. Hubbard
Efficient Estimation of the Lifetime and Disease Onset Distribution, Mark J. van der Laan, Nicholas P. Jewell, and Derick R. Peterson
Efficient Estimation from Right-Censored Data When Failure Indicators are Missing at Random, Mark J. van der Laan and Ian W. McKeague
Smooth Estimation and Inference with Interval Censored Data, Mark J. van der Laan and Derick R. Peterson
Locally Efficient Estimation with Current Status Data and Time-Dependent Covariates, Mark J. van der Laan and James M. Robins
Papers from 1996
Backcalculation of Multiple Sclerosis Incidence Rates Based on Faroe Islands Data, Nicholas P. Jewell and Biao Wm. Lu
Singly and Doubly Censored Current Status Data With Extensions to Multi-State Counting Processes, Nicholas P. Jewell and Mark J. van der Laan
Estimation with Interval Censored Data and Covariates, Mark J. van der Laan and Alan E. Hubbard
Papers from 1995
A Modification of the Re-Distribution to the Right Algorithm Using Disease Markers, Hina M. Malani
Efficiency of the Sieved-NPMLE in CAR-Missing Data Models, Mark J. van der Laan
Locally Efficient Estimation with Current Status Data and High Dimensional Covariates, Mark J. van der Laan
Nonparametric Estimation of the Bivariate Survival Function with Truncated Data, Mark J. van der Laan
The Two-Interval Line-Segment Problem, Mark J. van der Laan
Papers from 1994
Double Censoring and All That: Problems in Biostatistical Inference, Nicholas P. Jewell
Generalizations of Current Status Data With Applications, Nicholas P. Jewell and Mark J. van der Laan
Simulation Properties of Malani's Modified Kaplan-Meier Estimator, Sandra R. Percell and Hina M. Malani