The Department of Biostatistics is one of 5 departments in the School of Public Health and Community Medicine at the University of Washington. Its mission is to serve as a source of expertise and a focus for training and research in the quantitative aspects of public health and medicine, and to promote the use of rigorous quantitative methods in the biomedical and public health sciences.
Our graduate program is regarded as one of the best Biostatistics programs in the world, with over 30 years of teaching and research experience on the UW campus. Faculty interests range over a wide variety of statistical topics, including survival analysis, clinical trials, statistical genetics and correlated data.
The UW Biostatistics Working Paper series includes articles on statistical methods and applications developed by members of the department. In general, articles dated 2000 and later are downloadable from this site.
Submission Guidelines.
Please follow the Policies and Procedures page for submission guidelines.
Papers from 2012
Testing for improvement in prediction model performance, Margaret S. Pepe PhD, Kathleen F. Kerr, Gary M. Longton, and Zheyu Wang
Papers from 2011
When Does Combining Markers Improve Classification Performance and What Are Implications for Practice?, Aasthaa Bansal and Margaret Sullivan Pepe
Doubly Robust Estimates for Binary Longitudinal Data Analysis with Missing Response and Missing Covariates, Baojiang Chen and Xiao-Hua Zhou
The Importance of Statistical Theory in Outlier Detection, Sarah C. Emerson and Scott S. Emerson
Some Observations on the Wilcoxon Rank Sum Test, Scott S. Emerson
Adaptive Clinical Trial Designs with Pre-specified Rules for Modifying the Sample Size: Understanding Efficient Types of Adaptation, Gregory P. Levin, Sarah C. Emerson, and Scott S. Emerson
A Flexible Spatio-Temporal Model for Air Pollution: Allowing for Spatio-Temporal Covariates, Johan Lindstrom, Adam A. Szpiro, Paul D. Sampson, Lianne Sheppard, Assaf Oron, Mark Richards, and Tim Larson
Semiparametric Estimation of the Covariate-Specific ROC Curve in Presence of Ignorable Verification Bias, Danping Liu and Xiao-Hua Zhou
Evaluating Markers for Treatment Selection Based on Survival Time, Xiao Song and Xiao-Hua Zhou
Non-Homogeneous Markov Process Models with Incomplete Observations: Application to a Dementia Disease Study, Xiao-Hua Zhou and Baojiang Chen
BATE Curve in Assessment of Clinical Utility of Predictive Biomarkers, Xiao-Hua Zhou and Yunbei Ma
Papers from 2010
Panel Count Data Regression with Informative Observation Times, Petra Buzkova
Modification and Improvement of Empirical Liklihood for Missing Response Problem, Gary Chan
Modification and Improvement of Empirical Likelihood for Missing Response Problem, Kwun Chuen Gary Chan
Oracle and Multiple Robustness Properties of Survey Calibration Estimator in Missing Response Problem, Kwun Chuen Gary Chan
On Two-Stage Hypothesis Testing Procedures Via Asymptotically Independent Statistics, James Dai, Charles Kooperberg, Michael L. LeBlanc, and Ross Prentice
On two-stage hypothesis testing procedures via asymptotically independent statistics, James Y. Dai, Charles Kooperberg, Michael LeBlanc, and Ross L. Prentice
Robustness of approaches to ROC curve modeling under misspecification of the underlying probability model, Sean Devlin, Elizabeth Thomas, and Scott S. Emerson
Using the Stages of Change Model to Choose an Optimal Health Marketing Target, Paula Diehr, Peggy A. Hannon, Barbara Pizacani, Mark Forehand, Jeffrey Harris, Hendrika Meischke, Susan J. Curry, Diane P. Martin, and Marcia R. Weaver
Multi-state Life Tables, Equilibrium Prevalence, and Baseline Selection Bias, Paula Diehr and David Yanez
Exploring the Benefits of Adaptive Sequential Designs in Time-to-Event Endpoint Settings, Sarah C. Emerson, Kyle Rudser, and Scott S. Emerson
Bio-Creep in Non-Inferiority Clinical Trials, Siobhan P. Everson-Stewart and Scott S. Emerson
Asymptotic Properties of the Sequential Empirical ROC and PPV Curves, Joseph S. Koopmeiners and Ziding Feng
Critical Immune and Vaccination Thresholds in Heterogenous Populations, Laura Matrajt and Ira Longini
Optimizing Vaccine Allocation at Different Points in Time During an Epidemic, Laura Matrajt and Ira M. Longini Jr.
Nonparametric and Semiparametric Analysis of Current Status Data Subject to Outcome Misclassification, Victor G. Sal y Rosas and James P. Hughes
Estimates of Information Growth in Longitudinal Clinical Trials, Abigail Shoben, Kyle Rudser, and Scott S. Emerson
Model-Robust Regression and a Bayesian `Sandwich' Estimator, Adam A. Szpiro, Kenneth M. Rice, and Thomas Lumley
Efficient Measurement Error Correction with Spatially Misaligned Data, Adam A. Szpiro, Lianne Sheppard, and Thomas Lumley
Papers from 2009
Measures to Summarize and Compare the Predictive Capacity of Markers, Wen Gu and Margaret Pepe
Interval Estimation for the Difference in Paired Areas under the ROC Curves in the Absence of a Gold Standard Test, Hsin-Neng Hsieh, Hsiu-Yuan Su, and Xiao-Hua Zhou
Nonparametric and Semiparametric Estimation of the Three Way Receiver Operating Characteristic Surface, Jialiang Li and Xiao-Hua Zhou
A Semi-Parametric Two-Part Mixed-Effects Heteroscedastic Transformation Model for Correlated Right-Skewed Semi-Continuous Data, Huazhen Lin and Xiao-Hua Zhou
Semiparametric Two-Part Models with Proportionality Constraints: Analysis of the Multi-Ethnic Study of Atherosclerosis (MESA), Anna Liu, Richard Kronmal, Xiao-Hua Zhou, and Shuangge Ma
Robustness of Semiparametric Efficiency in Nearly-Correct Models for Two-Phase Samples, Thomas Lumley
Pooled Nucleic Acid Testing to Identify Antiretroviral Treatment Failure during HIV Infection, Susanne May, Anthony Gamst, Richard Haubrich, Constance Benson, and Davey Smith
Pragmatic Estimation of a Spatio-Temporal Air Quality Model With Irregular Monitoring Data, Paul D. Sampson, Adam A. Szpiro, Lianne Sheppard, Johan Lindström, and Joel D. Kaufman
Evaluating Markers for Treatment Selection Based on Survival Time, Xiao Song and Xiao-Hua Zhou
Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials, Leslie Taylor and Xiao-Hua (Andrew) Zhou
Relaxing Latent Ignorability in the ITT Analysis of Randomized Studies with Missing Data and Noncompliance, L Taylor and Xiao-Hua Zhou
Papers from 2008
Multiple imputation of timing of mother-to-child transmission of HIV, Elizabeth Brown and Ying Qing Chen
Using Longitudinal Data to Estimate the Effect of Starting to Exercise on the Health of Sedentary Older Adults, Paula Diehr and Calvin Hirsch
Borrowing Information across Populations in Estimating Positive and Negative Predictive Values, Ying Huang, Ziding Feng, and Youyi Fong
Semiparametric and nonparametric methods for evaluating risk prediction markers in case-control studies, Ying Huang and Margaret Pepe
Semiparametric methods for evaluating the covariate-specific predictiveness of continuous markers in matched case-control studies, Ying Huang and Margaret S. Pepe
Accommodating Covariates in ROC Analysis, Holly Janes, Gary M. Longton, and Margaret Pepe
Influence of prediction approaches for spatially-dependent air pollution exposure on health effect estimation, Sun-Young Kim, Lianne Sheppard, and Ho Kim
Estimation and Comparison of Receiver Operating Characteristic Curves, Margaret Pepe, Gary M. Longton, and Holly Janes
Trading Bias for Precision: Decision Theory for Intervals and Sets, Kenneth M. Rice, Thomas Lumley, and Adam A. Szpiro
Estimation for Arbitrary Functionals of Survival, Kyle Rudser, Michael L. LeBlanc, and Scott S. Emerson
Predicting Intra-Urban Variation in Air Pollution Concentrations with Complex Spatio-Temporal Interactions, Adam A. Szpiro, Paul D. Sampson, Lianne Sheppard, Thomas Lumley, Sara D. Adar, and Joel Kaufman
Accounting for Errors from Predicting Exposures in Environmental Epidemiology and Environmental Statistics, Adam A. Szpiro, Lianne Sheppard, and Thomas Lumley
Semiparametric Inferential Procedures for Comparing Multivariate ROC Curves with Interaction Terms, Liansheng Tang and Xiao-Hua Zhou
Synthesis Analysis of Regression Models with a Continuous Outcome, Andrew Zhou, Nan Hu, Guizhou Hu, and Martin Root
Semi-Parametric Maximum Likelihood Estimates for ROC Curves of Continuous-Scale Tests, Xiao-Hua Zhou and Huazhen Lin
Nonparametric Heteroscedastic Transformation Regression Models for Skewed Data with an Application to Health Care Costs, Xiao-Hua Zhou, Huazhen Lin, and Eric Johnson
Papers from 2007
Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing-Data, Hua Chen, Zhi Geng, and Xiao-Hua Zhou
ROC Surfaces in the Presence of Verification Bias, Yueh-Yun Chi and Xiao-Hua (Andrew) Zhou
Power Boosting in Genome-Wide Studies Via Methods for Multivariate Outcomes, Mary J. Emond
A Censored Multinomial Regression Model for Perinatal Mother to Child Transmission of HIV, Charlotte C. Gard and Elizabeth R. Brown
Evaluating a Group Sequential Design in the Setting of Nonproportional Hazards, Daniel L. Gillen and Scott S. Emerson
A Parametric ROC Model Based Approach for Evaluating the Predictiveness of Continuous Markers in Case-control Studies, Ying Huang and Margaret Pepe
Biomarker Evaluation Using the Controls as a Reference Population, Ying Huang and Margaret Pepe
Adjusting for Covariates in Studies of Diagnostic, Screening, or Prognostic Markers: An Old Concept in a New Setting, Holly Janes and Margaret Pepe
What Is the Best Reference RNA? And Other Questions Regarding the Design and Analysis of Two-Color Microarray Experiments, Kathleen F. Kerr, Kyle A. Serikawa, Caimiao Wei, Mette A. Peters, and Roger E. Bumgarner
Longitudinal Data with Follow-up Truncated by Death: Finding a Match Between Analysis Method and Research Aims, Brenda Kurland, Laura Lee Johnson, and Paula Diehr
Estimating Sensitivity and Specificity from a Phase 2 Biomarker Study that Allows for Early Termination, Margaret S. Pepe PhD
Evaluating the ROC Performance of Markers for Future Events, Margaret Pepe, Yingye Zheng, and Yuying Jin
Gamma Generalized Linear Models for Pharmacokinetic Data, Ruth Salway and Jon Wakefield
Model-Robust Bayesian Regression and the Sandwich Estimator, Adam A. Szpiro, Kenneth M. Rice, and Thomas Lumley
Nonparametric and Semiparametric Group Sequential Methods for Comparing Accuracy of Diagnostic Tests, Liansheng Tang, Scott S. Emerson, and Xiao-Hua Zhou
Ecologic Studies Revisited, Jon Wakefield
Reporting and Interpretation in Genome-Wide Association Studies, Jon Wakefield
Papers from 2006
Generalized confidence intervals for the ratio or difference of two means for lognormal populations with zeros, Yea-Hung Chen and Xiao-Hua Zhou
Large Cluster Asymptotics for GEE: Working Correlation Models, Hyoju Chung and Thomas Lumley
Functional ANOVA Normalization of Two-Channel Microarrays, Alan Dabney and John D. Storey
Comparison of Haplotype-based and Tree-based SNP Imputation in Association Studies, James Y. Dai, Ingo Ruczinski, Michael LeBlanc, and Charles Kooperberg
Reliability, Effect Size, and Responsiveness and Intraclass Correlation of Health Status Measures Used in Randomized and Cluster-Randomized Trials, Paula Diehr, Lu Chen, Donald L. Patrick, Ziding Feng, and Yutaka Yasui
Different Public Health Interventions have Varying Effects, Paula Diehr, Anne B. Newman, Liming Cai, and Ann Derleth
Evaluating Causal Effect Predictiveness of Candidate Surrogate Endpoints, Peter B. Gilbert and Michael Hudgens
The Two-sample Problem for Failure Rates Depending on a Continuous Mark: An Application to Vaccine Efficacy, Peter B. Gilbert, Ian W. McKeague, and Yanqing Sun
Genome Scanning Methods for Comparing Sequences Between Groups, with Application to HIV Vaccine Trials, Peter B. Gilbert, Chunyuan Wu, and David V. Jobes
Hierarchical Models for Combining Ecological and Case-control Data, Sebastien Haneuse and Jon Wakefield
The combination of ecological and case-control data, Sebastien Haneuse and Jon Wakefield
The Combination of Ecological and Case-Control Data, Sebastien Haneuse and Jon Wakefield
Multiple imputation for the comparison of two screening tests in two-phase Alzheimer studies, Ofer Harel and Xiao-Hua Zhou
Multiple imputation - Review of theory, implementation and software, Ofer Harel and Xiao-Hua Zhou
Evaluating the Predictiveness of a Continuous Marker, Ying Huang, Margaret S. Pepe, and Ziding Feng
Adjusting for Covariate Effects on Classification Accuracy Using the Covariate-Adjusted ROC Curve, Holly Janes and Margaret S. Pepe
Statistical Analysis of Air Pollution Panel Studies: An Illustration, Holly Janes, Lianne Sheppard, and Kristen Shepherd
2^k Factorials in Blocks of Size 2, with Application to Two-Color Microarray Experiments, Kathleen F. Kerr
On the Structure of Multiple Testing Procedures, Jeffrey Leek and John D. Storey
Relative Risk Regression in Medical Research: Models, Contrasts, Estimators, and Algorithms, Thomas Lumley, Richard Kronmal, and Shuangge Ma
A Marginalized Diffusion Model for Estimating Age at First Endoscopy Examination from Current Status Data, Diana Miglioretti and Elizabeth Brown
Hierarchical Lévy Frailty Models and a Frailty Analysis of Data on Infant Mortality in Norwegian Siblings, Tron Anders Moger and Odd O. Aalen
Case-cohort Methods for Survival Data on Families from Routine Registers, Tron Anders Moger, Yudi Pawitan, and Ørnulf Borgan
Integrating the Predictiveness of a Marker with its Performance as a Classifier, Margaret S. Pepe, Ziding Feng, Ying Huang, Gary M. Longton, Ross Prentice, Ian M. Thompson, and Yingye Zheng
A Semiparametric Approach for the Nonparametric Transformation Survival Model With Multiple Covariates, Xiao Song, Shuangge Ma, Jian Huang, and Xiao-Hua Zhou
COVARIATE SPECIFIC ROC CURVE WITH SURVIVAL OUTCOME, Xiao Song and Xiao-Hua Zhou
