The Department of Biostatistics is one of 5 departments in the School of Public Health and Community Medicine at the University of Washington. Its mission is to serve as a source of expertise and a focus for training and research in the quantitative aspects of public health and medicine, and to promote the use of rigorous quantitative methods in the biomedical and public health sciences.
Our graduate program is regarded as one of the best Biostatistics programs in the world, with over 30 years of teaching and research experience on the UW campus. Faculty interests range over a wide variety of statistical topics, including survival analysis, clinical trials, statistical genetics and correlated data.
The UW Biostatistics Working Paper series includes articles on statistical methods and applications developed by members of the department. In general, articles dated 2000 and later are downloadable from this site.
Submission Guidelines.
Please follow the Policies and Procedures page for submission guidelines.
Papers from 2013
Statistical Methods for Evaluating and Comparing Biomarkers for Patient Treatment Selection, Holly Janes, Marshall D. Brown, Margaret Pepe, and Ying Huang
An Evaluation of Inferential Procedures for Adaptive Clinical Trial Designs with Pre-specified Rules for Modifying the Sample Size, Greg P. Levin, Sarah C. Emerson, and Scott S. Emerson
The Net Reclassification Index (NRI): a Misleading Measure of Prediction Improvement with Miscalibrated or Overfit Models, Margaret Pepe, Jin Fang, Ziding Feng, Thomas Gerds, and Jorgen Hilden
Asymptotic and Finite Sample Behavior of Net Reclassification Indices, Zheyu Wang
Papers from 2012
A National Model Built with Partial Least Squares and Universal Kriging and Bootstrap-based Measurement Error Correction Techniques: An Application to the Multi-Ethnic Study of Atherosclerosis, Silas Bergen, Lianne Sheppard, Paul D. Sampson, Sun-Young Kim, Mark Richards, Sverre Vedal, Joel Kaufman, and Adam A. Szpiro
Decline in Health for Older Adults: 5-Year Change in 13 Key Measures of Standardized Health, Paula H. Diehr, Stephen M. Thielke, Anne B. Newman, Calvin H. Hirsch, and Russell Tracy
Borrowing Information Across Populations in Estimating Positive and Negative Predictive Values, Ying Huang, Youyi Fong, John Wei, and Ziding Feng
Fitting and Interpreting Continuous-Time Latent Markov Models for Panel Data, Jane M. Lange and Vladimir N. Minin
Methods for Evaluating Prediction Performance of Biomarkers and Tests, Margaret Pepe and Holly Janes
Testing for improvement in prediction model performance, Margaret S. Pepe PhD, Kathleen F. Kerr, Gary M. Longton, and Zheyu Wang
A Regionalized National Universal Kriging Model Using Partial Least Squares Regression for Estimating Annual PM2.5 Concentrations in Epidemiology, Paul D. Sampson, Mark Richards, Adam A. Szpiro, Silas Bergen, Lianne Sheppard, Timothy V. Larson, and Joel Kaufman
Transitions Among Health States Using 12 Measures of Successful Aging: Results from the Cardiovascular Health Study, Stephen Thielke and Paula Diehr
Papers from 2011
When Does Combining Markers Improve Classification Performance and What Are Implications for Practice?, Aasthaa Bansal and Margaret Sullivan Pepe
Doubly Robust Estimates for Binary Longitudinal Data Analysis with Missing Response and Missing Covariates, Baojiang Chen and Xiao-Hua Zhou
The Importance of Statistical Theory in Outlier Detection, Sarah C. Emerson and Scott S. Emerson
Some Observations on the Wilcoxon Rank Sum Test, Scott S. Emerson
Adaptive Clinical Trial Designs with Pre-specified Rules for Modifying the Sample Size: Understanding Efficient Types of Adaptation, Gregory P. Levin, Sarah C. Emerson, and Scott S. Emerson
A Flexible Spatio-Temporal Model for Air Pollution: Allowing for Spatio-Temporal Covariates, Johan Lindstrom, Adam A. Szpiro, Paul D. Sampson, Lianne Sheppard, Assaf Oron, Mark Richards, and Tim Larson
Semiparametric Estimation of the Covariate-Specific ROC Curve in Presence of Ignorable Verification Bias, Danping Liu and Xiao-Hua Zhou
Evaluating Markers for Treatment Selection Based on Survival Time, Xiao Song and Xiao-Hua Zhou
Non-Homogeneous Markov Process Models with Incomplete Observations: Application to a Dementia Disease Study, Xiao-Hua Zhou and Baojiang Chen
BATE Curve in Assessment of Clinical Utility of Predictive Biomarkers, Xiao-Hua Zhou and Yunbei Ma
Papers from 2010
Panel Count Data Regression with Informative Observation Times, Petra Buzkova
Modification and Improvement of Empirical Liklihood for Missing Response Problem, Gary Chan
Modification and Improvement of Empirical Likelihood for Missing Response Problem, Kwun Chuen Gary Chan
Oracle and Multiple Robustness Properties of Survey Calibration Estimator in Missing Response Problem, Kwun Chuen Gary Chan
On Two-Stage Hypothesis Testing Procedures Via Asymptotically Independent Statistics, James Dai, Charles Kooperberg, Michael L. LeBlanc, and Ross Prentice
On two-stage hypothesis testing procedures via asymptotically independent statistics, James Y. Dai, Charles Kooperberg, Michael LeBlanc, and Ross L. Prentice
Robustness of approaches to ROC curve modeling under misspecification of the underlying probability model, Sean Devlin, Elizabeth Thomas, and Scott S. Emerson
Using the Stages of Change Model to Choose an Optimal Health Marketing Target, Paula Diehr, Peggy A. Hannon, Barbara Pizacani, Mark Forehand, Jeffrey Harris, Hendrika Meischke, Susan J. Curry, Diane P. Martin, and Marcia R. Weaver
Multi-state Life Tables, Equilibrium Prevalence, and Baseline Selection Bias, Paula Diehr and David Yanez
Exploring the Benefits of Adaptive Sequential Designs in Time-to-Event Endpoint Settings, Sarah C. Emerson, Kyle Rudser, and Scott S. Emerson
Bio-Creep in Non-Inferiority Clinical Trials, Siobhan P. Everson-Stewart and Scott S. Emerson
Asymptotic Properties of the Sequential Empirical ROC and PPV Curves, Joseph S. Koopmeiners and Ziding Feng
Optimizing Vaccine Allocation at Different Points in Time During an Epidemic, Laura Matrajt and Ira M. Longini Jr.
Nonparametric and Semiparametric Analysis of Current Status Data Subject to Outcome Misclassification, Victor G. Sal y Rosas and James P. Hughes
Estimates of Information Growth in Longitudinal Clinical Trials, Abigail Shoben, Kyle Rudser, and Scott S. Emerson
Model-Robust Regression and a Bayesian `Sandwich' Estimator, Adam A. Szpiro, Kenneth M. Rice, and Thomas Lumley
Efficient Measurement Error Correction with Spatially Misaligned Data, Adam A. Szpiro, Lianne Sheppard, and Thomas Lumley
Papers from 2009
Measures to Summarize and Compare the Predictive Capacity of Markers, Wen Gu and Margaret Pepe
Interval Estimation for the Difference in Paired Areas under the ROC Curves in the Absence of a Gold Standard Test, Hsin-Neng Hsieh, Hsiu-Yuan Su, and Xiao-Hua Zhou
Nonparametric and Semiparametric Estimation of the Three Way Receiver Operating Characteristic Surface, Jialiang Li and Xiao-Hua Zhou
A Semi-Parametric Two-Part Mixed-Effects Heteroscedastic Transformation Model for Correlated Right-Skewed Semi-Continuous Data, Huazhen Lin and Xiao-Hua Zhou
Semiparametric Two-Part Models with Proportionality Constraints: Analysis of the Multi-Ethnic Study of Atherosclerosis (MESA), Anna Liu, Richard Kronmal, Xiao-Hua Zhou, and Shuangge Ma
Robustness of Semiparametric Efficiency in Nearly-Correct Models for Two-Phase Samples, Thomas Lumley
Pooled Nucleic Acid Testing to Identify Antiretroviral Treatment Failure during HIV Infection, Susanne May, Anthony Gamst, Richard Haubrich, Constance Benson, and Davey Smith
Pragmatic Estimation of a Spatio-Temporal Air Quality Model With Irregular Monitoring Data, Paul D. Sampson, Adam A. Szpiro, Lianne Sheppard, Johan Lindström, and Joel D. Kaufman
Evaluating Markers for Treatment Selection Based on Survival Time, Xiao Song and Xiao-Hua Zhou
Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials, Leslie Taylor and Xiao-Hua (Andrew) Zhou
Relaxing Latent Ignorability in the ITT Analysis of Randomized Studies with Missing Data and Noncompliance, L Taylor and Xiao-Hua Zhou
Papers from 2008
Multiple imputation of timing of mother-to-child transmission of HIV, Elizabeth Brown and Ying Qing Chen
Using Longitudinal Data to Estimate the Effect of Starting to Exercise on the Health of Sedentary Older Adults, Paula Diehr and Calvin Hirsch
Semiparametric and nonparametric methods for evaluating risk prediction markers in case-control studies, Ying Huang and Margaret Pepe
Semiparametric methods for evaluating the covariate-specific predictiveness of continuous markers in matched case-control studies, Ying Huang and Margaret S. Pepe
Accommodating Covariates in ROC Analysis, Holly Janes, Gary M. Longton, and Margaret Pepe
Influence of prediction approaches for spatially-dependent air pollution exposure on health effect estimation, Sun-Young Kim, Lianne Sheppard, and Ho Kim
Estimation and Comparison of Receiver Operating Characteristic Curves, Margaret Pepe, Gary M. Longton, and Holly Janes
Trading Bias for Precision: Decision Theory for Intervals and Sets, Kenneth M. Rice, Thomas Lumley, and Adam A. Szpiro
Estimation for Arbitrary Functionals of Survival, Kyle Rudser, Michael L. LeBlanc, and Scott S. Emerson
Predicting Intra-Urban Variation in Air Pollution Concentrations with Complex Spatio-Temporal Interactions, Adam A. Szpiro, Paul D. Sampson, Lianne Sheppard, Thomas Lumley, Sara D. Adar, and Joel Kaufman
Accounting for Errors from Predicting Exposures in Environmental Epidemiology and Environmental Statistics, Adam A. Szpiro, Lianne Sheppard, and Thomas Lumley
Semiparametric Inferential Procedures for Comparing Multivariate ROC Curves with Interaction Terms, Liansheng Tang and Xiao-Hua Zhou
Synthesis Analysis of Regression Models with a Continuous Outcome, Andrew Zhou, Nan Hu, Guizhou Hu, and Martin Root
Semi-Parametric Maximum Likelihood Estimates for ROC Curves of Continuous-Scale Tests, Xiao-Hua Zhou and Huazhen Lin
Nonparametric Heteroscedastic Transformation Regression Models for Skewed Data with an Application to Health Care Costs, Xiao-Hua Zhou, Huazhen Lin, and Eric Johnson
Papers from 2007
Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing-Data, Hua Chen, Zhi Geng, and Xiao-Hua Zhou
ROC Surfaces in the Presence of Verification Bias, Yueh-Yun Chi and Xiao-Hua (Andrew) Zhou
Power Boosting in Genome-Wide Studies Via Methods for Multivariate Outcomes, Mary J. Emond
A Censored Multinomial Regression Model for Perinatal Mother to Child Transmission of HIV, Charlotte C. Gard and Elizabeth R. Brown
Evaluating a Group Sequential Design in the Setting of Nonproportional Hazards, Daniel L. Gillen and Scott S. Emerson
A Parametric ROC Model Based Approach for Evaluating the Predictiveness of Continuous Markers in Case-control Studies, Ying Huang and Margaret Pepe
Biomarker Evaluation Using the Controls as a Reference Population, Ying Huang and Margaret Pepe
Adjusting for Covariates in Studies of Diagnostic, Screening, or Prognostic Markers: An Old Concept in a New Setting, Holly Janes and Margaret Pepe
What Is the Best Reference RNA? And Other Questions Regarding the Design and Analysis of Two-Color Microarray Experiments, Kathleen F. Kerr, Kyle A. Serikawa, Caimiao Wei, Mette A. Peters, and Roger E. Bumgarner
Longitudinal Data with Follow-up Truncated by Death: Finding a Match Between Analysis Method and Research Aims, Brenda Kurland, Laura Lee Johnson, and Paula Diehr
Estimating Sensitivity and Specificity from a Phase 2 Biomarker Study that Allows for Early Termination, Margaret S. Pepe PhD
Evaluating the ROC Performance of Markers for Future Events, Margaret Pepe, Yingye Zheng, and Yuying Jin
Gamma Generalized Linear Models for Pharmacokinetic Data, Ruth Salway and Jon Wakefield
Model-Robust Bayesian Regression and the Sandwich Estimator, Adam A. Szpiro, Kenneth M. Rice, and Thomas Lumley
Nonparametric and Semiparametric Group Sequential Methods for Comparing Accuracy of Diagnostic Tests, Liansheng Tang, Scott S. Emerson, and Xiao-Hua Zhou
Ecologic Studies Revisited, Jon Wakefield
Reporting and Interpretation in Genome-Wide Association Studies, Jon Wakefield
Papers from 2006
Generalized confidence intervals for the ratio or difference of two means for lognormal populations with zeros, Yea-Hung Chen and Xiao-Hua Zhou
Large Cluster Asymptotics for GEE: Working Correlation Models, Hyoju Chung and Thomas Lumley
Functional ANOVA Normalization of Two-Channel Microarrays, Alan Dabney and John D. Storey
Comparison of Haplotype-based and Tree-based SNP Imputation in Association Studies, James Y. Dai, Ingo Ruczinski, Michael LeBlanc, and Charles Kooperberg
Reliability, Effect Size, and Responsiveness and Intraclass Correlation of Health Status Measures Used in Randomized and Cluster-Randomized Trials, Paula Diehr, Lu Chen, Donald L. Patrick, Ziding Feng, and Yutaka Yasui
Different Public Health Interventions have Varying Effects, Paula Diehr, Anne B. Newman, Liming Cai, and Ann Derleth
Evaluating Causal Effect Predictiveness of Candidate Surrogate Endpoints, Peter B. Gilbert and Michael Hudgens
The Two-sample Problem for Failure Rates Depending on a Continuous Mark: An Application to Vaccine Efficacy, Peter B. Gilbert, Ian W. McKeague, and Yanqing Sun
Genome Scanning Methods for Comparing Sequences Between Groups, with Application to HIV Vaccine Trials, Peter B. Gilbert, Chunyuan Wu, and David V. Jobes
Hierarchical Models for Combining Ecological and Case-control Data, Sebastien Haneuse and Jon Wakefield
The combination of ecological and case-control data, Sebastien Haneuse and Jon Wakefield
The Combination of Ecological and Case-Control Data, Sebastien Haneuse and Jon Wakefield
Multiple imputation for the comparison of two screening tests in two-phase Alzheimer studies, Ofer Harel and Xiao-Hua Zhou
Multiple imputation - Review of theory, implementation and software, Ofer Harel and Xiao-Hua Zhou
Evaluating the Predictiveness of a Continuous Marker, Ying Huang, Margaret S. Pepe, and Ziding Feng
Adjusting for Covariate Effects on Classification Accuracy Using the Covariate-Adjusted ROC Curve, Holly Janes and Margaret S. Pepe
