Located on the Harvard Medical Campus, the Department of Biostatistics was one of the first departments in the newly formed Harvard School of Public Health in 1922. Now in its 80th year, the Department comprises 85 students, 57 faculty members, and 22 research associates and fellows. Our size contributes to our ability to address a broad spectrum of biostatistical and public health issues.

Current departmental research on statistical and computing methods for observational studies and clinical trials includes survival analysis, missing-data problems, and causal inference. Other areas of investigation are environmental research (methods for longitudinal studies, analyses with incomplete data, and meta-analysis); statistical aspects of the study of AIDS and cancer; quantitative problems in health-risk analysis, technology assessment, and clinical decision making; statistical methodology in psychiatric research and in genetic studies; Bayesian statistics; statistical computing; statistical genetics and computational biology; and collaborative research activities with biomedical scientists in other Harvard-affiliated institutions.

The Harvard University Biostatistics Working Paper Series presents contributions by our faculty and researchers that rely on the theory and application of statistical science to analyze public health problems.

Follow

Papers from 2014

PDF

Estimation of the Overall Treatment Effect in the Presence of Interference in Cluster-randomized Trials of Infectious Disease Prevention, Nicole Bohme Carnegie, Rui Wang, and Victor De Gruttola

PDF

Adjustment for Mismeasured Exposure using Validation Data and Propensity Scores, Danielle Braun, Malka Gorfine, Corwin Zigler, Francesca Dominici, and Giovanni Parmigiani

PDF

A Predictive Enrichment Procedure to Identify Potential Responders to a New Therapy for Randomized, Comparative, Controlled Clinical Studies, Junlong Li, Lihui Zhao, Lu Tian, Tianxi Cai, Brian Claggett, Andrea Callegaro, Benjamin Dizier, Bart Spiessens, Fernando Ulloa-Montoya, and L. J. Wei

PDF

Likelihood Based Estimation of Logistic Structural Nested Mean Models with an Instrumental Variable, Roland A. Matsouaka and Eric J. Tchetgen Tchetgen

PDF

Control Function Assisted IPW Estimation with a Secondary Outcome in Case-Control Studies, Tamar Sofer, Marilyn C. Cornelis, Peter Kraft, and Eric J. Tchetgen Tchetgen

PDF

A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome, Eric Tchetgen Tchetgen

PDF

A Simple Regression-based Approach to Account for Survival Bias in Birth Outcomes Research, Eric J. Tchetgen Tchetgen, Kelesitse Phiri, and Roger Shapiro

PDF

Instrumental Variable Estimation in a Survival Context, Eric J. Tchetgen Tchetgen, Stefan Walter, Stijn Vansteelandt, Torben Martinussen, and Maria Glymour

PDF

Bounds to Evaluate the Pure/natural Direct Effect without Cross-world Counterfactual Independence, Eric Tchetgen Tchetgen and Kelesitse Phiri

PDF

A General Approach to Detect Gene (G)-environment (E) Additive Interaction Leveraging G-E Independence in Case-control Studies, Eric Tchetgen Tchetgen, Tamar Sofer, and Benedict H.W. Wong

PDF

A unification of mediation and interaction: a four-way decomposition, Tyler J. VanderWeele

PDF

Mediation Analysis with Time-Varying Exposures and Mediators, Tyler J. VanderWeele and Eric Tchetgen Tchetgen

PDF

Generalized Quantile Treatment Effect, Sergio Venturini, Francesca Dominici, and Giovanni Parmigiani

PDF

Predicting the Future Subject's Outcome via an Optimal Stratification Procedure with Baseline Information, Florence H. Yong, Lu Tian, Sheng Yu, Tianxi Cai, and L. J. Wei

PDF

Optimal Bayesian Adaptive Trials when Treatment Efficacy Depends on Biomarkers, Yifan Zhang, Lorenzo Trippa, and Giovanni Parmigiani

Papers from 2013

PDF

Phylogenetic Linkage Among HIV-infected Village Residents in Botswana: Estimation of Clustering Rates in the Presence of Missing Data, Nicole Bohme Carnegie, Rui Wang, Vladimir Novitsky, and Victor G. DeGruttola

PDF

Model Averaged Double Robust Estimation, Matthew Cefalu, Francesca Dominici, and Giovanni Parmigiani

PDF

Efficient Estimation of Risk Ratios From Clustered Binary Data, Matthew Cefalu and Eric Tchetgen Tchetgen

PDF

Simulating Bipartite Networks to Reflect Uncertainty in Local Network Properties, Ravi Goyal, Joseph Blitzstein, and Victor De Gruttola

PDF

A General Regression Framework for a Secondary Outcome in Case-control Studies, Eric J. Tchetgen Tchetgen

PDF

Identification and Estimation of Survivor Average Causal Effects, Eric J. Tchetgen Tchetgen

PDF

Alternative Identification and Inference for the Effect of Treatment on the Treated with an Instrumental Variable, Eric J. Tchetgen Tchetgen and Stijn Vansteelandt

PDF

A General Instrumental Variable Framework for Regression Analysis with Outcome Missing Not at Random, Eric J. Tchetgen Tchetgen and Kathleen Wirth

PDF

On the Restricted Mean Event Time in Survival Analysis, Lu Tian, Lihui Zhao, and L. J. Wei

PDF

A versatile test for equality of two survival functions based on weighted differences of Kaplan-Meier curves, Hajime Uno, Lu Tian, Brian Claggett, and L. J. Wei

PDF

A unification of mediation and interaction, Tyler J. VanderWeele

PDF

On the causal interpretation of race in regressions adjusting for confounding and mediating variables, Tyler J. VanderWeele and Whitney Robinson

PDF

Attributing effects to interactions, Tyler J. VanderWeele and Eric J. Tchetgen Tchetgen

PDF

Sample Size Considerations in the Design of Cluster Randomized Trials of Combination HIV Prevention, Rui Wang, Ravi Goyal, Quanhong Lei, M. Essex, and Victor DeGruttola

PDF

Más-o-menos: A Simple Sign Averaging Method for Discrimination in Genomic Data Analysis, Sihai Dave Zhao, Giovanni Parmigiani, Curtis Huttenhower, and Levi Waldron

Papers from 2012

PDF

Treatment Selections using Risk-benefit Profiles Based on Data from Comparative Randomized Clinical Trials with Multiple Endpoints, Brian Claggett, Lu Tian, Davide Castagno, and L. J. Wei

PDF

Nonparametric Inference for Meta Analysis with Fixed Unknown, Study-specific Parameters, Brian Claggett, Minge Xie, and Lu Tian

PDF

C2BAT: A Novel Method for Association Between Ge- netic Markers and Multiple Phenotypes, Melissa Naylor and Christoph Lange

PDF

Flexible Covariate-adjusted Exact Tests for Randomized Studies, Alisa J. Stephens, Eric J. Tchetgen Tchetgen, and Victor De Gruttola

PDF

Locally Efficient Estimation of Marginal Treatment Effects when Outcomes are Correlated: Is the Prize Worth the Chase?, Alisa J. Stephens, Eric J. Tchetgen Tchetgen, and Victor De Gruttola

PDF

Formulae for Causal Mediation Analysis in an Odds Ratio Context Without a Normality Assumption for the Continuous Mediator, Eric J. Tchetgen Tchetgen

PDF

Inverse Odds Ratio-Weighted Estimation for Causal Mediation Analysis, Eric J. Tchetgen Tchetgen

PDF

Multiple-Robust Estimation of an Odds Ratio Interaction, Eric J. Tchetgen Tchetgen

PDF

On a Closed-form Doubly Robust Estimator of the Adjusted Odds Ratio for a Binary Exposure, Eric J. Tchetgen Tchetgen

PDF

On a Logistic Mixed Model Formulation of a Quadratic Exponential Model for Correlated Binary Outcomes, Eric J. Tchetgen Tchetgen

PDF

A Cautionary Note on Specification of the Correlation Structure in Inverse-Probability-Weighted Estimation for Repeated Measures, Eric J. Tchetgen Tchetgen, M. Maria Glymour, Jennifer Weuve, and James Robins

PDF

Robust Estimation of Pure/Natural Direct Effects with Mediator Measurement Error, Eric J. Tchetgen Tchetgen and Sheng Hsuan Lin

PDF

On Parametrization, Robustness and Sensitivity Analysis in a Marginal Structural Cox Proportional Hazards Model for Point Exposure, Eric J. Tchetgen Tchetgen and James M. Robins

PDF

On Identification of Natural Direct Effects when a Confounder of the Mediator is Directly Affected by Exposure, Eric J. Tchetgen Tchetgen and Tyler J. VanderWeele

PDF

Robustness of Measures of Interaction to Unmeasured Confounding, Eric J. Tchetgen Tchetgen and Tyler J. VanderWeele

Papers from 2011

PDF

Estimating Subject-Specific Treatment Differences for Risk-Benefit Assessment with Competing Risk Event-Time Data, Brian Claggett, Lihui Zhao, Lu Tian, Davide Castagno, and L. J. Wei

PDF

Statistical Properties of the Integrative Correlation Coefficient: a Measure of Cross-study Gene Reproducibility, Leslie Cope and Giovanni Parmigiani

PDF

Multiple Testing of Local Maxima for Detection of Unimodal Peaks in 1D, Armin Schwartzman, Yulia Gavrilov, and Robert J. Adler

PDF

Multiple Testing of Local Maxima for Detection of Peaks in ChIP-Seq Data, Armin Schwartzman, Andrew Jaffe, Yulia Gavrilov, and Clifford A. Meyer

PDF

Estimation of Risk Ratios in Cohort Studies With Common Outcomes: A Simple and Efficient Two-stage Approach, Eric J. Tchetgen

PDF

On Causal Mediation Analysis with a Survival Outcome, Eric J. Tchetgen Tchetgen

PDF

Semiparametric Estimation of Models for Natural Direct and Indirect Effects, Eric J. Tchetgen Tchetgen and Ilya Shpitser

PDF

Semiparametric Theory for Causal Mediation Analysis: efficiency bounds, multiple robustness, and sensitivity analysis, Eric J. Tchetgen Tchetgen and Ilya Shpitser

PDF

On the Covariate-adjusted Estimation for an Overall Treatment Difference with Data from a Randomized Comparative Clinical Trial, Lu Tian, Tianxi Cai, Lihui Zhao, and L. J. Wei

PDF

Bayesian Effect Estimation Accounting for Adjustment Uncertainty, Chi Wang, Giovanni Parmigiani, and Francesca Dominici

PDF

Effectively Selecting a Target Population for a Future Comparative Study, Lihui Zhao, Lu Tian, Tianxi Cai, Brian Claggett, and L. J. Wei

PDF

A Regularization Corrected Score Method for Nonlinear Regression Models with Covariate Error, David M. Zucker, Malka Gorfine, Yi Li, and Donna Spiegelman

Papers from 2010

PDF

A New Class of Dantzig Selectors for Censored Linear Regression Models, Yi Li, Lee Dicker, and Sihai Dave Zhao

PDF

Estimating Causal Effects in Trials Involving Multi-treatment Arms Subject to Non-compliance: A Bayesian Frame-work, Qi Long, Roderick J. Little, and Xihong Lin

PDF

Improving the Power of Chronic Disease Surveillance by Incorporating Residential History, Justin Manjourides and Marcello Pagano

PDF

A Perturbation Method for Inference on Regularized Regression Estimates, Jessica Minnier, Lu Tian, and Tianxi Cai

PDF

Landmark Prediction of Survival, Layla Parast and Tianxi Cai

PDF

Modeling Dependent Gene Expression, Donatello Telesca, Peter Muller, Giovanni Parmigiani, and Ralph S. Freedman

PDF

Graphical Procedures for Evaluating Overall and Subject-Specific Incremental Values from New Predictors with Censored Event Time Data, Hajime Uno, Tianxi Cai, Lu Tian, and L. J. Wei

PDF

Nonparametric Regression with Missing Outcomes Using Weighted Kernel Estimating Equations, Lu Wang, Andrea Rotnitzky, and Xihong Lin

PDF

Powerful SNP Set Analysis for Case-Control Genome Wide Association Studies, Michael C. Wu, Peter Kraft, Michael P. Epstein, Deanne M. Taylor, Stephen J. Chanock, David J. Hunter, and Xihong Lin

PDF

Stratifying Subjects for Treatment Selection with Censored Event Time Data from a Comparative Study, Lihui Zhao, Tianxi Cai, Lu Tian, Hajime Uno, Scott D. Solomon, and L. J. Wei

PDF

Utilizing the Integrated Difference of Two Survival Functions to Quantify the Treatment Contrast for Designing, Monitoring and Analyzing a Comparative Clinical Study, Lihui Zhao, Lu Tian, Hajime Uno, Scott D. Solomon, Marc A. Pfeffer, J. S. Schindler, and L. J. Wei

PDF

Principled Sure Independence Screening for Cox Models with Ultra-high-dimensional Covariates, Sihai Dave Zhao and Yi Li

Papers from 2009

PDF

Lot Quality Assurance Sampling (LQAS) and the Mozambique Malaria Indicator Surveys, Caitlin Biedron, Marcello Pagano, Bethany L. Hedt, Albert Kilian, Amy Ratcliffe, Samuel Mabunda, and Joseph J. Valadez

PDF

Analysis of Randomized Comparative Clinical Trial Data for Personalized Treatment Selections, Tianxi Cai, Lu Tian, Peggy H. Wong, and L. J. Wei

PDF

Spatial Cluster Detection for Repeatedly Measured Outcomes while Accounting for Residential History, Andrea J. Cook, Diane Gold, and Yi Li

PDF

Spatial Cluster Detection for Weighted Outcomes Using Cumulative Geographic Residuals, Andrea J. Cook, Yi Li, David Arterburn, and Ram C. Tiwari

PDF

Survival Analysis with Error-prone Time-varying Covariates: A Risk Set Calibration Approach, Xiaomei Liao, David M. Zucker, Yi Li, and donna spiegelman

PDF

Estimating Subject-Specific Dependent Competing Risk Profile with Censored Event Time Observations, Yi Li, Lu Tian, and L. J. Wei

PDF

A New Class of Minimum Power Divergence Estimators with Applications to Cancer Surveillance, Nirian Martin and Yi Li

PDF

Marginalized Frailty Models for Multivariate Survival Data, Megan Othus and Yi Li

PDF

A Class of Semiparametric Mixture Cure Survival Models with Dependent Censoring, Megan Othus, Yi Li, and Ram C. Tiwari

PDF

The Importance of Scale for Spatial-confounding Bias and Precision of Spatial Regression Estimators, Christopher J. Paciorek

PDF

Group Comparison of Eigenvalues and Eigenvectors of Diffusion Tensors, Armin Schwartzman, Robert F. Dougherty, and Jonathan E. Taylor

PDF

The Effect of Correlation in False Discovery Rate Estimation, Armin Schwartzman and Xihong Lin

PDF

On The C-Statistics For Evaluating Overall Adequacy Of Risk Prediction Procedures With Censored Survival Data, Hajime Uno, Tianxi Cai, Michael J. Pencina, Ralph B. D'Agostino, and L. J. Wei

PDF

Comparing Risk Scoring Systems Beyond the ROC Paradigm in Survival Analysis, Hajime Uno, Lu Tian, Tianxi Cai, Isaac S. Kohane, and L. J. Wei

PDF

Sparse Linear Discriminant Analysis for Simultaneous Testing for the Significance of a Gene Set/Pathway and Gene Selection, Michael C. Wu, Lingson Zhang, Zhaoxi Wang, David C. Christiani, and Xihong Lin

Papers from 2008

PDF

Evaluating Subject-level Incremental Values of New Markers for Risk Classification Rule, Tianxi Cai, Lu Tian, Donald M. Lloyd-Jones, and L. J. Wei

PDF

Calibrating Parametric Subject-specific Risk Estimation, Tianxi Cai, Lu Tian, Hajime Uno, Scott D. Solomon, and L. J. Wei

PDF

A Functional Random Effects Model for Flexible Assessment of Susceptibility in Longitudinal Designs, Brent A. Coull

PDF

Estimation of Controlled Direct Effects, Sylvie Goetgeluk, Stijn Vansteelandt, and Els Goetghebeur

PDF

A New Class of Rank Tests for Interval-censored Data, Guadalupe Gomez and Ramon Oller Pique

PDF

Measurement Error Caused by Spatial Misalignment in Environmental Epidemiology, Alexandros Gryparis, Christopher J. Paciorek, Ariana Zeka, Joel Schwartz, and Brent A. Coull

PDF

A Matrix Pooling Algorithm for Disease Detection, Bethany L. Hedt and Marcello Pagano

PDF

Matrix Pooling: An Accurate and Cost Effective Testing Algorithm for Detection of Acute HIV Infection, Bethany L. Hedt and Marcello Pagano

PDF

Model-based Clustering of Methylation Array Data: A Recursive-partitioning Algorithm for High-dimensional Data Arising as a Mixture of Beta Distributions, E. Andres Houseman, Brock C. Christensen, Ru-Fang Yeh, Carmen J. Marsit, Margaret R. Karagas, Margaret Wrensch, Heather H. Nelson, Joseph Wiemels, Shichun Zheng, John K. Wiencke, and Karl T. Kelsey

PDF

A Powerful and Flexible Multilocus Association Test for Quantitative Traits, Lydia Coulter Kwee, Dawei Liu, Xihong Lin, Debashis Ghosh, and Michael P. Epstein

PDF

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized Clinical Trials Subject to Noncompliance, Rod Little, Qi Long, and Xihong Lin

PDF

Estimation and Testing for the Effect of a Genetic Pathway on a Disease Outcome Using Logistic Kernel Machine Regression via Logistic Mixed Models, Dawei Liu, Debashis Ghosh, and Xihong Lin

PDF

Semiparametric Maximum Likelihood Estimation in Normal Transformation Models for Bivariate Survival Data, Yi Li, Ross L. Prentice, and Xihong Lin

PDF

Limitations of Remotely-sensed Aerosol as a Spatial Proxy for Fine Particulate Matter, Christopher J. Paciorek and Yang Liu

PDF

Expanded Technical Report: Mapping Ancient Forests: Bayesian Inference for Spatio-temporal Trends in Forest Composition Using the Fossil Pollen Proxy Record, Christopher J. Paciorek and Jason S. McLachlan

PDF

Practical Large-Scale Spatio-Temporal Modeling of Particulate Matter Concentrations, Christopher J. Paciorek, Jeff D. Yanosky, Robin C. Puett, Francine Laden, and Helen H. Suh