Located on the Harvard Medical Campus, the Department of Biostatistics was one of the first departments in the newly formed Harvard School of Public Health in 1922. Now in its 80th year, the Department comprises 85 students, 57 faculty members, and 22 research associates and fellows. Our size contributes to our ability to address a broad spectrum of biostatistical and public health issues.

Current departmental research on statistical and computing methods for observational studies and clinical trials includes survival analysis, missing-data problems, and causal inference. Other areas of investigation are environmental research (methods for longitudinal studies, analyses with incomplete data, and meta-analysis); statistical aspects of the study of AIDS and cancer; quantitative problems in health-risk analysis, technology assessment, and clinical decision making; statistical methodology in psychiatric research and in genetic studies; Bayesian statistics; statistical computing; statistical genetics and computational biology; and collaborative research activities with biomedical scientists in other Harvard-affiliated institutions.

The Harvard University Biostatistics Working Paper Series presents contributions by our faculty and researchers that rely on the theory and application of statistical science to analyze public health problems.

Follow

Papers from 2008

PDF

Estimation and Testing for the Effect of a Genetic Pathway on a Disease Outcome Using Logistic Kernel Machine Regression via Logistic Mixed Models, Dawei Liu, Debashis Ghosh, and Xihong Lin

PDF

Semiparametric Maximum Likelihood Estimation in Normal Transformation Models for Bivariate Survival Data, Yi Li, Ross L. Prentice, and Xihong Lin

PDF

Limitations of Remotely-sensed Aerosol as a Spatial Proxy for Fine Particulate Matter, Christopher J. Paciorek and Yang Liu

PDF

Expanded Technical Report: Mapping Ancient Forests: Bayesian Inference for Spatio-temporal Trends in Forest Composition Using the Fossil Pollen Proxy Record, Christopher J. Paciorek and Jason S. McLachlan

PDF

Practical Large-Scale Spatio-Temporal Modeling of Particulate Matter Concentrations, Christopher J. Paciorek, Jeff D. Yanosky, Robin C. Puett, Francine Laden, and Helen H. Suh

PDF

Estimation in Semiparametric Transition Measurement Error Models for Longitudinal Data, Wenqin Pan, Donglin Zeng, and Xihong Lin

PDF

Empirical Null and False Discovery Rate Inference for Exponential Families, Armin Schwartzman

PDF

The Highest Confidence Density Region and Its Usage for Inferences about the Survival Function with Censored Data, Lu Tian, Rui wang, Tianxi Cai, and L. J. Wei

PDF

Marginal Structural Models for Partial Exposure Regimes, Stijn Vansteelandt, Karl Mertens, Carl Suetens, and Els Goetghebeur

PDF

Nonparametric Inference Procedure For Percentiles of the Random Effect Distribution in Meta Analysis, Rui Wang, Lu Tian, Tianxi Cai, and L. J. Wei

PDF

Nonparametric Regression Using Local Kernel Estimating Equations for Correlated Failure Time Data, Zhangsheng Yu and Xihong Lin

Papers from 2007

PDF

Survival Analysis with Large Dimensional Covariates: An Application in Microarray Studies, David A. Engler and Yi Li

PDF

Assessment of a CGH-based Genetic Instability, David A. Engler, Yiping Shen, J F. Gusella, and Rebecca A. Betensky

PDF

Comparing Trends in Cancer Rates Across Overlapping Regions, Yi Li and Ram C. Tiwari

PDF

Estimating Time-to-Event From Longitudinal Categorical Data Using Random Effects Markov Models: Application to Multiple Sclerosis Progression, Micha Mandel and Rebecca A. Betensky

PDF

Simultaneous Confidence Intervals Based on the Percentile Bootstrap Approach, Micha Mandel and Rebecca A. Betensky

PDF

Assessing Population Level Genetic Instability via Moving Average, Samuel McDaniel, Rebecca Betensky, and Tianxi Cai

PDF

Spatio-temporal Associations Between GOES Aerosol Optical Depth Retrievals and Ground-Level PM2.5, Christopher J. Paciorek, Yang Liu, Hortensia Moreno-Macias, and Shobha Kondragunta

PDF

Conservative Estimation of Optimal Multiple Testing Procedures, James E. Signorovitch

PDF

Effectively Combining Independent 2 x 2 Tables for Valid Inferences in Meta Analysis with all Available Data but no Artificial Continuity Corrections for Studies with Zero Events and its Application to the Analysis of Rosiglitazone's Cardiovascular Disease Related Event Data, Lu Tian, Tianxi Cai, Nikita Piankov, Pierre-Yves Cremieux, and L. J. Wei

PDF

Identifying patients who need additional biomarkers for better prediction of health outcome or diagnosis of clinical phenotype, Lu Tian, Tianxi Cai, and L. J. Wei

PDF

Correcting Instrumental Variables Estimators for Systematic Measurement Error, Stijn Vansteelandt, Manoochehr Babanezhad, and Els Goetghebeur

Papers from 2006

PDF

Regression Analysis for the Partial Area Under the ROC Curve, Tianxi Cai and Lori E. Dodd

PDF

Predicting Future Responses Based on Possibly Misspecified Working Models, Tianxi Cai, Lu Tian, Scott D. Solomon, and L.J. Wei

PDF

Spatial Cluster Detection for Censored Outcome Data, Andrea J. Cook, Diane Gold, and Yi Li

PDF

A Computationally Tractable Multivariate Random Effects Model for Clustered Binary Data, Brent A. Coull, E. Andres Houseman, and Rebecca A. Betensky

PDF

A Likelihood Based Method for Real Time Estimation of the Serial Interval and Reproductive Number of an Epidemic, Laura Forsberg White and Marcello Pagano

PDF

Survival Analysis with Change Point Hazard Functions, Melody S. Goodman, Yi Li, and Ram C. Tiwari

PDF

Semiparametric Latent Variable Regression Models for Spatio-temporal Modeling of Mobile Source Particles in the Greater Boston Area, Alexandros Gryparis, Brent A. Coull, Joel Schwartz, and Helen H. Suh

PDF

Posterior Simulation in the Generalized Linear Model with Semiparmetric Random Effects, Subharup Guha

PDF

Bayesian Hidden Markov Modeling of Array CGH Data, Subharup Guha, Yi Li, and Donna Neuberg

PDF

Spatio-Temporal Analysis of Areal Data and Discovery of Neighborhood Relationships in Conditionally Autoregressive Models, Subharup Guha and Louise Ryan

PDF

PLASQ: A Generalized Linear Model-Based Procedure to Determine Allelic Dosage ini Cancer Cells from SNP Array Data, Thomas LaFramboise, David P. Harrington, and Barbara A. Weir

PDF

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized Clinical Trials Subject to Noncompliance, Rod Little, Qi Long, and Xihong Lin

PDF

Semiparametric Regression of Multi-Dimensional Genetic Pathway Data: Least Squares Kernel Machines and Linear Mixed Models, Dawei Liu, Xihong Lin, and Debashis Ghosh

PDF

Causal Inference in Hybrid Intervention Trials Involving Treatment Choice, Qi Long, Rod Little, and Xihong Lin

PDF

Selecting 'Significant' Differentially Expressed Genes from the Combined Perspective of the Null and the Alternative, Beatrijs Moerkerke and Els Goetghebeur

PDF

An Informative Bayesian Structural Equation Model to Assess Source-Specific Health Effects of Air Pollution, Margaret C. Nikolov, Brent A. Coull, Paul J. Catalano, and John J. Godleski

PDF

Mixed Multiplicative Factor Analysis Model for Air Pollution Exposure Assessment, Margaret C. Nikolov, Brent A. Coull, Paul J. Catalano, and John J. Godleski

PDF

Bayesian Smoothing of Irregularly-spaced Data Using Fourier Basis Functions, Christopher J. Paciorek

PDF

Structural Inference in Transition Measurement Error Models for Longitudinal Data, Wenqin Pan, Xihong Lin, and Donglin Zeng

PDF

Estimation in Semiparametric Transition Measurement Error Models for Longitudinal Data, Wenqin Pan, Donglin Zeng, and Xihong Lin

PDF

Multiple Testing With an Empirical Alternative Hypothesis, James E. Signorovitch

PDF

A Diagnostic Test for the Mixing Distribution in a Generalised Linear Mixed Model, Eric J. Tchetgen and Brent A. Coull

PDF

Evaluating Prediction Rules for t-Year Survivors With Censored Regression Models, Hajime Uno, Tianxi Cai, Lu Tian, and L.J. Wei

PDF

Using Profile Likelihood for Semiparametric Model Selection with Application to Proportional Hazards Mixed Models, Ronghui Xu, Anthony Gamst, Michael Donohue, Florin Vaida, and David P. Harrington

PDF

Nonparametric Regression Using Local Kernel Estimating Equations for Correlated Failure Time Data, Zhangsheng Yu and Xihong Lin

Papers from 2005

PDF

The Sensitivity and Specificity of Markers for Event Times, Tianxi Cai, Margaret S. Pepe, Thomas Lumley, Yingye Zheng, and Nancy Swords Jenny

PDF

Model Checking for ROC Regression Analysis, Tianxi Cai and Yingye Zheng

PDF

A Pseudolikelihood Approach for Simultaneous Analysis of Array Comparative Genomic Hybridizations (aCGH), David A. Engler, Gayatry Mohapatra, David N. Louis, and Rebecca Betensky

PDF

Gauss-Seidel Estimation of Generalized Linear Mixed Models with Application to Poisson Modeling of Spatially Varying Disease Rates, Subharup Guha and Louise Ryan

PDF

Feature-Specific Penalized Latent Class Analysis for Genomic Data, E. Andres Houseman, Brent A. Coull, and Rebecca A. Betensky

PDF

A Nonstationary Negative Binomial Time Series with Time-Dependent Covariates: Enterococcus Counts in Boston Harbor, E. Andres Houseman, Brent Coull, and James P. Shine

PDF

Robust Inferences For Covariate Effects On Survival Time With Censored Linear Regression Models, Larry Leon, Tianxi Cai, and L. J. Wei

PDF

Semiparametric Estimation in General Repeated Measures Problems, Xihong Lin and Raymond J. Carroll

PDF

Semiparametric Normal Transformation Models for Spatially Correlated Survival Data, Yi Li and Xihong Lin

PDF

Inference on Survival Data with Covariate Measurement Error - An Imputation-based Approach, Yi Li and Louise Ryan

PDF

Designed Extension of Survival Studies: Application to Clinical Trials with Unrecognized Heterogeneity, Yi Li, Mei-Chiung Shih, and Rebecca A. Betensky

PDF

Mixture Cure Survival Models with Dependent Censoring, Yi Li, Ram C. Tiwari, and Subharup Guha

PDF

Computational Techniques for Spatial Logistic Regression with Large Datasets, Christopher J. Paciorek and Louise Ryan

PDF

Simultaneous and Exact Interval Estimates for the Contrast of Two Groups Based on an Extremely High Dimensional Response Variable: Application to Mass Spec Data Analysis, Yuhyun Park, Sean R. Downing, Cheng Li Dr., William C. Hahn, Philip W. Kantoff, and L. J. Wei

PDF

Model Evaluation Based on the Distribution of Estimated Absolute Prediction Error, Lu Tian, Tianxi Cai, Els Goetghebeur, and L. J. Wei

PDF

Implementation Of Estimating-Function Based Inference Procedures With MCMC Sampler, Lu Tian, Jun S. Liu, and L. J. Wei

Papers from 2004

PDF

A Robust Regression Model for a First-Order Autoregressive Time Series with Unequal Spacing: Technical Report, E. Andres Houseman

PDF

A Functional-Based Distribution Diagnostic for a Linear Model with Correlated Outcomes: Technical Report, E. Andres Houseman, Brent Coull, and Louise Ryan

PDF

Cholesky Residuals for Assessing Normal Errors in a Linear Model with Correlated Outcomes: Technical Report, E. Andres Houseman, Louise Ryan, and Brent Coull

PDF

Semiparametric Methods for Semi-competing Risks Problem with Censoring and Truncation, Hongyu Jiang, Jason Fine, and Richard J. Chappell

PDF

One- and Two-Sample Nonparametric Inference Procedures in the Presence of Dependent Censoring, Yuhyun Park, Lu Tian, and L. J. Wei

PDF

On the Accelerated Failure Time Model for Current Status and Interval Censored Data, Lu Tian and Tianxi Cai

PDF

The Optimal Confidence Region for a Random Parameter, Hajime Uno, Lu Tian, and L.J. Wei

Papers from 2003

PDF

Semi-parametric Box-Cox Power Transformation Models for Censored Survival Observations, Tianxi Cai, Lu Tian, and L. J. Wei

PDF

Nonparametric Comparison of Two Survival-Time Distributions in the Presence of Dependent Censoring, Greg DiRienzo

PDF

Nonparametric Methods to predict HIV drug susceptibility phenotype from genotype, Greg DiRienzo

PDF

The Effects of Misspecifying Cox's Regression Model on Randomized Treatment Group Comparisons, Greg DiRienzo

PDF

Statistical Inference for Infinite Dimensional Parameters Via Asymptotically Pivotal Estimating Functions, Meredith A. Goldwasser, Lu Tian, and L. J. Wei

PDF

Empirical and Kernel Estimation of Covariate Distribution Conditional on Survival Time, Xiaochun Li and Ronghui Xu

PDF

A Nonparametric Comparison of Conditional Distributions with Nonnegligible Cure Fractions, Yi Li and Jin Feng

PDF

Survival Analysis with Heterogeneous Covariate Measurement Error, Yi Li and Louise Ryan

PDF

STATISTICAL INFERENCES BASED ON NON-SMOOTH ESTIMATING FUNCTIONS, Lu Tian, Jun S. Liu, Mary Zhao, and L. J. Wei

PDF

Estimating Predictors for Long- Or Short-Term Survivors, Lu Tian, Wei Wang, and L. J. Wei

PDF

On the Cox Model with Time-Varying Regression Coefficients, Lu Tian, David Zucker, and L. J. Wei