"Computational Techniques for Spatial Logistic Regression with Large Da" by Christopher J. Paciorek and Louise Ryan

Harvard University Biostatistics Working Paper Series

Title

Computational Techniques for Spatial Logistic Regression with Large Datasets

Authors

Christopher J. Paciorek, Harvard School of Public HealthFollow
Louise Ryan, Harvard School of Public Health and Dana-Farber Cancer InstituteFollow

Abstract

In epidemiological work, outcomes are frequently non-normal, sample sizes may be large, and effects are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. We focus on binary outcomes, with the risk surface a smooth function of space. We compare penalized likelihood models, including the penalized quasi-likelihood (PQL) approach, and Bayesian models based on fit, speed, and ease of implementation.

A Bayesian model using a spectral basis representation of the spatial surface provides the best tradeoff of sensitivity and specificity in simulations, detecting real spatial features while limiting overfitting and being more efficient computationally than other Bayesian approaches. One of the contributions of this work is further development of this underused representation. The spectral basis model outperforms the penalized likelihood methods, which are prone to overfitting, but is slower to fit and not as easily implemented. Conclusions based on a real dataset of cancer cases in Taiwan are similar albeit less conclusive with respect to comparing the approaches.

The success of the spectral basis with binary data and similar results with count data suggest that it may be generally useful in spatial models and more complicated hierarchical models.

Disciplines

Epidemiology | Numerical Analysis and Computation | Statistical Models

Suggested Citation

Paciorek, Christopher J. and Ryan, Louise, "Computational Techniques for Spatial Logistic Regression with Large Datasets" (October 2005). Harvard University Biostatistics Working Paper Series. Working Paper 32.
https://biostats.bepress.com/harvardbiostat/paper32

Download

Included in

Epidemiology Commons, Numerical Analysis and Computation Commons, Statistical Models Commons

COinS

Collection of Biostatistics Research Archive

Harvard University Biostatistics Working Paper Series

Title

Authors

Abstract

Disciplines

Suggested Citation

Included in

Browse

Search

Author Corner

Harvard Biostatistics

Collection of Biostatistics Research Archive

Harvard University Biostatistics Working Paper Series

Title

Authors

Abstract

Disciplines

Suggested Citation

Included in

Share

Browse

Search

Author Corner

Harvard Biostatistics