"A Comparison of Methods for Generating Correlated Binary Variates with Specified Marginal Means and Correlations" by John S. Preisser Jr. and Bahjat F. Qaqish

The University of North Carolina at Chapel Hill Department of Biostatistics Technical Report Series

Title

A Comparison of Methods for Generating Correlated Binary Variates with Specified Marginal Means and Correlations

Authors

John S. Preisser Jr., University of North Carolina at Chapel Hill
Bahjat F. Qaqish, University of North Carolina at Chapel Hill

Abstract

Simulation studies employed to study properties of estimators for parameters in population-averaged models for clustered or longitudinal data require suitable algorithms for data generation. The most useful algorithms for generating correlated binary data are those that allow general specifications of the marginal mean and correlation structures, while being able to generate clusters of moderate to large size. Such methods, however, cannot reproduce data for all possible multivariate binary distributions. Given a vector of marginal means, they often place restrictions on the range of correlations beyond the natural restrictions applicable to any multivariate binary distribution. Motivated by problems in biostatistics, we compare the algorithms of Emrich and Piedmonte (1991) and Qaqish (2003) with respect to range restrictions induced on correlations. Examples include generating longitudinal binary data and generating correlated binary data compatible with specified marginal means and covariance structures for bivariate, overdispersed binomial outcomes. Results show that both algorithms generally have good coverage with Qaqish's method giving a wider range of correlations for longitudinal data having autocorrelated within-subject associations and Emrich and Piedmonte's method giving a wider range of correlations for clustered data having exchangeable-type correlations. Practical considerations for generating data with varying cluster sizes or for subjects in longitudinal studies with missing data are also discussed.
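To make the "natural restrictions" mentioned in the abstract concrete, the following is a minimal sketch, not taken from the report, of the well-known pairwise bounds on the correlation between two binary variables with given marginal means. The function name `pairwise_correlation_bounds` and its arguments are illustrative assumptions. It covers only the pairwise constraint that any multivariate binary distribution must satisfy, not the additional restrictions induced by the Emrich and Piedmonte (1991) or Qaqish (2003) algorithms.

```python
import math


def pairwise_correlation_bounds(p1, p2):
    """Return (lower, upper) bounds on corr(Y1, Y2) for binary Y1, Y2.

    p1 and p2 are the marginal means P(Y1 = 1) and P(Y2 = 1).
    The bounds follow from max(0, p1 + p2 - 1) <= P(Y1 = 1, Y2 = 1) <= min(p1, p2).
    Hypothetical helper for illustration only.
    """
    if not (0 < p1 < 1 and 0 < p2 < 1):
        raise ValueError("marginal means must lie strictly between 0 and 1")
    q1, q2 = 1 - p1, 1 - p2
    lower = max(-math.sqrt(p1 * p2 / (q1 * q2)),
                -math.sqrt(q1 * q2 / (p1 * p2)))
    upper = min(math.sqrt(p1 * q2 / (p2 * q1)),
                math.sqrt(p2 * q1 / (p1 * q2)))
    return lower, upper


# Example: unequal means narrow the attainable range.
print(pairwise_correlation_bounds(0.2, 0.5))
```

With means 0.2 and 0.5 the attainable correlation is limited to roughly the interval from -0.5 to 0.5, whereas equal means of 0.5 allow the full range from -1 to 1. A generation algorithm that cannot reach the whole of this interval is imposing a restriction beyond the natural one.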

Disciplines

Biostatistics

Suggested Citation

Preisser, John S. Jr. and Qaqish, Bahjat F., "A Comparison of Methods for Generating Correlated Binary Variates with Specified Marginal Means and Correlations" (August 2012). The University of North Carolina at Chapel Hill Department of Biostatistics Technical Report Series. Working Paper 28.
http://biostats.bepress.com/uncbiostat/art28

Included in

Biostatistics Commons

Collection of Biostatistics Research Archive
