COBRA Preprint Series

A Bayesian Model Averaging Approach for Observational Gene Expression Studies

Xi Kathy Zhou, Division of Biostatistics and Epidemiology, Department of Public Health, Weill Cornell Medical CollegeFollow
Fei Liu, Department of Business Analytics and Mathematical Sciences, IBM Watson Research Center
Andrew J. Dannenberg, Department of Medicine, Weill Cornell Cancer center, Weill Cornell Medical College

Abstract

Identifying differentially expressed (DE) genes associated with a sample characteristic is the primary objective of many microarray studies. As more and more studies are carried out with observational rather than well controlled experimental samples, it becomes important to evaluate and properly control the impact of sample heterogeneity on DE gene finding. Typical methods for identifying DE genes require ranking all the genes according to a pre-selected statistic based on a single model for two or more group comparisons, with or without adjustment for other covariates. Such single model approaches unavoidably result in model misspecification, which can lead to increased error due to bias for some genes and reduced efficiency for the others. We evaluated the impact of model misspecification from such approaches on detecting DE genes and identified parameters that affect the magnitude of impact. To properly control for sample heterogeneity and to provide a flexible and coherent framework for identifying simultaneously DE genes associated with a single or multiple sample characteristics and/or their interactions, we proposed a Bayesian model averaging approach which corrects the model misspecification by averaging over model space formed by all relevant covariates. An empirical approach is suggested for specifying prior model probabilities. We demonstrated through simulated microarray data that this approach resulted in improved performance in DE gene identification compared to the single model approaches. The flexibility of this approach is demonstrated through our analysis of data from two observational microarray studies.

Disciplines

Bioinformatics | Computational Biology | Microarrays

Suggested Citation

Zhou, Xi Kathy; Liu, Fei; and Dannenberg, Andrew J., "A Bayesian Model Averaging Approach for Observational Gene Expression Studies" (June 2011). COBRA Preprint Series. Working Paper 81.
https://biostats.bepress.com/cobra/art81

Download

Included in

Bioinformatics Commons, Computational Biology Commons, Microarrays Commons

COinS

Collection of Biostatistics Research Archive

COBRA Preprint Series

A Bayesian Model Averaging Approach for Observational Gene Expression Studies

Abstract

Disciplines

Suggested Citation

Included in

Browse

Search

Author Corner

Collection of Biostatistics Research Archive

COBRA Preprint Series

A Bayesian Model Averaging Approach for Observational Gene Expression Studies

Authors

Abstract

Disciplines

Suggested Citation

Included in

Share

Browse

Search

Author Corner