"Paired and Unpaired Comparisons and Clustering with Gene Expression Da" by Jennifer F. Bryan, Katherine S. Pollard et al.

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Paired and Unpaired Comparisons and Clustering with Gene Expression Data

Authors

Jennifer F. Bryan, Dept. of Statistics & Biotechnology Lab, University of British ColumbiaFollow
Katherine S. Pollard, Division of Biostatisics, School of Public Health, University of California, BerkeleyFollow
Mark J. van der Laan, Division of Biostatistics, School of Public Health, University of California, BerkeleyFollow

Comments

Published in Statistica Sinica, 12(1)87-110, 2002.

Abstract

We have previously described a statistical framework for using gene expression data from cDNA microarrays to select meaningful subsets of genes and to place genes into clusters (van der Laan and Bryan, 2001). In this paper we extend this methodolgy to the setting in which expression data is collected on a common set of p genes from either two observations within a subject (paired) or on subjects from two subpopulations (unpaired). We present simulation results that illustrate important issues encountered with cluster analysis in gene expression data. In particular, we see that sampling variability of the covariance structure and the presence of unrelated genes can have a strong impact on clustering algorithms and measures of cluster strength. We discuss ways to address this issue, including the application of a hybrid clustering method which incorporates both partitioning and collapsing steps. The hybrid methodology is illustrated on a cancer cell line data set with two types of cancer. We also present a method for selecting significantly differently expressed genes using a null distribution. Finally, we present theoretical results relating to sample size and consistency in this setting.

Disciplines

Suggested Citation

Bryan, Jennifer F. ; Pollard, Katherine S.; and van der Laan, Mark J. , "Paired and Unpaired Comparisons and Clustering with Gene Expression Data" (June 2001). U.C. Berkeley Division of Biostatistics Working Paper Series. Working Paper 95.
https://biostats.bepress.com/ucbbiostat/paper95

This document is currently not available here.

COinS

Collection of Biostatistics Research Archive

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Authors

Comments

Abstract

Disciplines

Suggested Citation

Browse

Search

Author Corner

UCB Biostatistics

Collection of Biostatistics Research Archive

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Authors

Comments

Abstract

Disciplines

Suggested Citation

Share

Browse

Search

Author Corner

UCB Biostatistics