"ESTIMATING GENOME-WIDE COPY NUMBER USING ALLELE SPECIFIC MIXTURE MODELS" by Wenyi Wang, Benilton Caravalho et al.

Johns Hopkins University, Dept. of Biostatistics Working Papers

Title

ESTIMATING GENOME-WIDE COPY NUMBER USING ALLELE SPECIFIC MIXTURE MODELS

Authors

Wenyi Wang, Johns Hopkins Bloomberg School of Public Health, Department of Biostatistics
Benilton Caravalho, Johns Hopkins Bloomberg School of Public Health, Department of Biostatistics
Nate Miller, Johns Hopkins University, School of Medicine, Department of Neuroscience
Jonathan Pevsner, The Johns Hopkins Medical Institutions, Kennedy Krieger Institute
Aravinda Chakravarti, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine
Rafael A. Irizarry, Johns Hopkins Bloomberg School of Public Health, Department of Biostatistics

Abstract

Genomic changes such as copy number alterations are thought to be among the major underlying causes of human phenotypic variation among normal and disease subjects [23,11,25,26,5,4,7,18]. These include chromosomal regions with so-called copy number alterations: instead of the expected two copies, a section of the chromosome for a particular individual may have zero copies (homozygous deletion), one copy (hemizygous deletion), or more than two copies (amplification). The canonical example is Down syndrome, which is caused by an extra copy of chromosome 21. Identification of such abnormalities in smaller regions has been of great interest, because they are believed to be an underlying cause of cancer.

More than a decade ago, comparative genomic hybridization (CGH) technology was developed to detect copy number changes in a high-throughput fashion. However, this technology only provides a 10 Mb resolution, which limits the ability to detect copy number alterations spanning small regions. It is widely believed that a copy number alteration as small as one base can have significant downstream effects, so microarray manufacturers have developed technologies that provide much higher resolution. Unfortunately, strong probe effects and variation introduced by sample preparation procedures have made single-point copy number estimates too imprecise to be useful. CGH arrays use a two-color hybridization, usually comparing a sample of interest to a reference sample, which to some degree removes the probe effect. However, the resolution is not nearly high enough to provide single-point copy number estimates. Various groups have proposed statistical procedures that pool data from neighboring locations to successfully improve precision. However, these procedures need to average across relatively large regions to work effectively, which greatly reduces the resolution. Recently, regression-type models that account for the probe effect have been proposed and appear to improve accuracy as well as precision. In this paper, we propose a mixture model solution specifically designed for single-point estimation that provides various advantages over the existing methodology. We use a 314-sample database, constructed with public datasets, to motivate and fit models for the conditional distribution of the observed intensities given allele-specific copy numbers. With the estimated models in place, we can compute posterior probabilities that provide a useful prediction rule as well as a confidence measure for each call. Software to implement this procedure will be available in the Bioconductor oligo package (http://www.bioconductor.org).
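
The posterior-probability step mentioned at the end of the abstract can be illustrated with a minimal sketch. This is not the model fitted in the paper: it assumes, purely for illustration, that the log intensity at a single probe follows a Gaussian distribution conditional on total copy number, and every name and number below (`COMPONENTS`, `PRIORS`, the means and standard deviations) is hypothetical. It only shows how conditional densities and prior weights combine through Bayes' rule into a call and a confidence measure.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a Normal(mean, sd^2) distribution evaluated at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_copy_number(x, components, priors):
    """Return {copy_number: P(copy_number | x)} using Bayes' rule."""
    weighted = {
        cn: priors[cn] * normal_pdf(x, mean, sd)
        for cn, (mean, sd) in components.items()
    }
    total = sum(weighted.values())
    return {cn: w / total for cn, w in weighted.items()}


def call_copy_number(x, components, priors):
    """Return the most probable copy number and its posterior probability."""
    post = posterior_copy_number(x, components, priors)
    best = max(post, key=post.get)
    return best, post[best]


# Hypothetical parameters (assumptions, not estimates from the paper):
# copy number -> (mean, standard deviation) of the log2 intensity.
COMPONENTS = {0: (8.0, 0.4), 1: (9.0, 0.4), 2: (10.0, 0.4), 3: (10.6, 0.4)}
# Hypothetical prior weights; two copies is treated as by far the most common state.
PRIORS = {0: 0.01, 1: 0.04, 2: 0.90, 3: 0.05}

if __name__ == "__main__":
    for intensity in (8.0, 9.1, 10.0, 10.9):
        cn, confidence = call_copy_number(intensity, COMPONENTS, PRIORS)
        print(f"log2 intensity {intensity}: call {cn}, posterior {confidence:.3f}")
```

The maximum posterior probability serves as the confidence measure: a call near 1 is unambiguous, while a value close to that of a neighboring state flags an uncertain single-point estimate.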

Disciplines

Bioinformatics | Computational Biology

Suggested Citation

Wang, Wenyi; Caravalho, Benilton; Miller, Nate; Pevsner, Jonathan; Chakravarti, Aravinda; and Irizarry, Rafael A., "ESTIMATING GENOME-WIDE COPY NUMBER USING ALLELE SPECIFIC MIXTURE MODELS" (October 2006). Johns Hopkins University, Dept. of Biostatistics Working Papers. Working Paper 122.
https://biostats.bepress.com/jhubiostat/paper122

Included in

Bioinformatics Commons, Computational Biology Commons

Collection of Biostatistics Research Archive
