COBRA Preprint Series

Quantifying alternative splicing from paired-end RNA-sequencing data

David Rossell, Institute for Research in Biomedicine of BarcelonaFollow
Camille Stephan-Otto Attolini, Institute for Research in Biomedicine of BarcelonaFollow
Manuel Kroiss, Ludwig Maximilians Universitat, MunchenFollow
Almond Stöcker, Ludwig Maximilians Universitat, MunchenFollow

Abstract

RNA-sequencing has revolutionized biomedical research and, in particular, our ability to study gene alternative splicing. The problem has important implications for human health, as alternative splicing is involved in malfunctions at the cellular level and multiple diseases. However, the high-dimensional nature of the data and the existence of experimental biases pose serious data analysis challenges. We find that the standard data summaries used to study alternative splicing are severely limited, as they ignore a substantial amount of valuable information. Current data analysis methods are based on such summaries and are hence sub-optimal. Further, they have limited flexibility in accounting for technical biases. We propose novel data summaries and a Bayesian modeling framework that overcome these limitations and determine biases in a non-parametric, data-dependent manner. These summaries adapt naturally to the rapid improvements in sequencing technology. We provide efficient point estimates and uncertainty assessments. The approach allows to study alternative splicing patterns for individual samples and can also be the basis for downstream differential expression analysis. We found an over 5 fold improvement in estimation mean square error compared to a popular approach in simulations, and substantially higher correlations between replicates in experimental data. Our findings indicate the need for modifying the routine summarization and analysis of alternative splicing RNA-seq studies. We provide a software implementation in the R package casper.

Disciplines

Biostatistics

Suggested Citation

Rossell, David; Stephan-Otto Attolini, Camille; Kroiss, Manuel; and Stöcker, Almond, "Quantifying alternative splicing from paired-end RNA-sequencing data" (September 2012). COBRA Preprint Series. Working Paper 97.
https://biostats.bepress.com/cobra/art97

Download

Included in

Biostatistics Commons

COinS

Collection of Biostatistics Research Archive

COBRA Preprint Series

Quantifying alternative splicing from paired-end RNA-sequencing data

Abstract

Disciplines

Suggested Citation

Included in

Browse

Search

Author Corner

Collection of Biostatistics Research Archive

COBRA Preprint Series

Quantifying alternative splicing from paired-end RNA-sequencing data

Authors

Abstract

Disciplines

Suggested Citation

Included in

Share

Browse

Search

Author Corner