"Identification of Regulatory Elements Using A Feature Selection Method" by Sunduz Keles, Mark J. van der Laan et al.

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Identification of Regulatory Elements Using A Feature Selection Method

Authors

Sunduz Keles, Department of Statistics, University of Wisconsin, MadisonFollow
Mark J. van der Laan, Division of Biostatistics, School of Public Health, University of California, Berkeley
Michael B. Eisen, Dept. of Molecular and Cell Biology, University of California, Berkeley

Comments

Published 2002 in Bioinformatics, 18, pp. 1167-1175.

Abstract

Many methods have been described to identify regulatory motifs in the transcription control regions of genes that exhibit similar patterns of gene expression across a variety of experimental conditions. Here we focus on a single experimental condition, and utilize gene expression data to identify sequence motifs associated with genes that are activated under this experimental condition. We use a linear model with two way interactions to model gene expression as a function of sequence features (words) present in presumptive transcription control regions. The most relevant features are selected by a feature selection method called stepwise selection with monte carlo cross validation. We apply this method to a publicly available dataset of the yeast Saccharomyces cerevisiae, focussing on the 800 basepairs immediately upstream of each gene's translation start site (the upstream control region (UCR)). We successfully identify regulatory motifs that are known to be active under the experimental conditions analyzed, and find additional significant sequences that may represent novel regulatory motifs. We also discuss a complementary method that utilizes gene expression data from a single microarray experiment and allows averaging over variety of experimental conditions as an alternative to motif finding methods that act on clusters of co-expressed genes.

Disciplines

Genetics | Microarrays | Statistical Models

Suggested Citation

Keles, Sunduz; van der Laan, Mark J.; and Eisen, Michael B., "Identification of Regulatory Elements Using A Feature Selection Method" (September 2001). U.C. Berkeley Division of Biostatistics Working Paper Series. Working Paper 98.
https://biostats.bepress.com/ucbbiostat/paper98

Download

Included in

Genetics Commons, Microarrays Commons, Statistical Models Commons

COinS

Collection of Biostatistics Research Archive

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Authors

Comments

Abstract

Disciplines

Suggested Citation

Included in

Browse

Search

Author Corner

UCB Biostatistics

Collection of Biostatistics Research Archive

U.C. Berkeley Division of Biostatistics Working Paper Series

Title

Authors

Comments

Abstract

Disciplines

Suggested Citation

Included in

Share

Browse

Search

Author Corner

UCB Biostatistics