The past two decades have witnessed significant advances in high-throughput ``omics" technologies such as genomics, proteomics, metabolomics, transcriptomics and radiomics. These technologies have enabled the simultaneous measurement of the expression levels of tens of thousands of features from individual patient samples and have generated enormous amounts of data that require analysis and interpretation. One specific area of interest has been in studying the relationship between these features and patient outcomes such as overall and recurrence-free survival with the goal of developing a predictive ``omics" profile. In this paper, we propose a supervised dimension reduction method for feature selection and survival prediction. Our approach utilizes continuum power regression - a framework that includes ordinary least squares, principal components regression and partial least squares - in conjunction with the parametric or semi-parametric accelerated failure time model, and enables feature selection under possible non-proportional hazards. The proposed approach can handle censored observations using robust Buckley-James estimation in this high-dimensional setting and the parametric version employs the flexible generalized F model that encompasses a wide spectrum of well known survival models. We evaluate the predictive performance of our methods via extensive simulation studies and compare it to existing methods using publicly available data sets in cancer genomics.
Biochemical Phenomena, Metabolism, and Nutrition | Biochemistry | Bioinformatics | Biological Phenomena, Cell Phenomena, and Immunity | Biology | Biotechnology | Cancer Biology | Computational Biology | Genetic Processes | Genetics | Genetics and Genomics | Genetic Structures | Genomics | Integrative Biology | Life Sciences | Medical Biomathematics and Biometrics | Medical Biotechnology | Medical Genetics | Medical Molecular Biology | Medical Pathology | Medicine and Health Sciences | Molecular Genetics | Other Genetics and Genomics | Physiological Processes
Spirko-Burns, Lauren and Devarajan, Karthik, "Supervised Dimension Reduction for Large-scale "Omics" Data with Censored Survival Outcomes Under Possible Non-proportional Hazards" (March 2019). COBRA Preprint Series. Working Paper 119.
Biochemical Phenomena, Metabolism, and Nutrition Commons, Biochemistry Commons, Bioinformatics Commons, Biological Phenomena, Cell Phenomena, and Immunity Commons, Biotechnology Commons, Cancer Biology Commons, Computational Biology Commons, Genetic Processes Commons, Genetics Commons, Genetic Structures Commons, Genomics Commons, Integrative Biology Commons, Medical Biomathematics and Biometrics Commons, Medical Biotechnology Commons, Medical Genetics Commons, Medical Molecular Biology Commons, Medical Pathology Commons, Molecular Genetics Commons, Other Genetics and Genomics Commons, Physiological Processes Commons