Abstract
Identifying relevant genomic features that can act as prognostic markers for building
predictive survival models is one of the central themes in medical research, affecting the future of
personalized medicine and omics technologies. However, the high dimension of genome-wide omic
data, the strong correlation among the features, and the low sample size significantly increase the
complexity of cancer survival analysis, demanding the development of specific statistical methods
and software. Here, we present a novel R package, COSMONET (COx Survival Methods based On
NETworks), that provides a complete workflow from the pre-processing of omics data to the selection
of gene signatures and prediction of survival outcomes. In particular, COSMONET implements (i) three
different screening approaches to reduce the initial dimension of the data from a high-dimensional
space p to a moderate scale d, (ii) a network-penalized Cox regression algorithm to identify the gene
signature, (iii) several approaches to determine an optimal cut-off on the prognostic index (PI) to
separate high- and low-risk patients, and (iv) a prediction step for patients' risk class based on the
evaluation of PIs. Moreover, COSMONET provides functions for data pre-processing, visualization,
survival prediction, and gene enrichment analysis. We illustrate COSMONET through a step-by-step R
vignette using two cancer datasets.
Anno
2021
Autori IAC
Tipo pubblicazione
Altri Autori
Iuliano, A.; Occhipinti, A.; Angelini, C.; De Feis, I.; Li, P.
Editore
MDPI
Rivista
Mathematics