Identifying interactions in omics data for clinical biomarker discovery using symbolic regression

Publication: Contribution to journal › Journal article › Research › peer-reviewed

Documents

Full text
Final published version, 1.98 MB, PDF document

Niels Johan Christensen
Samuel Demharter
Meera Machado
Lykke Pedersen
Marco Salvatore
Valdemar Stentoft-Hansen
Miquel Triana Iglesias

Motivation: The identification of predictive biomarker signatures from omics and multi-omics data for clinical applications is an active area of research. Recent developments in assay technologies and machine learning (ML) methods have led to significant improvements in predictive performance. However, most high-performing ML methods suffer from complex architectures and lack interpretability.

Results: We present the application of a novel symbolic-regression-based algorithm, the QLattice, to a selection of clinical omics datasets. This approach generates parsimonious, high-performing models that can both predict disease outcomes and reveal putative disease mechanisms, demonstrating the importance of selecting maximally relevant and minimally redundant features in omics-based machine-learning applications. The simplicity and high predictive power of these biomarker signatures make them attractive tools for high-stakes applications in areas such as primary care, clinical decision-making and patient stratification.

Original language	English
Article number	405
Journal	Bioinformatics
Volume	38
Issue number	15
Pages (from-to)	3749-3758
Number of pages	10
ISSN	1367-4803
DOI	https://doi.org/10.1093/bioinformatics/btac405
Status	Published - 2022

ID: 314353858