Deep generative AI models analyzing circulating orphan non-coding RNAs enable detection of early-stage lung cancer.

TitleDeep generative AI models analyzing circulating orphan non-coding RNAs enable detection of early-stage lung cancer.
Publication TypeJournal Article
Year of Publication2024
AuthorsKarimzadeh M, Momen-Roknabadi A, Cavazos TB, Fang Y, Chen N-C, Multhaup M, Yen J, Ku J, Wang J, Zhao X, Murzynowski P, Wang K, Hanna R, Huang A, Corti D, Nguyen D, Lam T, Kilinc S, Arensdorf P, Chau KH, Hartwig A, Fish L, Li H, Behsaz B, Elemento O, Zou J, Hormozdiari F, Alipanahi B, Goodarzi H
JournalNat Commun
Volume15
Issue1
Pagination10090
Date Published2024 Nov 21
ISSN2041-1723
KeywordsAged, Artificial Intelligence, Biomarkers, Tumor, Carcinoma, Non-Small-Cell Lung, Deep Learning, Early Detection of Cancer, Female, Humans, Liquid Biopsy, Lung Neoplasms, Male, Middle Aged, Neoplasm Staging, RNA, Untranslated, Sensitivity and Specificity
Abstract

Liquid biopsies have the potential to revolutionize cancer care through non-invasive early detection of tumors. Developing a robust liquid biopsy test requires collecting high-dimensional data from a large number of blood samples across heterogeneous groups of patients. We propose that the generative capability of variational auto-encoders enables learning a robust and generalizable signature of blood-based biomarkers. In this study, we analyze orphan non-coding RNAs (oncRNAs) from serum samples of 1050 individuals diagnosed with non-small cell lung cancer (NSCLC) at various stages, as well as sex-, age-, and BMI-matched controls. We demonstrate that our multi-task generative AI model, Orion, surpasses commonly used methods in both overall performance and generalizability to held-out datasets. Orion achieves an overall sensitivity of 94% (95% CI: 87%-98%) at 87% (95% CI: 81%-93%) specificity for cancer detection across all stages, outperforming the sensitivity of other methods on held-out validation datasets by more than  ~ 30%.

DOI10.1038/s41467-024-53851-9
Alternate JournalNat Commun
PubMed ID39572521
PubMed Central IDPMC11582319