| Title | Deep generative AI models analyzing circulating orphan non-coding RNAs enable detection of early-stage lung cancer. |
| Publication Type | Journal Article |
| Year of Publication | 2024 |
| Authors | Karimzadeh M, Momen-Roknabadi A, Cavazos TB, Fang Y, Chen N-C, Multhaup M, Yen J, Ku J, Wang J, Zhao X, Murzynowski P, Wang K, Hanna R, Huang A, Corti D, Nguyen D, Lam T, Kilinc S, Arensdorf P, Chau KH, Hartwig A, Fish L, Li H, Behsaz B, Elemento O, Zou J, Hormozdiari F, Alipanahi B, Goodarzi H |
| Journal | Nat Commun |
| Volume | 15 |
| Issue | 1 |
| Pagination | 10090 |
| Date Published | 2024 Nov 21 |
| ISSN | 2041-1723 |
| Keywords | Aged, Artificial Intelligence, Biomarkers, Tumor, Carcinoma, Non-Small-Cell Lung, Deep Learning, Early Detection of Cancer, Female, Humans, Liquid Biopsy, Lung Neoplasms, Male, Middle Aged, Neoplasm Staging, RNA, Untranslated, Sensitivity and Specificity |
| Abstract | Liquid biopsies have the potential to revolutionize cancer care through non-invasive early detection of tumors. Developing a robust liquid biopsy test requires collecting high-dimensional data from a large number of blood samples across heterogeneous groups of patients. We propose that the generative capability of variational auto-encoders enables learning a robust and generalizable signature of blood-based biomarkers. In this study, we analyze orphan non-coding RNAs (oncRNAs) from serum samples of 1050 individuals diagnosed with non-small cell lung cancer (NSCLC) at various stages, as well as sex-, age-, and BMI-matched controls. We demonstrate that our multi-task generative AI model, Orion, surpasses commonly used methods in both overall performance and generalizability to held-out datasets. Orion achieves an overall sensitivity of 94% (95% CI: 87%-98%) at 87% (95% CI: 81%-93%) specificity for cancer detection across all stages, outperforming the sensitivity of other methods on held-out validation datasets by more than ~ 30%. |
| DOI | 10.1038/s41467-024-53851-9 |
| Alternate Journal | Nat Commun |
| PubMed ID | 39572521 |
| PubMed Central ID | PMC11582319 |
