Protein-based Diagnosis and Analysis of Co-pathologies Across Neurodegenerative Diseases: Large-Scale AI-Boosted CSF and Plasma Classification

|
Ying Xu, Daniel Western, Gyujin Heo, Kwangsik Nho, Yen-Ning Huang, Shiwei Liu, Hamilton Se-Hwee Oh, Yike Chen, Jigyasha Timsina, Menghan Liu, Yinxu Tang, Katherine Gong, John Budde, Varsha Krish, Farhad Imam, Raquel Puerta Fuentes, Amanda Cano, Marta Marquie, Merce Boada, Knight Alzheimer Disease Research Center (Knight-ADRC), Dominantly Inherited Alzheimer Network (DIAN), Alzheimer Disease Neuroimaging Initiative (ADNI), ACE Alzheimer Center Barcelona (ACE), Barcelona-1, Stanford Alzheimer Disease Research Center (Stanford-ADRC), Global Neurodegeneration Proteomics Consortium (GNPC), Pau Pastor, Agustin Ruiz, Maria Victoria Fernández, David Bennett, Gregory Klein, Tony Wyss-Coray, Andrew J Saykin, Muhammad Ali, Carlos Cruchaga

Res Sq [Preprint]. 2025 Jul 31:rs.3.rs-6933762. doi: 10.21203/rs.3.rs-6933762/v1.

ABSTRACT

Neurodegenerative diseases (including Alzheimer's disease, Parkinson's disease, Frontotemporal dementia, and Dementia with Lewy bodies) pose diagnostic challenges due to overlapping pathology and clinical heterogeneity. We leveraged proteomic data from more than 21,000 cerebrospinal fluid and plasma samples to develop and validate explainable, boosting-based multi-disease AI classifiers. The models achieved weighted AUCs in the testing datasets of 0.97 for CSF and 0.88 for plasma, equivalent to traditional biomarkers. The model was validated with neuropathological and clinical data, confirming robust generalizability without any retraining. Using zero-shot learning, we classified disease subtypes including autosomal dominant AD and prodromal PD and clarified disease states for those with conflicting clinical information. The model also showed the ability to prioritize cognitively normal individuals at disease risk. This framework enabled the identification and quantification of continuous, individual-level disease probabilities that allow for the quantification of overlap across diseases and co-pathologies within an individual. Through this work, we establish a benchmark computational framework for enhancing diagnostic precision in NDs.

PMID:40766224 | PMC:PMC12324596 | DOI:10.21203/rs.3.rs-6933762/v1