Ayush Noori1,2,3,4,5,6,7, Joaquin Polonuer1, Katharina Meyer2,5,8, Bogdan Budnik2,5, Shad Morton2,5, Xinyuan Wang6,9, Sumaiya Nazeen6,9, Yingnan He3, Iñaki Arango1, Lucas Vittor1, Matthew Woodworth2,5,8, Richard C. Krolewski6,9, Michelle M. Li1,6, Ninning Liu2,5, Tushar Kamath10, Evan Macosko10, Dylan Ritter6,11, Jalwa Afroz6,11, Alexander B. H. Henderson3,6, Lorenz Studer6,11, Samuel G. Rodriques12, Andrew White12, Noa Dagan7,13,14, David A. Clifton4,15, George M. Church2,5,8, Sudeshna Das3†, Jenny M. Tam2,5,8†, Vikram Khurana6,9,10,16†, Marinka Zitnik1,6,7,10,17,18†
Neurological diseases are the leading global cause of disability, yet most lack disease-modifying treatments. To help address this gap, we developed PROTON, a graph AI model that generates hypotheses for neurological disease.
We demonstrate diverse disease-specific applications of PROTON using experimental and clinical data in three neurological conditions: Parkinson's disease (PD), bipolar disorder (BD), and Alzheimer's disease (AD). PROTON nominates candidate drugs, forecasts drug approvals, prioritizes pesticides, and identifies genetic, proteomic, and protein–protein interaction links across multiple biological scales.

PROTON is a 578-million-parameter heterogeneous graph transformer contextualized to the human brain. PROTON was trained on NeuroKG using a self-supervised link prediction objective. NeuroKG is a biomedical knowledge graph that unifies 36 human datasets and ontologies, and integrates single-nucleus RNA-sequencing atlases comprising 3,756,702 cells from the adult human brain. The training data contained 147,020 nodes across 16 entity types and 7,366,745 edges across 47 relation types. Through Bayesian hyperparameter optimization, we selected a model architecture that achieved high link-prediction performance (AUROC = 0.9145; accuracy = 82.23%) on an independent test set.
Genome-wide α-synuclein experiments
Human patient-derived brain organoids
Health records from 610,524 patients
Genome-wide α-synuclein experiments
In PD, PROTON linked genetic risk loci to genes essential for dopaminergic neuron survival and predicted pesticides toxic to patient-derived neurons, including the insecticide Naled, which ranked within the top 6.75% of predictions. In silico screens performed by PROTON reproduced six genome-wide -synuclein experiments, including a split-ubiquitin yeast two-hybrid system (normalized enrichment score [NES] = 2.30, FDR-adjusted ), an ascorbate peroxidase proximity labeling assay (NES = 2.16, FDR ), and a high-depth targeted deep sequencing study in 496 synucleinopathy patients (NES = 2.13, FDR ).

Human patient-derived brain organoids
In BD, PROTON nominated calcitriol as a candidate drug that reversed proteomic alterations observed in cortical organoids derived from BD patients.

Health records from 610,524 patients
In AD, we evaluated PROTON predictions in electronic health records from = 610,524 patients at Mass General Brigham, confirming that five PROTON-predicted drugs were associated with reduced seven-year dementia risk (minimum hazard ratio = 0.63, 95% CI: 0.53–0.75, ).

PROTON is released under the MIT License. If you use PROTON, please consider citing our paper.
@article{noori_graph_2025,
title = {Graph {{AI}} generates neurological hypotheses validated in molecular, organoid, and clinical systems},
author = {Noori, Ayush and Polonuer, Joaqu{\'i}n and Meyer, Katharina and Budnik, Bogdan and Morton, Shad and Wang, Xinyuan and Nazeen, Sumaiya and He, Yingnan and Arango, I{\~n}aki and Vittor, Lucas and Woodworth, Matthew and Krolewski, Richard C. and Li, Michelle M. and Liu, Ninning and Kamath, Tushar and Macosko, Evan and Ritter, Dylan and Afroz, Jalwa and Henderson, Alexander B. H. and Studer, Lorenz and Rodriques, Samuel G. and White, Andrew and Dagan, Noa and Clifton, David A. and Church, George M. and Das, Sudeshna and Tam, Jenny M. and Khurana, Vikram and Zitnik, Marinka},
journal = {arXiv pre-print},
note = {arXiv:2512.13724},
year = 2025,
doi = {10.48550/arXiv.2512.13724}
}
ayush.noori@sjc.ox.ac.uk

sdas5@mgh.harvard.edu

jenny.tam@wyss.harvard.edu

vkhurana@bwh.harvard.edu

marinka@hms.harvard.edu
Ayush Noori, Joaquin Polonuer, Katharina Meyer, Bogdan Budnik, Shad Morton, Xinyuan Wang, Sumaiya Nazeen, Yingnan He, Iñaki Arango, Lucas Vittor, Matthew Woodworth, Richard C. Krolewski, Michelle M. Li, Ninning Liu, Tushar Kamath, Evan Macosko, Dylan Ritter, Jalwa Afroz, Alexander B. H. Henderson, Lorenz Studer, Samuel G. Rodriques, Andrew White, Noa Dagan, David A. Clifton, George M. Church, Sudeshna Das, Jenny M. Tam, Vikram Khurana, Marinka Zitnik