Graph AI generates neurological hypotheses validated in molecular, organoid, and clinical systems


Introduction

Neurological diseases are the leading global cause of disability, yet most lack disease-modifying treatments. To help address this gap, we developed PROTON, a graph AI model that generates hypotheses for neurological disease.

We demonstrate diverse disease-specific applications of PROTON using experimental and clinical data in three neurological conditions: Parkinson's disease (PD), bipolar disorder (BD), and Alzheimer's disease (AD). PROTON nominates candidate drugs, forecasts drug approvals, prioritizes pesticides, and identifies genetic, proteomic, and protein–protein interaction links across multiple biological scales.

Figure-1
Figure 1: Overview of PROTON. PROTON is a graph AI model for neurological disease. (a) We demonstrate diverse disease-specific applications of PROTON using experimental and clinical data in three neurological conditions: Parkinson's disease (PD), bipolar disorder (BD), and Alzheimer's disease (AD). PROTON nominates candidate drugs, forecasts drug approvals, prioritizes pesticides, and identifies genetic, proteomic, and protein–protein interaction links across multiple biological scales. (b) To develop PROTON, we first constructed NeuroKG, a heterogeneous biomedical knowledge graph with 147,020 nodes (16 types) and 7,366,745 edges (47 types). Diseases have unique connectivity patterns in NeuroKG; for example, 200 type-balanced neighbors from the two-hop neighborhoods of PD, BD, and AD are shown on a single shared layout. The one-hop neighborhood of each disease node is rendered in full color; two-hop and out-of-neighborhood nodes are rendered faintly. (c) We pre-trained a 578-million-parameter heterogeneous graph transformer (HGT) on NeuroKG with a self-supervised link-prediction objective. Node and edge type-aware multi-head attention enabled neighborhood integration across modalities.

PROTON is a 578-million-parameter heterogeneous graph transformer contextualized to the human brain. PROTON was trained on NeuroKG using a self-supervised link prediction objective. NeuroKG is a biomedical knowledge graph that unifies 36 human datasets and ontologies, and integrates single-nucleus RNA-sequencing atlases comprising 3,756,702 cells from the adult human brain. The training data contained 147,020 nodes across 16 entity types and 7,366,745 edges across 47 relation types. Through Bayesian hyperparameter optimization, we selected a model architecture that achieved high link-prediction performance (AUROC = 0.9145; accuracy = 82.23%) on an independent test set.

Evaluations of AI predictions in molecular, organoid, and clinical systems

Genome-wide α-synuclein experiments

Human patient-derived brain organoids

Health records from 610,524 patients

Evaluations of AI predictions in molecular, organoid, and clinical systems

Parkinson’s disease

Genome-wide α-synuclein experiments

In PD, PROTON linked genetic risk loci to genes essential for dopaminergic neuron survival and predicted pesticides toxic to patient-derived neurons, including the insecticide Naled, which ranked within the top 6.75% of predictions. In silico screens performed by PROTON reproduced six genome-wide α\alpha-synuclein experiments, including a split-ubiquitin yeast two-hybrid system (normalized enrichment score [NES] = 2.30, FDR-adjusted p<1×104p < 1 \times 10^{-4}), an ascorbate peroxidase proximity labeling assay (NES = 2.16, FDR <1×104< 1 \times 10^{-4}), and a high-depth targeted deep sequencing study in 496 synucleinopathy patients (NES = 2.13, FDR <1×104< 1 \times 10^{-4}).

Figure-2
Figure 2: PROTON prioritizes genes and pesticides linked to Parkinson's disease. (a) PROTON conducted three proteome-scale in silico screens: genetic relationship with PD, protein–protein interaction with α\alpha-synuclein, and a combined prediction. (b) PROTON predictions were compared to six genome-wide experimental α\alpha-synuclein studies. (c) Gene set enrichment analysis (GSEA) demonstrated that PROTON predictions were significantly enriched for experimental hits, outperforming a graph random walk baseline. Results are grouped by experiment class: α\alpha-synuclein interactome profiling or α\alpha-synuclein genetic studies in humans or disease models. N.S. == not significant. (d) We compiled 288 PD GWAS-linked genes across multiple genetic study types and contrasted them against 8,976 GWAS hits from 10 other diseases. (e) An unbiased whole-genome CRISPR screen in human midbrain dopaminergic (DA) neurons identified genes essential for neuronal survival. (f) For each essential gene, PROTON performed a proteome-wide in silico screen to rank likely protein–protein interactors; the median rank of disease GWAS hits showed the strongest association for PD compared with other diseases. (g) For 14 essential genes enriched in autophagy regulation and previously implicated in PD by NERINE, an independent network-based method for rare variant burden analysis, PROTON's minimum PD GWAS association rank per gene was compared to an empirical null, highlighting EXOC4 and HAX1. (h) Fine-tuning dataset constructed from the Parkinson’s Environment and Genes (PEG) study (n=829n=829 PD patients and n=824n=824 controls), comprising 28 pesticides linked to PD and 100 negative samples. (i) The held-out test dataset consisted of four pesticides toxic to iPSC-derived midbrain DA neurons from a PD patient. (j) Schematic of the PROTON pesticide prediction pipeline. (k) PROTON prioritized pesticides toxic to patient-derived dopaminergic neurons, including endosulfan, dicofol, Naled, and copper sulfate.

Bipolar disorder

Human patient-derived brain organoids

In BD, PROTON nominated calcitriol as a candidate drug that reversed proteomic alterations observed in cortical organoids derived from BD patients.

Figure-3
Figure 3: PROTON-nominated drug reverses proteomic alterations in patient-derived organoid models of bipolar disorder. (a) For each of 17 neurological diseases, we constructed disease-centric data splits by withholding drug information from the training set for the disease of interest and all related diseases. An example disease-centric data split is shown for BD. (b) Across 17 neurological diseases, PROTON demonstrates strong performance at recovering drugs that are FDA approved, in late-stage clinical trials, or prescribed off-label, as measured by macro-averaged recall (RR) at kk. (c) PROTON highlighted calcitriol as a candidate therapeutic for BD. (d) Human cortical organoids derived from BD patients and healthy controls were treated with calcitriol, followed by deep proteomic profiling. (e–f) Volcano plots show differentially expressed proteins (DEPs) in BD versus control organoids (512 DEPs, pp < 0.05) and in BD versus drug-treated BD organoids (633 DEPs, pp < 0.05). (g) Expression levels of 73 proteins dysregulated in BD organoids were normalized by calcitriol treatment, as demonstrated by the unsupervised clustering of calcitriol-treated BD organoids with control organoids versus untreated BD organoids. (h) Key pathways affected by calcitriol treatment include metabolism and protein homeostasis, RNA processing, and synaptic adhesion. (i) Comparative proteomics revealed both distinct and overlapping proteomic effects of calcitriol and lithium in patient-derived organoids.

Alzheimer’s disease

Health records from 610,524 patients

In AD, we evaluated PROTON predictions in electronic health records from nn = 610,524 patients at Mass General Brigham, confirming that five PROTON-predicted drugs were associated with reduced seven-year dementia risk (minimum hazard ratio = 0.63, 95% CI: 0.53–0.75, p<1×107p < 1 \times 10^{-7}).

Figure-4
Figure 4: PROTON predicts drugs that reduce the risk of Alzheimer's disease and related dementias in EHR analyses of 610,525 patients. (a) To evaluate PROTON drug repurposing predictions, we conducted retrospective analyses of Mass General Brigham health records data across nn = 610,524 patients. For each candidate drug, patients with the relevant primary indication were divided into treatment and control cohorts. We compared ADRD-free survival between cohorts using Cox proportional hazards models with inverse probability treatment weighting for age and sex. (b) Five of eight model-predicted drugs were significantly associated with reduced ADRD risk (pp < 0.05). The other three drugs neither significantly increased nor decreased risk (pp > 0.05). (c) Cohort sizes for treatment and control groups across retrospective EHR analyses. (d) Cox hazard ratios and pp-values. (e–g) Kaplan–Meier survival curves show reduced ADRD incidence for aflibercept, dapagliflozin, and valsartan. Log-rank pp-values and χ2\chi^2 statistics are reported.

Citation

PROTON is released under the MIT License. If you use PROTON, please consider citing our paper.

@article{noori_graph_2025,
	title = {Graph {{AI}} generates neurological hypotheses validated in molecular, organoid, and clinical systems},
	author = {Noori, Ayush and Polonuer, Joaqu{\'i}n and Meyer, Katharina and Budnik, Bogdan and Morton, Shad and Wang, Xinyuan and Nazeen, Sumaiya and He, Yingnan and Arango, I{\~n}aki and Vittor, Lucas and Woodworth, Matthew and Krolewski, Richard C. and Li, Michelle M. and Liu, Ninning and Kamath, Tushar and Macosko, Evan and Ritter, Dylan and Afroz, Jalwa and Henderson, Alexander B. H. and Studer, Lorenz and Rodriques, Samuel G. and White, Andrew and Dagan, Noa and Clifton, David A. and Church, George M. and Das, Sudeshna and Tam, Jenny M. and Khurana, Vikram and Zitnik, Marinka},
	journal = {arXiv pre-print},
	note = {arXiv:2512.13724},
	year = 2025,
	doi = {10.48550/arXiv.2512.13724}
}

Team

Ayush Noori
Ayush Noori

ayush.noori@sjc.ox.ac.uk

Sudeshna Das
Sudeshna Das

sdas5@mgh.harvard.edu

Jenny M. Tam
Jenny M. Tam

jenny.tam@wyss.harvard.edu

Vikram Khurana
Vikram Khurana

vkhurana@bwh.harvard.edu

Marinka Zitnik
Marinka Zitnik

marinka@hms.harvard.edu

Ayush Noori, Joaquin Polonuer, Katharina Meyer, Bogdan Budnik, Shad Morton, Xinyuan Wang, Sumaiya Nazeen, Yingnan He, Iñaki Arango, Lucas Vittor, Matthew Woodworth, Richard C. Krolewski, Michelle M. Li, Ninning Liu, Tushar Kamath, Evan Macosko, Dylan Ritter, Jalwa Afroz, Alexander B. H. Henderson, Lorenz Studer, Samuel G. Rodriques, Andrew White, Noa Dagan, David A. Clifton, George M. Church, Sudeshna Das, Jenny M. Tam, Vikram Khurana, Marinka Zitnik