First PWAS for AD, now TWAS for PD. Hot on the heels of a study that linked AD variants to changes in a protein-wide association study (Feb 2021 news), geneticists now have linked Parkinson’s genome-wide association hits to genes in transcriptome-wide association and epigenetic studies. In the February 1 JAMA Neurology, John Hardy, Nicholas Wood, and colleagues at University College London reported 11 PD risk gene candidates.
- Marrying GWAS with transcriptomic and epigenetic data IDs 11 PD genes.
- Four are involved in the lysosomal pathway.
- Nine are linked to familial PD genes and regulate cell growth.
- Most are highly expressed in glial cells, not neurons.
Five regulate gene expression and six affect splicing. Four are new PD genes. Nine of the 11 encode proteins that interact with known familial PD genes involved in cell growth and differentiation. Of the seven known PD risk genes, four are in the lysosomal pathway, which is tied to PD. Expression changes were more prevalent in glial cells than in neurons. That these loci emerged from multiple different analyses may strengthen researchers’ confidence about them. “Potential biases and flaws in any one dataset are mitigated when hits are consistent across them all,” Wood told Alzforum.
“This type of large-scale bioinformatics analysis is important to narrow down potential leads for researchers working on functional studies,” Konstantin Senkevich at Montreal’s McGill University told Alzforum.
Matching GWAS hits to changes in gene expression is crucial to understanding how variants act in disease. This strategy has taught scientists about the function of at least 17 PD and 23 Alzheimer’s disease risk gene candidates so far (Feb 2021 news; Oct 2020 news).
In this latest work, a collaboration between the U.K. Brain Expression Consortium (UKBEC) and the International Parkinson’s Disease Genomics Consortium (IPDGC), first author Demis Kia and colleagues identified genes that increase a person’s risk for sporadic PD. They started with a PD GWAS dataset of 26,035 cases and 403,190 controls of European ancestry (Chang et al., 2017). Of the 8 million single-nucleotide polymorphisms (SNPs) analyzed in that study, the researchers searched gene-expression databases for any genes near PD variants that were either up- or downregulated in people with PD. They chose any variants that associated with PD with a P value below 5 × 10−8. Similarly, they looked for genes near those SNPs that might have alternatively spliced variants or epigenetic changes that associate with the disease.
For expression and alternatively spliced variants, Kia and colleagues queried three data sets: Braineac, the Genotype-Tissue Expression (GTEx), and CommonMind Consortium (CMC). Braineac, which is part of the U.K. Brain Expression Consortium run by Hardy, includes expression data of 515 genes from 10 brain regions of 134 healthy controls. GTEx includes expression data of 748 genes from healthy tissue of 13 areas, each with between 80 and 154 samples. CMC includes data on expression of 5,420 genes and splicing of 3,315 genes from the dorsolateral prefrontal cortex of 206 people with schizophrenia, 52 people with affective disorder, and 206 healthy controls.
For genome-wide methylation, the authors analyzed DNA from the substantiae nigrae and frontal cortices of 134 people with PD who had donated their brains to the Parkinson Disease U.K. Brain Bank. The researchers looked for CpG methylation within a 500 kb window around the 8 million SNPs, turning up 37,460 methylation sites for further analysis.
Kia and colleagues compiled data from all three expression sets and analyzed it with statistical tools called Coloc and TWAS (see image below). Short for “co-localization,” Coloc calculates the probability that a single variant changes both PD risk and gene-expression levels. TWAS predicts whether gene expression is associated with disease by correlating GWAS hits with expression quantitative trait loci (eQTL). Linking loci to gene expression, splicing, or methylation provides functional context to GWAS hits (Apr 2019 conference news).
Go With the Flow. Gene expression (left) and splicing (right) analyses whittled thousands of genes down to 11 candidates. [Courtesy of Kia et al., JAMA Neurology, 2021.]
After Coloc analysis, nine and 42 genes from Braineac and GTEx, respectively, were expressed differently in PD than in controls and came with higher PD risk. TWAS found 137 differently expressed genes in CMC, of which 61 lay close to a known PD variant. Comparing Coloc and TWAS revealed seven overlapping hits in Braineac and 18 in GTEx. Only these five genes were common to both data sets: WDR6, CD38, GPNMB, RAB29 (also called RAB7L1), and TMEM163.
Next, the researchers searched for differently spliced genes that lay close to the GWAS SNPs. Coloc identified 25 genes from Braineac linked to PD and expression, of which 15 were alternatively spliced as indicated by expression of specific exons rather than the whole gene. Kia compared these hits to those obtained from TWAS. In CMC, TWAS spotted 129 genes that had evidence of splicing, 40 of them near PD SNPs. Lo and behold, six of those also appeared in the Coloc list—ZRANB3, PCGF3, NEK1, NUPL2, GALC, and CTSB. In total, this gave the researchers 11 genes whose expression or splicing changes in PD.
What about methylation? Of the more than 37,000 CpG sites analyzed, methylation at 134 of them, across 107 genes, was altered in the substantiae nigrae of people with PD. Of those, 116 sites in 93 genes were statistically significant, and three of those genes were also among the 11 that have altered expression in PD: GPNMB, TMEM163, and CTSB.
Of the 11 finalists, four are newcomers to PD research. They are WDR6, which suppresses cell growth; PCGF3, which remodels chromatin; ZRANB3, which stabilizes DNA replication forks; and the cell-cycle regulator NEK1.
How do these genes act in PD pathogenesis? Among the 11, the lysosomal pathway emerged most clearly. GALC encodes a lysosomal galactosidase, CTSB a lysosomal protease, GPNMB a lysosomal regulator. All are implicated in lysosomal storage disorders (reviewed in Bellomo et al., 2020; Moloney et al., 2018). RAB29 recruits LRRK2 to lysosomes, creating a complex crucial for clearing trans-Golgi vesicles. LRRK2 is a major PD gene (Sep 2018 news; Feb 2014 news).
Lysosomes have long been implicated in PD. Variants in the lysosomal glucocerebrosidase gene boost risk for the disease, and deficiencies in the enzyme have been linked to poor clearance of LRRK2 and α-synuclein, the major component of Lewy bodies (Sep 2017 news; Mar 2014 conference news).
As for the other seven genes, their links to PD are tenuous. Although transmembrane protein TMEM163, which is involved in amino acid metabolism, was linked to PD in multiple previous GWAS, small studies conducted in Spain, China, and Taiwan found no association (Tejera-Parrado et al., 2019; Wang et al., 2019; Chang et al., 2019). CD38, a glycoprotein expressed on the surface of many white blood cells, regulates neuroinflammation and has been indirectly implicated in neurodegeneration (Guerreiro et al., 2020). Nucleoporin-like protein 2 (NUPL2) may be involved with exporting mRNA from the nucleus into the cytoplasm.
Three of the 11 candidates have links to other neurological diseases. CTSB came up in a recent AD GWAS (Jul 2020 news), and transcriptional profiling of microglia found GPNMB upregulation in mouse models of amyloidosis and tauopathy (Aug 2018 news). NEK1 is a rare risk variant in amyotrophic lateral sclerosis (Aug 2016 news).
Almost All Glia. PD gene candidates are more highly expressed in human glial cells, including astrocytes (gray), microglia (light blue), and oligodendrocytes (orange and tan), than in neurons (dark blue) or endothelial cells (white). [Courtesy of Kia et al., JAMA Neurology, 2021.]
Kia and colleagues then asked which cell types express these 11 candidate genes. They pulled expression information from transcriptome data of brain cells from healthy people. Astrocytes, oligodendrocytes, and microglia throughout the brain, including in the substantia nigra, more highly expressed most of these genes than did neurons (see image above). “This shows that the immune system is playing an important role in Parkinson’s disease,” Senkevich said.
Mathias Toft, University of Oslo, Norway, was surprised by the preponderance of glial expression. His group recently compared PD GWAS hits with areas of active chromatin, a marker of expression, and found them mostly in cortical neurons (Berge-Seidl et al., 2021). “We found enrichment of PD risk variants in regulatory gene regions of neurons, indicating that neurons, not glial cells, are the primary mediators of genetic risk for PD,” he wrote to Alzforum (full comment below).
Kia and colleagues took their analysis one step further by compiling co-expression networks to determine how the 11 genes fit into molecular pathways in different areas of the brain. In the nucleus accumbens, the caudate, and putamen, the scientists found NUPL2 to be involved in protein ubiquitination and degradation. In the frontal cortex and caudate, respectively, TMEM163 and ZRANB3 coordinate synapse signaling and cell communication.
Taking a similar approach at the protein level, the scientists found that nine of the 11 proteins interacted with proteins known to cause familial PD at a rate higher than chance alone would predict. Such a network regulates ErbB and the related EGFR signaling pathways, which are crucial to cell growth and differentiation (see image below).
Guilt by Association. In a protein-protein network analysis (top), nine of 11 PD candidates (green) interact through intermediates (gray) with proteins known to cause familial PD (pink), including LRRK2, PINK1, SNCA, and GBA. Searching for functional pathways (bottom) common to the familial PD genes and the nine candidates (yellow) brought out ErbB and EGFR signaling (circled in red). [Courtesy of Kia et al., JAMA Neurology, 2021.]
This study is one of a growing number that push past GWAS hits toward gene function in disease. Wood believes that as datasets expand and statistical tools advance, researchers can dig deeper into the functional implications of GWAS loci. “Marrying and analyzing large datasets is in its infancy,” he said. “Publicly available databanks are really good quality and they are here to stay. They’re going to get larger, deeper, and include more diverse populations as -omic sciences come to fruition.”—Chelsea Weidman Burke
- PWAS x GWAS? Proteome Analysis Nets 10 New Alzheimer’s Genes
- Epigenomic Roadmap Points to Causal Genes
- Expression, Expression, Expression—Time to Get on Board with eQTLs
- Largest Meta-GWAS Yet Uncovers New Genetic Links to Parkinson’s
- Researchers Build on GWAS to Parse Genetic Players in AD and PD
- Does LRRK2 Sweep α-Synuclein from the Cell?
- LRRK2 Interactions Identify New Parkinson’s Genes, Implicate Autophagy
- Lysosomes Take Center Stage in Parkinson’s and Frontotemporal Dementia
- Protecting Neurons by Ramping Up Waste Disposal?
- Doubling Down on Sequencing Serves up More Alzheimer’s Genes
- ApoE: Common Microglial Culprit in Aging, Alzheimer’s, and Tauopathy?
- Genetic Studies Uncover Four New ALS Genes
- Chang D, Nalls MA, Hallgrímsdóttir IB, Hunkapiller J, van der Brug M, Cai F, International Parkinson's Disease Genomics Consortium, 23andMe Research Team, Kerchner GA, Ayalon G, Bingol B, Sheng M, Hinds D, Behrens TW, Singleton AB, Bhangale TR, Graham RR. A meta-analysis of genome-wide association studies identifies 17 new Parkinson's disease risk loci. Nat Genet. 2017 Sep 11; PubMed.
- Murthy MN, Blauwendraat C, UKBEC, Guelfi S, IPDGC, Hardy J, Lewis PA, Trabzuni D. Increased brain expression of GPNMB is associated with genome wide significant risk for Parkinson's disease on chromosome 7p15.3. Neurogenetics. 2017 Jul;18(3):121-133. Epub 2017 Apr 8 PubMed.
- Bellomo G, Paciotti S, Gatticchi L, Parnetti L. The vicious cycle between α-synuclein aggregation and autophagic-lysosomal dysfunction. Mov Disord. 2020 Jan;35(1):34-44. Epub 2019 Nov 15 PubMed.
- Moloney EB, Moskites A, Ferrari EJ, Isacson O, Hallett PJ. The glycoprotein GPNMB is selectively elevated in the substantia nigra of Parkinson's disease patients and increases after lysosomal stress. Neurobiol Dis. 2018 Dec;120:1-11. Epub 2018 Aug 24 PubMed.
- Tejera-Parrado C, Jesús S, Periñán MT, Buiza-Rueda D, Oliva-Ariza G, Adarmes-Gómez AD, Macías-García D, Gómez-Garre P, Mir P. A replication study of GWAS-genetic risk variants associated with Parkinson's disease in a Spanish population. Neurosci Lett. 2019 Nov 1;712:134425. Epub 2019 Aug 17 PubMed.
- Wang L, Li NN, Lu ZJ, Li JY, Peng JX, Duan LR, Peng R. Association of three candidate genetic variants in ACMSD/TMEM163, GPNMB and BCKDK /STX1B with sporadic Parkinson's disease in Han Chinese. Neurosci Lett. 2019 Jun 11;703:45-48. Epub 2019 Mar 14 PubMed.
- Chang KH, Chen CM, Chen YC, Fung HC, Wu YR. Polymorphisms of ACMSD-TMEM163, MCCC1, and BCKDK-STX1B Are Not Associated with Parkinson's Disease in Taiwan. Parkinsons Dis. 2019;2019:3489638. Epub 2019 Jan 2 PubMed.
- Guerreiro S, Privat AL, Bressac L, Toulorge D. CD38 in Neurodegeneration and Neuroinflammation. Cells. 2020 Feb 18;9(2) PubMed.
- Berge-Seidl V, Pihlstrøm L, Toft M. Integrative analysis identifies bHLH transcription factors as contributors to Parkinson's disease risk mechanisms. Sci Rep. 2021 Feb 10;11(1):3502. PubMed.
- Nalls MA, Blauwendraat C, Vallerga CL, Heilbron K, Bandres-Ciga S, Chang D, Tan M, Kia DA, Noyce AJ, Xue A, Bras J, Young E, von Coelln R, Simón-Sánchez J, Schulte C, Sharma M, Krohn L, Pihlstrøm L, Siitonen A, Iwaki H, Leonard H, Faghri F, Gibbs JR, Hernandez DG, Scholz SW, Botia JA, Martinez M, Corvol JC, Lesage S, Jankovic J, Shulman LM, Sutherland M, Tienari P, Majamaa K, Toft M, Andreassen OA, Bangale T, Brice A, Yang J, Gan-Or Z, Gasser T, Heutink P, Shulman JM, Wood NW, Hinds DA, Hardy JA, Morris HR, Gratten J, Visscher PM, Graham RR, Singleton AB, 23andMe Research Team, System Genomics of Parkinson's Disease Consortium, International Parkinson's Disease Genomics Consortium. Identification of novel risk loci, causal insights, and heritable risk for Parkinson's disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 2019 Dec;18(12):1091-1102. PubMed.
- Kia DA, Zhang D, Guelfi S, Manzoni C, Hubbard L, Reynolds RH, Botía J, Ryten M, Ferrari R, Lewis PA, Williams N, Trabzuni D, Hardy J, Wood NW, United Kingdom Brain Expression Consortium (UKBEC) and the International Parkinson’s Disease Genomics Consortium (IPDGC). Identification of Candidate Parkinson Disease Genes by Integrating Genome-Wide Association Study, Expression, and Epigenetic Data Sets. JAMA Neurol. 2021 Feb 1; PubMed.