%0 Journal Article %J Genet Med %D 2020 %T Electronic health record phenotypes associated with genetically regulated expression of CFTR and application to cystic fibrosis. %A Zhong, Xue %A Yin, Zhijun %A Jia, Gengjie %A Zhou, Dan %A Wei, Qiang %A Faucon, Annika %A Evans, Patrick %A Gamazon, Eric R %A Li, Bingshan %A Tao, Ran %A Rzhetsky, Andrey %A Bastarache, Lisa %A Cox, Nancy J %K Adult %K Cystic Fibrosis %K Cystic Fibrosis Transmembrane Conductance Regulator %K Electronic Health Records %K Humans %K Mutation %K Phenotype %X

PURPOSE: The increasing use of electronic health records (EHRs) and biobanks offers unique opportunities to study Mendelian diseases. We described a novel approach to summarize clinical manifestations from patient EHRs into phenotypic evidence for cystic fibrosis (CF) with potential to alert unrecognized patients of the disease.

METHODS: We estimated genetically predicted expression (GReX) of cystic fibrosis transmembrane conductance regulator (CFTR) and tested for association with clinical diagnoses in the Vanderbilt University biobank (N = 9142 persons of European descent with 71 cases of CF). The top associated EHR phenotypes were assessed in combination as a phenotype risk score (PheRS) for discriminating CF case status in an additional 2.8 million patients from Vanderbilt University Medical Center (VUMC) and 125,305 adult patients including 25,314 CF cases from MarketScan, an independent external cohort.

RESULTS: GReX of CFTR was associated with EHR phenotypes consistent with CF. PheRS constructed using the EHR phenotypes and weights discovered by the genetic associations improved discriminative power for CF over the initially proposed PheRS in both VUMC and MarketScan.

CONCLUSION: Our study demonstrates the power of EHRs for clinical description of CF and the benefits of using a genetics-informed weighing scheme in construction of a phenotype risk score. This research may find broad applications for phenomic studies of Mendelian disease genes.

%B Genet Med %V 22 %P 1191-1200 %8 2020 07 %G eng %N 7 %1 https://www.ncbi.nlm.nih.gov/pubmed/32296164?dopt=Abstract %R 10.1038/s41436-020-0786-5 %0 Journal Article %J Nat Med %D 2020 %T Phenome-based approach identifies RIC1-linked Mendelian syndrome through zebrafish models, biobank associations and clinical studies. %A Unlu, Gokhan %A Qi, Xinzi %A Gamazon, Eric R %A Melville, David B %A Patel, Nisha %A Rushing, Amy R %A Hashem, Mais %A Al-Faifi, Abdullah %A Chen, Rui %A Li, Bingshan %A Cox, Nancy J %A Alkuraya, Fowzan S %A Knapik, Ela W %K Abnormalities, Multiple %K Animals %K Behavior, Animal %K Biological Specimen Banks %K Chondrocytes %K Disease Models, Animal %K Extracellular Matrix %K Fibroblasts %K Guanine Nucleotide Exchange Factors %K Humans %K Models, Biological %K Musculoskeletal System %K Osteogenesis %K Phenomics %K Phenotype %K Procollagen %K Protein Transport %K Secretory Pathway %K Syndrome %K Zebrafish %K Zebrafish Proteins %X

Discovery of genotype-phenotype relationships remains a major challenge in clinical medicine. Here, we combined three sources of phenotypic data to uncover a new mechanism for rare and common diseases resulting from collagen secretion deficits. Using a zebrafish genetic screen, we identified the ric1 gene as being essential for skeletal biology. Using a gene-based phenome-wide association study (PheWAS) in the EHR-linked BioVU biobank, we show that reduced genetically determined expression of RIC1 is associated with musculoskeletal and dental conditions. Whole-exome sequencing identified individuals homozygous-by-descent for a rare variant in RIC1 and, through a guided clinical re-evaluation, it was discovered that they share signs with the BioVU-associated phenome. We named this new Mendelian syndrome CATIFA (cleft lip, cataract, tooth abnormality, intellectual disability, facial dysmorphism, attention-deficit hyperactivity disorder) and revealed further disease mechanisms. This gene-based, PheWAS-guided approach can accelerate the discovery of clinically relevant disease phenome and associated biological mechanisms.

%B Nat Med %V 26 %P 98-109 %8 2020 01 %G eng %N 1 %1 https://www.ncbi.nlm.nih.gov/pubmed/31932796?dopt=Abstract %R 10.1038/s41591-019-0705-y %0 Journal Article %J Cancer Res %D 2020 %T A Transcriptome-Wide Association Study Identifies Candidate Susceptibility Genes for Pancreatic Cancer Risk. %A Liu, Duo %A Zhou, Dan %A Sun, Yanfa %A Zhu, Jingjing %A Ghoneim, Dalia %A Wu, Chong %A Yao, Qizhi %A Gamazon, Eric R %A Cox, Nancy J %A Wu, Lang %K Age Factors %K Case-Control Studies %K European Continental Ancestry Group %K Female %K Gene Expression Regulation, Neoplastic %K Genetic Predisposition to Disease %K Genome-Wide Association Study %K Humans %K Male %K Models, Genetic %K Pancreatic Neoplasms %K Polymorphism, Single Nucleotide %X

Pancreatic cancer is among the most well-characterized cancer types, yet a large proportion of the heritability of pancreatic cancer risk remains unclear. Here, we performed a large transcriptome-wide association study to systematically investigate associations between genetically predicted gene expression in normal pancreas tissue and pancreatic cancer risk. Using data from 305 subjects of mostly European descent in the Genotype-Tissue Expression Project, we built comprehensive genetic models to predict normal pancreas tissue gene expression, modifying the UTMOST (unified test for molecular signatures). These prediction models were applied to the genetic data of 8,275 pancreatic cancer cases and 6,723 controls of European ancestry. Thirteen genes showed an association of genetically predicted expression with pancreatic cancer risk at an FDR ≤ 0.05, including seven previously reported genes (, and ) and six novel genes not yet reported for pancreatic cancer risk [6q27: OR (95% confidence interval (CI), 1.54 (1.25-1.89); 13q12.13: OR (95% CI), 0.78 (0.70-0.88); 14q24.3: OR (95% CI), 1.35 (1.17-1.56); 17q12: OR (95% CI), 6.49 (2.96-14.27); 17q21.1: OR (95% CI), 1.94 (1.45-2.58); and 20p13: OR (95% CI): 1.41 (1.20-1.66)]. The associations for 10 of these genes (, and ) remained statistically significant even after adjusting for risk SNPs identified in previous genome-wide association study. Collectively, this analysis identified novel candidate susceptibility genes for pancreatic cancer that warrant further investigation. SIGNIFICANCE: A transcriptome-wide association analysis identified seven previously reported and six novel candidate susceptibility genes for pancreatic cancer risk.

%B Cancer Res %V 80 %P 4346-4354 %8 2020 10 15 %G eng %N 20 %1 https://www.ncbi.nlm.nih.gov/pubmed/32907841?dopt=Abstract %R 10.1158/0008-5472.CAN-20-1353 %0 Journal Article %J Nat Genet %D 2020 %T A unified framework for joint-tissue transcriptome-wide association and Mendelian randomization analysis. %A Zhou, Dan %A Jiang, Yi %A Zhong, Xue %A Cox, Nancy J %A Liu, Chunyu %A Gamazon, Eric R %K Animals %K Gene Expression Profiling %K Genetic Association Studies %K Humans %K Lipoproteins, LDL %K Mendelian Randomization Analysis %K Mice %K Models, Genetic %K Multifactorial Inheritance %K Predictive Value of Tests %X

Here, we present a joint-tissue imputation (JTI) approach and a Mendelian randomization framework for causal inference, MR-JTI. JTI borrows information across transcriptomes of different tissues, leveraging shared genetic regulation, to improve prediction performance in a tissue-dependent manner. Notably, JTI includes the single-tissue imputation method PrediXcan as a special case and outperforms other single-tissue approaches (the Bayesian sparse linear mixed model and Dirichlet process regression). MR-JTI models variant-level heterogeneity (primarily due to horizontal pleiotropy, addressing a major challenge of transcriptome-wide association study interpretation) and performs causal inference with type I error control. We make explicit the connection between the genetic architecture of gene expression and of complex traits and the suitability of Mendelian randomization as a causal inference strategy for transcriptome-wide association studies. We provide a resource of imputation models generated from GTEx and PsychENCODE panels. Analysis of biobanks and meta-analysis data, and extensive simulations show substantially improved statistical power, replication and causal mapping rate for JTI relative to existing approaches.

%B Nat Genet %V 52 %P 1239-1246 %8 2020 11 %G eng %N 11 %1 https://www.ncbi.nlm.nih.gov/pubmed/33020666?dopt=Abstract %R 10.1038/s41588-020-0706-2 %0 Journal Article %J Am J Hum Genet %D 2019 %T GRIK5 Genetically Regulated Expression Associated with Eye and Vascular Phenomes: Discovery through Iteration among Biobanks, Electronic Health Records, and Zebrafish. %A Unlu, Gokhan %A Gamazon, Eric R %A Qi, Xinzi %A Levic, Daniel S %A Bastarache, Lisa %A Denny, Joshua C %A Roden, Dan M %A Mayzus, Ilya %A Breyer, Max %A Zhong, Xue %A Konkashbaev, Anuar I %A Rzhetsky, Andrey %A Knapik, Ela W %A Cox, Nancy J %X

Although the use of model systems for studying the mechanism of mutations that have a large effect is common, we highlight here the ways that zebrafish-model-system studies of a gene, GRIK5, that contributes to the polygenic liability to develop eye diseases have helped to illuminate a mechanism that implicates vascular biology in eye disease. A gene-expression prediction derived from a reference transcriptome panel applied to BioVU, a large electronic health record (EHR)-linked biobank at Vanderbilt University Medical Center, implicated reduced GRIK5 expression in diverse eye diseases. We tested the function of GRIK5 by depletion of its ortholog in zebrafish, and we observed reduced blood vessel numbers and integrity in the eye and increased vascular permeability. Analyses of EHRs in >2.6 million Vanderbilt subjects revealed significant comorbidity of eye and vascular diseases (relative risks 2-15); this comorbidity was confirmed in 150 million individuals from a large insurance claims dataset. Subsequent studies in >60,000 genotyped BioVU participants confirmed the association of reduced genetically predicted expression of GRIK5 with comorbid vascular and eye diseases. Our studies pioneer an approach that allows a rapid iteration of the discovery of gene-phenotype relationships to the primary genetic mechanism contributing to the pathophysiology of human disease. Our findings also add dimension to the understanding of the biology driven by glutamate receptors such as GRIK5 (also referred to as GLUK5 in protein form) and to mechanisms contributing to human eye diseases.

%B Am J Hum Genet %V 104 %P 503-519 %8 2019 Mar 07 %G eng %N 3 %1 https://www.ncbi.nlm.nih.gov/pubmed/30827500?dopt=Abstract %R 10.1016/j.ajhg.2019.01.017 %0 Journal Article %J Genet Epidemiol %D 2018 %T An ancestry-based approach for detecting interactions. %A Park, Danny S %A Eskin, Itamar %A Kang, Eun Yong %A Gamazon, Eric R %A Eng, Celeste %A Gignoux, Christopher R %A Galanter, Joshua M %A Burchard, Esteban %A Ye, Chun J %A Aschard, Hugues %A Eskin, Eleazar %A Halperin, Eran %A Zaitlen, Noah %K African Americans %K African Continental Ancestry Group %K DNA Methylation %K Epistasis, Genetic %K European Continental Ancestry Group %K Gene-Environment Interaction %K Hispanic Americans %K Humans %K Models, Genetic %K Phenotype %X

BACKGROUND: Epistasis and gene-environment interactions are known to contribute significantly to variation of complex phenotypes in model organisms. However, their identification in human association studies remains challenging for myriad reasons. In the case of epistatic interactions, the large number of potential interacting sets of genes presents computational, multiple hypothesis correction, and other statistical power issues. In the case of gene-environment interactions, the lack of consistently measured environmental covariates in most disease studies precludes searching for interactions and creates difficulties for replicating studies.

RESULTS: In this work, we develop a new statistical approach to address these issues that leverages genetic ancestry, defined as the proportion of ancestry derived from each ancestral population (e.g., the fraction of European/African ancestry in African Americans), in admixed populations. We applied our method to gene expression and methylation data from African American and Latino admixed individuals, respectively, identifying nine interactions that were significant at P<5×10-8. We show that two of the interactions in methylation data replicate, and the remaining six are significantly enriched for low P-values (P<1.8×10-6).

CONCLUSION: We show that genetic ancestry can be a useful proxy for unknown and unmeasured covariates in the search for interaction effects. These results have important implications for our understanding of the genetic architecture of complex traits.

%B Genet Epidemiol %V 42 %P 49-63 %8 2018 02 %G eng %N 1 %1 https://www.ncbi.nlm.nih.gov/pubmed/29114909?dopt=Abstract %R 10.1002/gepi.22087