%0 Journal Article %J Genet Epidemiol %D 2021 %T Incorporating European GWAS findings improve polygenic risk prediction accuracy of breast cancer among East Asians. %A Ji, Ying %A Long, Jirong %A Kweon, Sun-Seog %A Kang, Daehee %A Kubo, Michiaki %A Park, Boyoung %A Shu, Xiao-Ou %A Zheng, Wei %A Tao, Ran %A Li, Bingshan %X

Previous genome-wide association studies (GWASs) have been largely focused on European (EUR) populations. However, polygenic risk scores (PRSs) derived from EUR have been shown to perform worse in non-EURs compared with EURs. In this study, we aim to improve PRS prediction in East Asians (EASs). We introduce a rescaled meta-analysis framework to combine both EUR (N = 122,175) and EAS (N = 30,801) GWAS summary statistics. To improve PRS prediction in EASs, we use a scaling factor to up-weight the EAS data, such that the resulting effect size estimates are more relevant to EASs. We then derive PRSs for EAS from the rescaled meta-analysis results of EAS and EUR data. Evaluated in an independent EAS validation data set, this approach increases the prediction liability-adjusted Nagelkerke's pseudo R by 40%, 41%, and 5%, respectively, compared with PRSs derived from an EAS GWAS only, EUR GWAS only, and conventional fixed-effects meta-analysis of EAS and EUR data. The PRS derived from the rescaled meta-analysis approach achieved an area under the receiver operating characteristic curve (AUC) of 0.6059, higher than AUC = 0.5782, 0.5809, 0.6008 for EAS, EUR, and conventional meta-analysis of EAS and EUR. We further compare PRSs constructed by single-nucleotide polymorphisms that have different linkage disequilibrium (LD) scores and minor allele frequencies (MAFs) between EUR and EAS, and observe that lower LD scores or MAF in EAS correspond to poorer PRS performance (AUC = 0.5677, 0.5530, respectively) than higher LD scores or MAF (AUC = 0.589, 0.5993, respectively). We finally build a PRS stratified by LD score differences in EUR and EAS using rescaled meta-analysis, and obtain an AUC of 0.6096, with improvement over other strategies investigated.

%B Genet Epidemiol %8 2021 Mar 19 %G eng %1 https://www.ncbi.nlm.nih.gov/pubmed/33739539?dopt=Abstract %R 10.1002/gepi.22382 %0 Journal Article %J Genet Med %D 2020 %T Electronic health record phenotypes associated with genetically regulated expression of CFTR and application to cystic fibrosis. %A Zhong, Xue %A Yin, Zhijun %A Jia, Gengjie %A Zhou, Dan %A Wei, Qiang %A Faucon, Annika %A Evans, Patrick %A Gamazon, Eric R %A Li, Bingshan %A Tao, Ran %A Rzhetsky, Andrey %A Bastarache, Lisa %A Cox, Nancy J %K Adult %K Cystic Fibrosis %K Cystic Fibrosis Transmembrane Conductance Regulator %K Electronic Health Records %K Humans %K Mutation %K Phenotype %X

PURPOSE: The increasing use of electronic health records (EHRs) and biobanks offers unique opportunities to study Mendelian diseases. We described a novel approach to summarize clinical manifestations from patient EHRs into phenotypic evidence for cystic fibrosis (CF) with potential to alert unrecognized patients of the disease.

METHODS: We estimated genetically predicted expression (GReX) of cystic fibrosis transmembrane conductance regulator (CFTR) and tested for association with clinical diagnoses in the Vanderbilt University biobank (N = 9142 persons of European descent with 71 cases of CF). The top associated EHR phenotypes were assessed in combination as a phenotype risk score (PheRS) for discriminating CF case status in an additional 2.8 million patients from Vanderbilt University Medical Center (VUMC) and 125,305 adult patients including 25,314 CF cases from MarketScan, an independent external cohort.

RESULTS: GReX of CFTR was associated with EHR phenotypes consistent with CF. PheRS constructed using the EHR phenotypes and weights discovered by the genetic associations improved discriminative power for CF over the initially proposed PheRS in both VUMC and MarketScan.

CONCLUSION: Our study demonstrates the power of EHRs for clinical description of CF and the benefits of using a genetics-informed weighing scheme in construction of a phenotype risk score. This research may find broad applications for phenomic studies of Mendelian disease genes.

%B Genet Med %V 22 %P 1191-1200 %8 2020 07 %G eng %N 7 %1 https://www.ncbi.nlm.nih.gov/pubmed/32296164?dopt=Abstract %R 10.1038/s41436-020-0786-5 %0 Journal Article %J Circ Genom Precis Med %D 2020 %T Multi-Ethnic Genome-Wide Association Study of Decomposed Cardioelectric Phenotypes Illustrates Strategies to Identify and Characterize Evidence of Shared Genetic Effects for Complex Traits. %A Baldassari, Antoine R %A Sitlani, Colleen M %A Highland, Heather M %A Arking, Dan E %A Buyske, Steve %A Darbar, Dawood %A Gondalia, Rahul %A Graff, Misa %A Guo, Xiuqing %A Heckbert, Susan R %A Hindorff, Lucia A %A Hodonsky, Chani J %A Ida Chen, Yii-Der %A Kaplan, Robert C %A Peters, Ulrike %A Post, Wendy %A Reiner, Alex P %A Rotter, Jerome I %A Shohet, Ralph V %A Seyerle, Amanda A %A Sotoodehnia, Nona %A Tao, Ran %A Taylor, Kent D %A Wojcik, Genevieve L %A Yao, Jie %A Kenny, Eimear E %A Lin, Henry J %A Soliman, Elsayed Z %A Whitsel, Eric A %A North, Kari E %A Kooperberg, Charles %A Avery, Christy L %X

BACKGROUND: We examined how expanding electrocardiographic trait genome-wide association studies to include ancestrally diverse populations, prioritize more precise phenotypic measures, and evaluate evidence for shared genetic effects enabled the detection and characterization of loci.

METHODS: We decomposed 10 seconds, 12-lead electrocardiograms from 34 668 multi-ethnic participants (15% Black; 30% Hispanic/Latino) into 6 contiguous, physiologically distinct (P wave, PR segment, QRS interval, ST segment, T wave, and TP segment) and 2 composite, conventional (PR interval and QT interval) interval scale traits and conducted multivariable-adjusted, trait-specific univariate genome-wide association studies using 1000-G imputed single-nucleotide polymorphisms. Evidence of shared genetic effects was evaluated by aggregating meta-analyzed univariate results across the 6 continuous electrocardiographic traits using the combined phenotype adaptive sum of powered scores test.

RESULTS: We identified 6 novels (, and ) and 87 known loci (adaptive sum of powered score test <5×10). Lead single-nucleotide polymorphism rs3211938 at was common in Blacks (minor allele frequency=10%), near monomorphic in European Americans, and had effects on the QT interval and TP segment that ranked among the largest reported to date for common variants. The other 5 novel loci were observed when evaluating the contiguous but not the composite electrocardiographic traits. Combined phenotype testing did not identify novel electrocardiographic loci unapparent using traditional univariate approaches, although this approach did assist with the characterization of known loci.

CONCLUSIONS: Despite including one-third as many participants as published electrocardiographic trait genome-wide association studies, our study identified 6 novel loci, emphasizing the importance of ancestral diversity and phenotype resolution in this era of ever-growing genome-wide association studies.

%B Circ Genom Precis Med %V 13 %P e002680 %8 2020 08 %G eng %N 4 %1 https://www.ncbi.nlm.nih.gov/pubmed/32602732?dopt=Abstract %R 10.1161/CIRCGEN.119.002680 %0 Journal Article %J Nat Neurosci %D 2019 %T A Bayesian framework that integrates multi-omics data and gene networks predicts risk genes from schizophrenia GWAS data. %A Wang, Quan %A Chen, Rui %A Cheng, Feixiong %A Wei, Qiang %A Ji, Ying %A Yang, Hai %A Zhong, Xue %A Tao, Ran %A Wen, Zhexing %A Sutcliffe, James S %A Liu, Chunyu %A Cook, Edwin H %A Cox, Nancy J %A Li, Bingshan %K Animals %K Bayes Theorem %K Disease Models, Animal %K Gene Regulatory Networks %K Genetic Predisposition to Disease %K Genome-Wide Association Study %K Genomics %K Humans %K Mice %K Risk Factors %K Schizophrenia %X

Genome-wide association studies (GWAS) have identified more than 100 schizophrenia (SCZ)-associated loci, but using these findings to illuminate disease biology remains a challenge. Here we present integrative risk gene selector (iRIGS), a Bayesian framework that integrates multi-omics data and gene networks to infer risk genes in GWAS loci. By applying iRIGS to SCZ GWAS data, we predicted a set of high-confidence risk genes, most of which are not the nearest genes to the GWAS index variants. High-confidence risk genes account for a significantly enriched heritability, as estimated by stratified linkage disequilibrium score regression. Moreover, high-confidence risk genes are predominantly expressed in brain tissues, especially prenatally, and are enriched for targets of approved drugs, suggesting opportunities to reposition existing drugs for SCZ. Thus, iRIGS can leverage accumulating functional genomics and GWAS data to advance our understanding of SCZ etiology and potential therapeutics.

%B Nat Neurosci %V 22 %P 691-699 %8 2019 05 %G eng %N 5 %1 https://www.ncbi.nlm.nih.gov/pubmed/30988527?dopt=Abstract %R 10.1038/s41593-019-0382-7