Publications
Adjusting for Principal Components of Molecular Phenotypes Induces Replicating False Positives. Genetics 211, 1179-1189 (2019).
Allelic Heterogeneity at the CRP Locus Identified by Whole-Genome Sequencing in Multi-ancestry Cohorts. Am J Hum Genet 106, 112-120 (2020).
Allelic Heterogeneity at the CRP Locus Identified by Whole-Genome Sequencing in Multi-ancestry Cohorts. Am J Hum Genet 106, 112-120 (2020).
A Bayesian framework that integrates multi-omics data and gene networks predicts risk genes from schizophrenia GWAS data. Nat Neurosci 22, 691-699 (2019).
A common loss-of-function variant is associated with lower vitamin B concentration in African Americans. Blood 131, 2859-2863 (2018).
Comprehensive cell type decomposition of circulating cell-free DNA with CelFiE. Nat Commun 12, 2717 (2021).
Covariate selection for association screening in multiphenotype genetic studies. Nat Genet 49, 1789-1795 (2017).
On the cross-population generalizability of gene expression prediction models. PLoS Genet 16, e1008927 (2020).
De novo pattern discovery enables robust assessment of functional consequences of non-coding variants. Bioinformatics 35, 1453-1460 (2019).
De novo pattern discovery enables robust assessment of functional consequences of non-coding variants. Bioinformatics 35, 1453-1460 (2019).
Diagnostic Algorithms to Study Post-Concussion Syndrome Using Electronic Health Records: Validating a Method to Capture an Important Patient Population. J Neurotrauma 36, 2167-2177 (2019).
Differential NOVA2-Mediated Splicing in Excitatory and Inhibitory Neurons Regulates Cortical Development and Cerebellar Function. Neuron 101, 707-720.e5 (2019).
Disentangling selection on genetically correlated polygenic traits via whole-genome genealogies. Am J Hum Genet 108, 219-239 (2021).
The Effects of Migration and Assortative Mating on Admixture Linkage Disequilibrium. Genetics 205, 375-383 (2017).
Efficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models. J Comput Biol 27, 599-612 (2020).
Efficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models. J Comput Biol 27, 599-612 (2020).
Electronic health record phenotypes associated with genetically regulated expression of CFTR and application to cystic fibrosis. Genet Med 22, 1191-1200 (2020).
Electronic health record phenotypes associated with genetically regulated expression of CFTR and application to cystic fibrosis. Genet Med 22, 1191-1200 (2020).
An evolutionary compass for detecting signals of polygenic selection and mutational bias. Evol Lett 3, 69-79 (2019).
FasTag: Automatic text classification of unstructured medical narratives. PLoS One 15, e0234647 (2020).
A genetic stochastic process model for genome-wide joint analysis of biomarker dynamics and disease susceptibility with longitudinal data. Genet Epidemiol 41, 620-635 (2017).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
Identification of rare and common regulatory variants in pluripotent cells using population-scale transcriptomics. Nat Genet 53, 313-321 (2021).
Identification of rare and common regulatory variants in pluripotent cells using population-scale transcriptomics. Nat Genet 53, 313-321 (2021).
Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat Med 25, 911-919 (2019).
Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat Med 25, 911-919 (2019).
Incorporating European GWAS findings improve polygenic risk prediction accuracy of breast cancer among East Asians. Genet Epidemiol (2021). doi:10.1002/gepi.22382
Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science 359, 1233-1239 (2018).
Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science 359, 1233-1239 (2018).
RAFFI: Accurate and fast familial relationship inference in large scale biobank studies using RaPID. PLoS Genet 17, e1009315 (2021).
RAFFI: Accurate and fast familial relationship inference in large scale biobank studies using RaPID. PLoS Genet 17, e1009315 (2021).
A Robust Method Uncovers Significant Context-Specific Heritability in Diverse Complex Traits. Am J Hum Genet 106, 71-91 (2020).