Publications
Adjusting for Principal Components of Molecular Phenotypes Induces Replicating False Positives. Genetics 211, 1179-1189 (2019).
Allelic Heterogeneity at the CRP Locus Identified by Whole-Genome Sequencing in Multi-ancestry Cohorts. Am J Hum Genet 106, 112-120 (2020).
Assessing Digital Phenotyping to Enhance Genetic Studies of Human Diseases. Am J Hum Genet 106, 611-622 (2020).
A Bayesian framework that integrates multi-omics data and gene networks predicts risk genes from schizophrenia GWAS data. Nat Neurosci 22, 691-699 (2019).
Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans. Genome Med 9, 98 (2017).
A common loss-of-function variant is associated with lower vitamin B concentration in African Americans. Blood 131, 2859-2863 (2018).
Comprehensive cell type decomposition of circulating cell-free DNA with CelFiE. Nat Commun 12, 2717 (2021).
Covariate selection for association screening in multiphenotype genetic studies. Nat Genet 49, 1789-1795 (2017).
A cross-population atlas of genetic associations for 220 human phenotypes. Nat Genet 53, 1415-1424 (2021).
On the cross-population generalizability of gene expression prediction models. PLoS Genet 16, e1008927 (2020).
Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 186, 753-761 (2017).
Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet 50, 693-698 (2018).
Disentangling selection on genetically correlated polygenic traits via whole-genome genealogies. Am J Hum Genet 108, 219-239 (2021).
The Effects of Migration and Assortative Mating on Admixture Linkage Disequilibrium. Genetics 205, 375-383 (2017).
Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies. Am J Hum Genet 104, 260-274 (2019).
Electronic health record phenotypes associated with genetically regulated expression of CFTR and application to cystic fibrosis. Genet Med 22, 1191-1200 (2020).
Elevated Polygenic Burden for Autism Spectrum Disorder Is Associated With the Broad Autism Phenotype in Mothers of Individuals With Autism Spectrum Disorder. Biol Psychiatry 89, 476-485 (2021).
Epidemiology of Functional Seizures Among Adults Treated at a University Hospital. JAMA Netw Open 3, e2027920 (2020).
Error-prone bypass of DNA lesions during lagging-strand replication is a common source of germline and cancer mutations. Nat Genet 51, 36-41 (2019).
Estimating the selective effects of heterozygous protein-truncating variants from human exome data. Nat Genet 49, 806-810 (2017).
Evidence for secondary-variant genetic burden and non-random distribution across biological modules in a recessive ciliopathy. Nat Genet 52, 1145-1150 (2020).
A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank. PLoS Genet 16, e1009141 (2020).
FasTag: Automatic text classification of unstructured medical narratives. PLoS One 15, e0234647 (2020).
Fine-mapping and functional studies highlight potential causal variants for rheumatoid arthritis and type 1 diabetes. Nat Genet 50, 1366-1374 (2018).
Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations. Nat Genet 50, 1600-1607 (2018).
Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects. Nat Commun 9, 4038 (2018).
Functional genomics of stromal cells in chronic inflammatory diseases. Curr Opin Rheumatol 30, 65-71 (2018).
Genetic diagnoses in epilepsy: The impact of dynamic exome analysis in a pediatric cohort. Epilepsia 61, 249-258 (2020).
Genetic diversity in populations across Latin America: implications for population and medical genetic studies. Curr Opin Genet Dev 53, 98-104 (2018).
Genetic identification of a common collagen disease in puerto ricans via identity-by-descent mapping in a health system. Elife 6, (2017).
A genetic stochastic process model for genome-wide joint analysis of biomarker dynamics and disease susceptibility with longitudinal data. Genet Epidemiol 41, 620-635 (2017).
Graphical analysis for phenome-wide causal discovery in genotyped population-scale biobanks. Nat Commun 12, 350 (2021).
Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 100, 635-649 (2017).
Identification of rare and common regulatory variants in pluripotent cells using population-scale transcriptomics. Nat Genet 53, 313-321 (2021).
Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat Med 25, 911-919 (2019).
Identifying causal variants and genes using functional genomics in specialized cell types and contexts. Hum Genet 139, 95-102 (2020).
Improved methods for multi-trait fine mapping of pleiotropic risk loci. Bioinformatics 33, 248-255 (2017).
Improving the trans-ancestry portability of polygenic risk scores by prioritizing variants in predicted cell-type-specific regulatory elements. Nat Genet 52, 1346-1354 (2020).
Imputation-Aware Tag SNP Selection To Improve Power for Large-Scale, Multi-ethnic Association Studies. G3 (Bethesda) 8, 3255-3267 (2018).
Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions. Am J Epidemiol 186, 771-777 (2017).
Lessons Learned From Past Gene-Environment Interaction Successes. Am J Epidemiol 186, 778-786 (2017).
Leveraging blood and tissue CD4+ T cell heterogeneity at the single cell level to identify mechanisms of disease in rheumatoid arthritis. Curr Opin Immunol 49, 27-36 (2017).