Publications
ACAT: A Fast and Powerful p Value Combination Method for Rare-Variant Analysis in Sequencing Studies. Am J Hum Genet 104, 410-421 (2019).
Adjusting for Principal Components of Molecular Phenotypes Induces Replicating False Positives. Genetics 211, 1179-1189 (2019).
Allelic Heterogeneity at the CRP Locus Identified by Whole-Genome Sequencing in Multi-ancestry Cohorts. Am J Hum Genet 106, 112-120 (2020).
Analysis in case-control sequencing association studies with different sequencing depths. Biostatistics 21, 577-593 (2020).
Applicability of the Mutation-Selection Balance Model to Population Genetics of Heterozygous Protein-Truncating Variants in Humans. Mol Biol Evol 36, 1701-1710 (2019).
Assessing Digital Phenotyping to Enhance Genetic Studies of Human Diseases. Am J Hum Genet 106, 611-622 (2020).
Association between Smoking History and Tumor Mutation Burden in Advanced Non-Small Cell Lung Cancer. Cancer Res 81, 2566-2573 (2021).
A Bayesian framework that integrates multi-omics data and gene networks predicts risk genes from schizophrenia GWAS data. Nat Neurosci 22, 691-699 (2019).
A Bootstrap Method for Goodness of Fit and Model Selection with a Single Observed Network. Sci Rep 9, 16674 (2019).
Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans. Genome Med 9, 98 (2017).
A common loss-of-function variant is associated with lower vitamin B concentration in African Americans. Blood 131, 2859-2863 (2018).
A common variant in PNPLA3 is associated with age at diagnosis of NAFLD in patients from a multi-ethnic biobank. J Hepatol 72, 1070-1081 (2020).
Components of genetic associations across 2,138 phenotypes in the UK Biobank highlight adipocyte biology. Nat Commun 10, 4064 (2019).
Comprehensive cell type decomposition of circulating cell-free DNA with CelFiE. Nat Commun 12, 2717 (2021).
Covariate selection for association screening in multiphenotype genetic studies. Nat Genet 49, 1789-1795 (2017).
A cross-population atlas of genetic associations for 220 human phenotypes. Nat Genet 53, 1415-1424 (2021).
On the cross-population generalizability of gene expression prediction models. PLoS Genet 16, e1008927 (2020).
Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 186, 753-761 (2017).
De novo pattern discovery enables robust assessment of functional consequences of non-coding variants. Bioinformatics 35, 1453-1460 (2019).
A deoxyribonuclease 1-like 3 genetic variant associates with asthma exacerbations. J Allergy Clin Immunol 147, 1095-1097.e10 (2021).
Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet 50, 693-698 (2018).
Diagnostic Algorithms to Study Post-Concussion Syndrome Using Electronic Health Records: Validating a Method to Capture an Important Patient Population. J Neurotrauma 36, 2167-2177 (2019).
Differential NOVA2-Mediated Splicing in Excitatory and Inhibitory Neurons Regulates Cortical Development and Cerebellar Function. Neuron 101, 707-720.e5 (2019).
Disentangling selection on genetically correlated polygenic traits via whole-genome genealogies. Am J Hum Genet 108, 219-239 (2021).
Drug-Resistant Juvenile Myoclonic Epilepsy: Misdiagnosis of Progressive Myoclonus Epilepsy. Front Neurol 10, 946 (2019).
The Effects of Migration and Assortative Mating on Admixture Linkage Disequilibrium. Genetics 205, 375-383 (2017).
Efficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models. J Comput Biol 27, 599-612 (2020).
Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies. Am J Hum Genet 104, 260-274 (2019).
Electronic health record phenotypes associated with genetically regulated expression of CFTR and application to cystic fibrosis. Genet Med 22, 1191-1200 (2020).
Elevated Polygenic Burden for Autism Spectrum Disorder Is Associated With the Broad Autism Phenotype in Mothers of Individuals With Autism Spectrum Disorder. Biol Psychiatry 89, 476-485 (2021).
Epidemiology of Functional Seizures Among Adults Treated at a University Hospital. JAMA Netw Open 3, e2027920 (2020).
Error-prone bypass of DNA lesions during lagging-strand replication is a common source of germline and cancer mutations. Nat Genet 51, 36-41 (2019).
Estimating the selective effects of heterozygous protein-truncating variants from human exome data. Nat Genet 49, 806-810 (2017).
Evidence for secondary-variant genetic burden and non-random distribution across biological modules in a recessive ciliopathy. Nat Genet 52, 1145-1150 (2020).
An evolutionary compass for detecting signals of polygenic selection and mutational bias. Evol Lett 3, 69-79 (2019).
Exome sequencing reveals a high prevalence of BRCA1 and BRCA2 founder variants in a diverse population-based biobank. Genome Med 12, 2 (2019).
Extreme Polygenicity of Complex Traits Is Explained by Negative Selection. Am J Hum Genet 105, 456-476 (2019).
A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank. PLoS Genet 16, e1009141 (2020).
Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank. Biostatistics (2020). doi:10.1093/biostatistics/kxaa038
FasTag: Automatic text classification of unstructured medical narratives. PLoS One 15, e0234647 (2020).
Fine-mapping and functional studies highlight potential causal variants for rheumatoid arthritis and type 1 diabetes. Nat Genet 50, 1366-1374 (2018).
Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations. Nat Genet 50, 1600-1607 (2018).
Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects. Nat Commun 9, 4038 (2018).
Functional genomics of stromal cells in chronic inflammatory diseases. Curr Opin Rheumatol 30, 65-71 (2018).
The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies. J Am Stat Assoc 112, 64-76 (2017).
Genetic diagnoses in epilepsy: The impact of dynamic exome analysis in a pediatric cohort. Epilepsia 61, 249-258 (2020).
Genetic diversity in populations across Latin America: implications for population and medical genetic studies. Curr Opin Genet Dev 53, 98-104 (2018).