Publications
Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study. Nat Commun 9, 1612 (2018).
Lung Function in African American Children with Asthma Is Associated with Novel Regulatory Variants of the KIT Ligand and Gene-By-Air-Pollution Interaction. Genetics 215, 869-886 (2020).
Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat Genet 49, 1421-1427 (2017).
Limited statistical evidence for shared genetic effects of eQTLs and autoimmune-disease-associated loci in three major immune-cell types. Nat Genet 49, 600-605 (2017).
Leveraging blood and tissue CD4+ T cell heterogeneity at the single cell level to identify mechanisms of disease in rheumatoid arthritis. Curr Opin Immunol 49, 27-36 (2017).
Lessons Learned From Past Gene-Environment Interaction Successes. Am J Epidemiol 186, 778-786 (2017).
Integrative genomic analysis in African American children with asthma finds three novel loci associated with lung function. Genet Epidemiol 45, 190-208 (2021).
Integration of multiomic annotation data to prioritize and characterize inflammation and immune-related risk variants in squamous cell lung cancer. Genet Epidemiol 45, 99-114 (2021).
Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions. Am J Epidemiol 186, 771-777 (2017).
Incorporating European GWAS findings improve polygenic risk prediction accuracy of breast cancer among East Asians. Genet Epidemiol (2021). doi:10.1002/gepi.22382
Imputation-Aware Tag SNP Selection To Improve Power for Large-Scale, Multi-ethnic Association Studies. G3 (Bethesda) 8, 3255-3267 (2018).
Improving the trans-ancestry portability of polygenic risk scores by prioritizing variants in predicted cell-type-specific regulatory elements. Nat Genet 52, 1346-1354 (2020).
Improved methods for multi-trait fine mapping of pleiotropic risk loci. Bioinformatics 33, 248-255 (2017).
Impact of admixture and ancestry on eQTL analysis and GWAS colocalization in GTEx. Genome Biol 21, 233 (2020).
Identifying causal variants and genes using functional genomics in specialized cell types and contexts. Hum Genet 139, 95-102 (2020).
Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat Med 25, 911-919 (2019).
Identification of rare and common regulatory variants in pluripotent cells using population-scale transcriptomics. Nat Genet 53, 313-321 (2021).
Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 107, 788-789 (2020).
Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 100, 635-649 (2017).
GWAS of three molecular traits highlights core genes and pathways alongside a highly polygenic background. Elife 10, (2021).
GRIK5 Genetically Regulated Expression Associated with Eye and Vascular Phenomes: Discovery through Iteration among Biobanks, Electronic Health Records, and Zebrafish. Am J Hum Genet 104, 503-519 (2019).
Graphical analysis for phenome-wide causal discovery in genotyped population-scale biobanks. Nat Commun 12, 350 (2021).
Global Biobank Engine: enabling genotype-phenotype browsing for biobank summary statistics. Bioinformatics 35, 2495-2497 (2019).
Genome-wide association study reveals a novel locus for asthma with severe exacerbations in diverse populations. Pediatr Allergy Immunol 32, 106-115 (2021).
Genome-Wide Analysis Reveals Mucociliary Remodeling of the Nasal Airway Epithelium Induced by Urban PM. Am J Respir Cell Mol Biol 63, 172-184 (2020).
Genome sequencing analysis identifies Epstein-Barr virus subtypes associated with high risk of nasopharyngeal carcinoma. Nat Genet 51, 1131-1136 (2019).
A genetic stochastic process model for genome-wide joint analysis of biomarker dynamics and disease susceptibility with longitudinal data. Genet Epidemiol 41, 620-635 (2017).
Genetic identification of a common collagen disease in puerto ricans via identity-by-descent mapping in a health system. Elife 6, (2017).
Genetic diversity in populations across Latin America: implications for population and medical genetic studies. Curr Opin Genet Dev 53, 98-104 (2018).
Genetic diagnoses in epilepsy: The impact of dynamic exome analysis in a pediatric cohort. Epilepsia 61, 249-258 (2020).
The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies. J Am Stat Assoc 112, 64-76 (2017).
Functional genomics of stromal cells in chronic inflammatory diseases. Curr Opin Rheumatol 30, 65-71 (2018).
Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects. Nat Commun 9, 4038 (2018).
Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations. Nat Genet 50, 1600-1607 (2018).
Fine-mapping and functional studies highlight potential causal variants for rheumatoid arthritis and type 1 diabetes. Nat Genet 50, 1366-1374 (2018).
FasTag: Automatic text classification of unstructured medical narratives. PLoS One 15, e0234647 (2020).
Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank. Biostatistics (2020). doi:10.1093/biostatistics/kxaa038
A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank. PLoS Genet 16, e1009141 (2020).
Extreme Polygenicity of Complex Traits Is Explained by Negative Selection. Am J Hum Genet 105, 456-476 (2019).
Exome sequencing reveals a high prevalence of BRCA1 and BRCA2 founder variants in a diverse population-based biobank. Genome Med 12, 2 (2019).
An evolutionary compass for detecting signals of polygenic selection and mutational bias. Evol Lett 3, 69-79 (2019).
Evidence for secondary-variant genetic burden and non-random distribution across biological modules in a recessive ciliopathy. Nat Genet 52, 1145-1150 (2020).
Estimating the selective effects of heterozygous protein-truncating variants from human exome data. Nat Genet 49, 806-810 (2017).
Error-prone bypass of DNA lesions during lagging-strand replication is a common source of germline and cancer mutations. Nat Genet 51, 36-41 (2019).
Epidemiology of Functional Seizures Among Adults Treated at a University Hospital. JAMA Netw Open 3, e2027920 (2020).