Publications
A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank. PLoS Genet 16, e1009141 (2020).
Race, socioeconomic deprivation, and hospitalization for COVID-19 in English participants of a national biobank. Int J Equity Health 19, 114 (2020).
Extreme Polygenicity of Complex Traits Is Explained by Negative Selection. Am J Hum Genet 105, 456-476 (2019).
Elevated Polygenic Burden for Autism Spectrum Disorder Is Associated With the Broad Autism Phenotype in Mothers of Individuals With Autism Spectrum Disorder. Biol Psychiatry 89, 476-485 (2021).
RAFFI: Accurate and fast familial relationship inference in large scale biobank studies using RaPID. PLoS Genet 17, e1009315 (2021).
Genome-Wide Analysis Reveals Mucociliary Remodeling of the Nasal Airway Epithelium Induced by Urban PM. Am J Respir Cell Mol Biol 63, 172-184 (2020).
Whole-exome sequencing in adult patients with developmental and epileptic encephalopathy: It is never too late. Clin Genet 98, 477-485 (2020).
Efficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models. J Comput Biol 27, 599-612 (2020).
Global Biobank Engine: enabling genotype-phenotype browsing for biobank summary statistics. Bioinformatics 35, 2495-2497 (2019).
Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies. Biometrics 76, 1262-1272 (2020).
Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 186, 753-761 (2017).
Drug-Resistant Juvenile Myoclonic Epilepsy: Misdiagnosis of Progressive Myoclonus Epilepsy. Front Neurol 10, 946 (2019).
Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 107, 788-789 (2020).
Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 100, 635-649 (2017).
A multi-task convolutional deep neural network for variant calling in single molecule sequencing. Nat Commun 10, 998 (2019).
Multiple phenotype association tests using summary statistics in genome-wide association studies. Biometrics 74, 165-175 (2018).
ACAT: A Fast and Powerful p Value Combination Method for Rare-Variant Analysis in Sequencing Studies. Am J Hum Genet 104, 410-421 (2019).
A Transcriptome-Wide Association Study Identifies Candidate Susceptibility Genes for Pancreatic Cancer Risk. Cancer Res 80, 4346-4354 (2020).
Identifying causal variants and genes using functional genomics in specialized cell types and contexts. Hum Genet 139, 95-102 (2020).
Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank. Biostatistics (2020). doi:10.1093/biostatistics/kxaa038
Survival Analysis on Rare Events Using Group-Regularized Multi-Response Cox Regression. Bioinformatics (2021). doi:10.1093/bioinformatics/btab095
A synthetic-diploid benchmark for accurate variant-calling evaluation. Nat Methods 15, 595-597 (2018).
Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale. Nat Genet 52, 969-983 (2020).
VEXOR: an integrative environment for prioritization of functional variants in fine-mapping analysis. Bioinformatics 33, 1389-1391 (2017).
Evidence for secondary-variant genetic burden and non-random distribution across biological modules in a recessive ciliopathy. Nat Genet 52, 1145-1150 (2020).
Improved methods for multi-trait fine mapping of pleiotropic risk loci. Bioinformatics 33, 248-255 (2017).
On the cross-population generalizability of gene expression prediction models. PLoS Genet 16, e1008927 (2020).
Incorporating European GWAS findings improve polygenic risk prediction accuracy of breast cancer among East Asians. Genet Epidemiol (2021). doi:10.1002/gepi.22382
A common loss-of-function variant is associated with lower vitamin B concentration in African Americans. Blood 131, 2859-2863 (2018).
Whole exome sequencing analyses reveal gene-microbiota interactions in the context of IBD. Gut 70, 285-296 (2021).
A deoxyribonuclease 1-like 3 genetic variant associates with asthma exacerbations. J Allergy Clin Immunol 147, 1095-1097.e10 (2021).
Genome-wide association study reveals a novel locus for asthma with severe exacerbations in diverse populations. Pediatr Allergy Immunol 32, 106-115 (2021).
A genetic stochastic process model for genome-wide joint analysis of biomarker dynamics and disease susceptibility with longitudinal data. Genet Epidemiol 41, 620-635 (2017).
Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans. Genome Med 9, 98 (2017).
Epidemiology of Functional Seizures Among Adults Treated at a University Hospital. JAMA Netw Open 3, e2027920 (2020).
Integrative genomic analysis in African American children with asthma finds three novel loci associated with lung function. Genet Epidemiol 45, 190-208 (2021).
Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations. Nat Genet 50, 1600-1607 (2018).
Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat Genet 49, 1421-1427 (2017).
Impact of admixture and ancestry on eQTL analysis and GWAS colocalization in GTEx. Genome Biol 21, 233 (2020).
Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat Med 25, 911-919 (2019).
Leveraging blood and tissue CD4+ T cell heterogeneity at the single cell level to identify mechanisms of disease in rheumatoid arthritis. Curr Opin Immunol 49, 27-36 (2017).
Ultra-Rare Genetic Variation in the Epilepsies: A Whole-Exome Sequencing Study of 17,606 Individuals. Am J Hum Genet 105, 267-282 (2019).
Sub-genic intolerance, ClinVar, and the epilepsies: A whole-exome sequencing study of 29,165 individuals. Am J Hum Genet 108, 965-982 (2021).