Efficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models.

TitleEfficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models.
Publication TypeJournal Article
Year of Publication2020
AuthorsMefford, J, Park, D, Zheng, Z, Ko, A, Ala-Korpela, M, Laakso, M, Pajukanta, P, Yang, J, Witte, J, Zaitlen, N
JournalJ Comput Biol
Volume27
Issue4
Pagination599-612
Date Published2020 04
ISSN1557-8666
Abstract

Large-scale cohorts with combined genetic and phenotypic data, coupled with methodological advances, have produced increasingly accurate genetic predictors of complex human phenotypes called polygenic risk scores (PRSs). In addition to the potential translational impacts of identifying at-risk individuals, PRS are being utilized for a growing list of scientific applications, including causal inference, identifying pleiotropy and genetic correlation, and powerful gene-based and mixed-model association tests. Existing PRS approaches rely on external large-scale genetic cohorts that have also measured the phenotype of interest. They further require matching on ancestry and genotyping platform or imputation quality. In this work, we present a novel reference-free method to produce a PRS that does not rely on an external cohort. We show that naive implementations of reference-free PRS either result in substantial overfitting or prohibitive increases in computational time. We show that our algorithm avoids both of these issues and can produce informative in-sample PRSs over a single cohort without overfitting. We then demonstrate several novel applications of reference-free PRSs, including detection of pleiotropy across 246 metabolic traits and efficient mixed-model association testing.

DOI10.1089/cmb.2019.0325
Alternate JournalJ Comput Biol
PubMed ID32077750
PubMed Central IDPMC7185352
Grant ListK25 HL121295 / HL / NHLBI NIH HHS / United States
R01 CA227237 / CA / NCI NIH HHS / United States
R01 ES029929 / ES / NIEHS NIH HHS / United States
R56 MD013312 / MD / NIMHD NIH HHS / United States
U01 HG009080 / HG / NHGRI NIH HHS / United States
R01 HG006399 / HG / NHGRI NIH HHS / United States
U01 DK105561 / DK / NIDDK NIH HHS / United States
MC-UU-12013/1 / MR / Medical Research Council / United Kingdom