Beyond Missing Heritability: Prediction of Complex Traits

Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses o...

Full description

Bibliographic Details
Main Authors: Makowsky, Robert, Pajewski, Nicholas M., Klimentidis, Yann C., Vazquez, Ana I., Duarte, Christine W., Allison, David B., de los Campos, Gustavo
Format: Online
Language:English
Published: Public Library of Science 2011
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3084207/
id pubmed-3084207
recordtype oai_dc
spelling pubmed-30842072011-05-06 Beyond Missing Heritability: Prediction of Complex Traits Makowsky, Robert Pajewski, Nicholas M. Klimentidis, Yann C. Vazquez, Ana I. Duarte, Christine W. Allison, David B. de los Campos, Gustavo Research Article Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the “missing heritability” for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h2 up to 0.83, R2 up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R2 values vastly exceed what has been previously reported using a reduced number of pre-selected markers (<0.10), given the heritability of the trait (∼0.80), substantial room for improvement remains. Public Library of Science 2011-04-28 /pmc/articles/PMC3084207/ /pubmed/21552331 http://dx.doi.org/10.1371/journal.pgen.1002051 Text en Makowsky et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
repository_type Open Access Journal
institution_category Foreign Institution
institution US National Center for Biotechnology Information
building NCBI PubMed
collection Online Access
language English
format Online
author Makowsky, Robert
Pajewski, Nicholas M.
Klimentidis, Yann C.
Vazquez, Ana I.
Duarte, Christine W.
Allison, David B.
de los Campos, Gustavo
spellingShingle Makowsky, Robert
Pajewski, Nicholas M.
Klimentidis, Yann C.
Vazquez, Ana I.
Duarte, Christine W.
Allison, David B.
de los Campos, Gustavo
Beyond Missing Heritability: Prediction of Complex Traits
author_facet Makowsky, Robert
Pajewski, Nicholas M.
Klimentidis, Yann C.
Vazquez, Ana I.
Duarte, Christine W.
Allison, David B.
de los Campos, Gustavo
author_sort Makowsky, Robert
title Beyond Missing Heritability: Prediction of Complex Traits
title_short Beyond Missing Heritability: Prediction of Complex Traits
title_full Beyond Missing Heritability: Prediction of Complex Traits
title_fullStr Beyond Missing Heritability: Prediction of Complex Traits
title_full_unstemmed Beyond Missing Heritability: Prediction of Complex Traits
title_sort beyond missing heritability: prediction of complex traits
description Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the “missing heritability” for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h2 up to 0.83, R2 up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R2 values vastly exceed what has been previously reported using a reduced number of pre-selected markers (<0.10), given the heritability of the trait (∼0.80), substantial room for improvement remains.
publisher Public Library of Science
publishDate 2011
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3084207/
_version_ 1611450948046028800