Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods
In this study, the effects of (a) the minor allele frequency of the single nucleotide variant (SNV), (b) the degree of departure from normality of the trait, and (c) the position of the SNVs on type I error rates were investigated in the Genetic Analysis Workshop (GAW) 19 whole exome sequence data....
Main Authors: | , , , , , |
---|---|
Format: | Online |
Language: | English |
Published: |
BioMed Central
2016
|
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133479/ |
id |
pubmed-5133479 |
---|---|
recordtype |
oai_dc |
spelling |
pubmed-51334792016-12-15 Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods Schwantes-An, Tae-Hwi Sung, Heejong Sabourin, Jeremy A. Justice, Cristina M. Sorant, Alexa J. M. Wilson, Alexander F. Proceedings In this study, the effects of (a) the minor allele frequency of the single nucleotide variant (SNV), (b) the degree of departure from normality of the trait, and (c) the position of the SNVs on type I error rates were investigated in the Genetic Analysis Workshop (GAW) 19 whole exome sequence data. To test the distribution of the type I error rate, 5 simulated traits were considered: standard normal and gamma distributed traits; 2 transformed versions of the gamma trait (log10 and rank-based inverse normal transformations); and trait Q1 provided by GAW 19. Each trait was tested with 313,340 SNVs. Tests of association were performed with simple linear regression and average type I error rates were determined for minor allele frequency classes. Rare SNVs (minor allele frequency < 0.05) showed inflated type I error rates for non–normally distributed traits that increased as the minor allele frequency decreased. The inflation of average type I error rates increased as the significance threshold decreased. Normally distributed traits did not show inflated type I error rates with respect to the minor allele frequency for rare SNVs. There was no consistent effect of transformation on the uniformity of the distribution of the location of SNVs with a type I error. BioMed Central 2016-10-18 /pmc/articles/PMC5133479/ /pubmed/27980666 http://dx.doi.org/10.1186/s12919-016-0060-7 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
repository_type |
Open Access Journal |
institution_category |
Foreign Institution |
institution |
US National Center for Biotechnology Information |
building |
NCBI PubMed |
collection |
Online Access |
language |
English |
format |
Online |
author |
Schwantes-An, Tae-Hwi Sung, Heejong Sabourin, Jeremy A. Justice, Cristina M. Sorant, Alexa J. M. Wilson, Alexander F. |
spellingShingle |
Schwantes-An, Tae-Hwi Sung, Heejong Sabourin, Jeremy A. Justice, Cristina M. Sorant, Alexa J. M. Wilson, Alexander F. Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
author_facet |
Schwantes-An, Tae-Hwi Sung, Heejong Sabourin, Jeremy A. Justice, Cristina M. Sorant, Alexa J. M. Wilson, Alexander F. |
author_sort |
Schwantes-An, Tae-Hwi |
title |
Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
title_short |
Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
title_full |
Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
title_fullStr |
Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
title_full_unstemmed |
Type I error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
title_sort |
type i error rates of rare single nucleotide variants are inflated in tests of association with non–normally distributed traits using simple linear regression methods |
description |
In this study, the effects of (a) the minor allele frequency of the single nucleotide variant (SNV), (b) the degree of departure from normality of the trait, and (c) the position of the SNVs on type I error rates were investigated in the Genetic Analysis Workshop (GAW) 19 whole exome sequence data. To test the distribution of the type I error rate, 5 simulated traits were considered: standard normal and gamma distributed traits; 2 transformed versions of the gamma trait (log10 and rank-based inverse normal transformations); and trait Q1 provided by GAW 19. Each trait was tested with 313,340 SNVs. Tests of association were performed with simple linear regression and average type I error rates were determined for minor allele frequency classes. Rare SNVs (minor allele frequency < 0.05) showed inflated type I error rates for non–normally distributed traits that increased as the minor allele frequency decreased. The inflation of average type I error rates increased as the significance threshold decreased. Normally distributed traits did not show inflated type I error rates with respect to the minor allele frequency for rare SNVs. There was no consistent effect of transformation on the uniformity of the distribution of the location of SNVs with a type I error. |
publisher |
BioMed Central |
publishDate |
2016 |
url |
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133479/ |
_version_ |
1613752464418275328 |