A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens

In most studies aimed at localizing footprints of past selection, outliers at tails of the empirical distribution of a given test statistic are assumed to reflect locus-specific selective forces. Significance cutoffs are subjectively determined, rather than being related to a clear set of hypotheses...

Full description

Bibliographic Details
Main Authors: Qanbari, Saber, Strom, Tim M., Haberer, Georg, Weigend, Steffen, Gheyas, Almas A., Turner, Frances, Burt, David W., Preisinger, Rudolf, Gianola, Daniel, Simianer, Henner
Format: Online
Language:English
Published: Public Library of Science 2012
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3510216/
id pubmed-3510216
recordtype oai_dc
spelling pubmed-35102162012-12-03 A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens Qanbari, Saber Strom, Tim M. Haberer, Georg Weigend, Steffen Gheyas, Almas A. Turner, Frances Burt, David W. Preisinger, Rudolf Gianola, Daniel Simianer, Henner Research Article In most studies aimed at localizing footprints of past selection, outliers at tails of the empirical distribution of a given test statistic are assumed to reflect locus-specific selective forces. Significance cutoffs are subjectively determined, rather than being related to a clear set of hypotheses. Here, we define an empirical p-value for the summary statistic by means of a permutation method that uses the observed SNP structure in the real data. To illustrate the methodology, we applied our approach to a panel of 2.9 million autosomal SNPs identified from re-sequencing a pool of 15 individuals from a brown egg layer line. We scanned the genome for local reductions in heterozygosity, suggestive of selective sweeps. We also employed a modified sliding window approach that accounts for gaps in the sequence and increases scanning resolution by moving the overlapping windows by steps of one SNP only, and suggest to call this a “creeping window” strategy. The approach confirmed selective sweeps in the region of previously described candidate genes, i.e. TSHR, PRL, PRLHR, INSR, LEPR, IGF1, and NRAMP1 when used as positive controls. The genome scan revealed 82 distinct regions with strong evidence of selection (genome-wide p-value<0.001), including genes known to be associated with eggshell structure and immune system such as CALB1 and GAL cluster, respectively. A substantial proportion of signals was found in poor gene content regions including the most extreme signal on chromosome 1. The observation of multiple signals in a highly selected layer line of chicken is consistent with the hypothesis that egg production is a complex trait controlled by many genes. Public Library of Science 2012-11-29 /pmc/articles/PMC3510216/ /pubmed/23209582 http://dx.doi.org/10.1371/journal.pone.0049525 Text en © 2012 Qanbari et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
repository_type Open Access Journal
institution_category Foreign Institution
institution US National Center for Biotechnology Information
building NCBI PubMed
collection Online Access
language English
format Online
author Qanbari, Saber
Strom, Tim M.
Haberer, Georg
Weigend, Steffen
Gheyas, Almas A.
Turner, Frances
Burt, David W.
Preisinger, Rudolf
Gianola, Daniel
Simianer, Henner
spellingShingle Qanbari, Saber
Strom, Tim M.
Haberer, Georg
Weigend, Steffen
Gheyas, Almas A.
Turner, Frances
Burt, David W.
Preisinger, Rudolf
Gianola, Daniel
Simianer, Henner
A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens
author_facet Qanbari, Saber
Strom, Tim M.
Haberer, Georg
Weigend, Steffen
Gheyas, Almas A.
Turner, Frances
Burt, David W.
Preisinger, Rudolf
Gianola, Daniel
Simianer, Henner
author_sort Qanbari, Saber
title A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens
title_short A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens
title_full A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens
title_fullStr A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens
title_full_unstemmed A High Resolution Genome-Wide Scan for Significant Selective Sweeps: An Application to Pooled Sequence Data in Laying Chickens
title_sort high resolution genome-wide scan for significant selective sweeps: an application to pooled sequence data in laying chickens
description In most studies aimed at localizing footprints of past selection, outliers at tails of the empirical distribution of a given test statistic are assumed to reflect locus-specific selective forces. Significance cutoffs are subjectively determined, rather than being related to a clear set of hypotheses. Here, we define an empirical p-value for the summary statistic by means of a permutation method that uses the observed SNP structure in the real data. To illustrate the methodology, we applied our approach to a panel of 2.9 million autosomal SNPs identified from re-sequencing a pool of 15 individuals from a brown egg layer line. We scanned the genome for local reductions in heterozygosity, suggestive of selective sweeps. We also employed a modified sliding window approach that accounts for gaps in the sequence and increases scanning resolution by moving the overlapping windows by steps of one SNP only, and suggest to call this a “creeping window” strategy. The approach confirmed selective sweeps in the region of previously described candidate genes, i.e. TSHR, PRL, PRLHR, INSR, LEPR, IGF1, and NRAMP1 when used as positive controls. The genome scan revealed 82 distinct regions with strong evidence of selection (genome-wide p-value<0.001), including genes known to be associated with eggshell structure and immune system such as CALB1 and GAL cluster, respectively. A substantial proportion of signals was found in poor gene content regions including the most extreme signal on chromosome 1. The observation of multiple signals in a highly selected layer line of chicken is consistent with the hypothesis that egg production is a complex trait controlled by many genes.
publisher Public Library of Science
publishDate 2012
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3510216/
_version_ 1611936609900429312