Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions

CLU, PICALM and CR1 were identified as genetic risk factors for late onset Alzheimer’s disease (AD) in two large genome wide association studies (GWAS) published in 2009, but the variants that convey this alteration in disease risk, and how the genes relate to AD pathology is yet to be discovered. A...

Full description

Bibliographic Details
Main Authors: Lord, Jenny, Turton, James, Medway, Christopher, Shi, Hui, Brown, Kristelle, Lowe, James, Mann, David, Pickering-Brown, Stuart, Kalsheker, Noor, Passmore, Peter, Morgan, Kevin
Format: Article
Published: e-Century Publishing 2012
Online Access:https://eprints.nottingham.ac.uk/2696/
_version_ 1848790852140269568
author Lord, Jenny
Turton, James
Medway, Christopher
Shi, Hui
Brown, Kristelle
Lowe, James
Mann, David
Pickering-Brown, Stuart
Kalsheker, Noor
Passmore, Peter
Morgan, Kevin
author_facet Lord, Jenny
Turton, James
Medway, Christopher
Shi, Hui
Brown, Kristelle
Lowe, James
Mann, David
Pickering-Brown, Stuart
Kalsheker, Noor
Passmore, Peter
Morgan, Kevin
author_sort Lord, Jenny
building Nottingham Research Data Repository
collection Online Access
description CLU, PICALM and CR1 were identified as genetic risk factors for late onset Alzheimer’s disease (AD) in two large genome wide association studies (GWAS) published in 2009, but the variants that convey this alteration in disease risk, and how the genes relate to AD pathology is yet to be discovered. A next generation sequencing (NGS) project was conducted targeting CLU, CR1 and PICALM, in 96 AD samples (8 pools of 12), in an attempt to discover rare variants within these AD associated genes. Inclusion of repetitive regions in the design of the SureSelect capture lead to significant issues in alignment of the data, leading to poor specificity and a lower than expected depth of coverage. A strong positive correlation (0.964, p<0.001) was seen between NGS and 1000 genome project frequency estimates. Of the ~170 “novel” variants detected in the genes, seven SNPs, all of which were present in multiple sample pools, were selected for validation by Sanger sequencing. Two SNPs were successfully validated by this method, and shown to be genuine variants, while five failed validation. These spurious SNP calls occurred as a result of the presence of small indels and mononucleotide repeats, indicating such features should be regarded with caution, and validation via an independent method is important for NGS variant calls.
first_indexed 2025-11-14T18:19:12Z
format Article
id nottingham-2696
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T18:19:12Z
publishDate 2012
publisher e-Century Publishing
recordtype eprints
repository_type Digital Repository
spelling nottingham-26962024-08-15T15:13:56Z https://eprints.nottingham.ac.uk/2696/ Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions Lord, Jenny Turton, James Medway, Christopher Shi, Hui Brown, Kristelle Lowe, James Mann, David Pickering-Brown, Stuart Kalsheker, Noor Passmore, Peter Morgan, Kevin CLU, PICALM and CR1 were identified as genetic risk factors for late onset Alzheimer’s disease (AD) in two large genome wide association studies (GWAS) published in 2009, but the variants that convey this alteration in disease risk, and how the genes relate to AD pathology is yet to be discovered. A next generation sequencing (NGS) project was conducted targeting CLU, CR1 and PICALM, in 96 AD samples (8 pools of 12), in an attempt to discover rare variants within these AD associated genes. Inclusion of repetitive regions in the design of the SureSelect capture lead to significant issues in alignment of the data, leading to poor specificity and a lower than expected depth of coverage. A strong positive correlation (0.964, p<0.001) was seen between NGS and 1000 genome project frequency estimates. Of the ~170 “novel” variants detected in the genes, seven SNPs, all of which were present in multiple sample pools, were selected for validation by Sanger sequencing. Two SNPs were successfully validated by this method, and shown to be genuine variants, while five failed validation. These spurious SNP calls occurred as a result of the presence of small indels and mononucleotide repeats, indicating such features should be regarded with caution, and validation via an independent method is important for NGS variant calls. e-Century Publishing 2012-11-15 Article PeerReviewed Lord, Jenny, Turton, James, Medway, Christopher, Shi, Hui, Brown, Kristelle, Lowe, James, Mann, David, Pickering-Brown, Stuart, Kalsheker, Noor, Passmore, Peter and Morgan, Kevin (2012) Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions. International Journal of Molecular Epidemiology and Genetics, 3 (4). pp. 262-275. ISSN 1948-1756 http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3508540/ PMCID: PMC3508540 PMCID: PMC3508540
spellingShingle Lord, Jenny
Turton, James
Medway, Christopher
Shi, Hui
Brown, Kristelle
Lowe, James
Mann, David
Pickering-Brown, Stuart
Kalsheker, Noor
Passmore, Peter
Morgan, Kevin
Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions
title Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions
title_full Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions
title_fullStr Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions
title_full_unstemmed Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions
title_short Next generation sequencing of CLU, PICALM and CR1: pitfalls and potential solutions
title_sort next generation sequencing of clu, picalm and cr1: pitfalls and potential solutions
url https://eprints.nottingham.ac.uk/2696/
https://eprints.nottingham.ac.uk/2696/
https://eprints.nottingham.ac.uk/2696/