Filtering duplicate reads from 454 pyrosequencing data
Motivation: Throughout the recent years, 454 pyrosequencing has emerged as an efficient alternative to traditional Sanger sequencing and is widely used in both de novo whole-genome sequencing and metagenomics. Especially the latter application is extremely sensitive to sequencing errors and artifici...
Main Authors: | , , , |
---|---|
Format: | Online |
Language: | English |
Published: |
Oxford University Press
2013
|
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3605598/ |
id |
pubmed-3605598 |
---|---|
recordtype |
oai_dc |
spelling |
pubmed-36055982013-03-22 Filtering duplicate reads from 454 pyrosequencing data Balzer, Susanne Malde, Ketil Grohme, Markus A. Jonassen, Inge Original Papers Motivation: Throughout the recent years, 454 pyrosequencing has emerged as an efficient alternative to traditional Sanger sequencing and is widely used in both de novo whole-genome sequencing and metagenomics. Especially the latter application is extremely sensitive to sequencing errors and artificially duplicated reads. Both are common in 454 pyrosequencing and can create a strong bias in the estimation of diversity and composition of a sample. To date, there are several tools that aim to remove both sequencing noise and duplicates. Nevertheless, duplicate removal is often based on nucleotide sequences rather than on the underlying flow values, which contain additional information. Oxford University Press 2013-04-01 2013-02-01 /pmc/articles/PMC3605598/ /pubmed/23376350 http://dx.doi.org/10.1093/bioinformatics/btt047 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
repository_type |
Open Access Journal |
institution_category |
Foreign Institution |
institution |
US National Center for Biotechnology Information |
building |
NCBI PubMed |
collection |
Online Access |
language |
English |
format |
Online |
author |
Balzer, Susanne Malde, Ketil Grohme, Markus A. Jonassen, Inge |
spellingShingle |
Balzer, Susanne Malde, Ketil Grohme, Markus A. Jonassen, Inge Filtering duplicate reads from 454 pyrosequencing data |
author_facet |
Balzer, Susanne Malde, Ketil Grohme, Markus A. Jonassen, Inge |
author_sort |
Balzer, Susanne |
title |
Filtering duplicate reads from 454 pyrosequencing data |
title_short |
Filtering duplicate reads from 454 pyrosequencing data |
title_full |
Filtering duplicate reads from 454 pyrosequencing data |
title_fullStr |
Filtering duplicate reads from 454 pyrosequencing data |
title_full_unstemmed |
Filtering duplicate reads from 454 pyrosequencing data |
title_sort |
filtering duplicate reads from 454 pyrosequencing data |
description |
Motivation: Throughout the recent years, 454 pyrosequencing has emerged as an efficient alternative to traditional Sanger sequencing and is widely used in both de novo whole-genome sequencing and metagenomics. Especially the latter application is extremely sensitive to sequencing errors and artificially duplicated reads. Both are common in 454 pyrosequencing and can create a strong bias in the estimation of diversity and composition of a sample. To date, there are several tools that aim to remove both sequencing noise and duplicates. Nevertheless, duplicate removal is often based on nucleotide sequences rather than on the underlying flow values, which contain additional information. |
publisher |
Oxford University Press |
publishDate |
2013 |
url |
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3605598/ |
_version_ |
1611964691422117888 |