CD-HIT: accelerated for clustering the next-generation sequencing data

Summary: CD-HIT is a widely used program for clustering biological sequences to reduce sequence redundancy and improve the performance of other sequence analyses. In response to the rapid increase in the amount of sequencing data produced by the next-generation sequencing technologies, we have devel...

Full description

Bibliographic Details
Main Authors: Fu, Limin, Niu, Beifang, Zhu, Zhengwei, Wu, Sitao, Li, Weizhong
Format: Online
Language:English
Published: Oxford University Press 2012
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3516142/