Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data

Motivation: Because of the advantages of RNA sequencing (RNA-Seq) over microarrays, it is gaining widespread popularity for highly parallel gene expression analysis. For example, RNA-Seq is expected to be able to provide accurate identification and quantification of full-length splice forms. A numbe...

Full description

Bibliographic Details
Main Authors: Hayer, Katharina E., Pizarro, Angel, Lahens, Nicholas F., Hogenesch, John B., Grant, Gregory R.
Format: Online
Language:English
Published: Oxford University Press 2015
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4673975/
Description
Summary:Motivation: Because of the advantages of RNA sequencing (RNA-Seq) over microarrays, it is gaining widespread popularity for highly parallel gene expression analysis. For example, RNA-Seq is expected to be able to provide accurate identification and quantification of full-length splice forms. A number of informatics packages have been developed for this purpose, but short reads make it a difficult problem in principle. Sequencing error and polymorphisms add further complications. It has become necessary to perform studies to determine which algorithms perform best and which if any algorithms perform adequately. However, there is a dearth of independent and unbiased benchmarking studies. Here we take an approach using both simulated and experimental benchmark data to evaluate their accuracy.