ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data

Christopher R. Cabanski, Keary Cavin, Chris Bizon, Matthew D. Wilkerson, Joel S. Parker, Kirk C. Wilhelmsen, Charles M. Perou, J. S. Marron, D. N. Hayes*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Background: Next-generation sequencing technologies have become important tools for genome-wide studies. However, the quality scores that are assigned to each base have been shown to be inaccurate. If the quality scores are used in downstream analyses, these inaccuracies can have a significant impact on the results.Results: Here we present ReQON, a tool that recalibrates the base quality scores from an input BAM file of aligned sequencing data using logistic regression. ReQON also generates diagnostic plots showing the effectiveness of the recalibration. We show that ReQON produces quality scores that are both more accurate, in the sense that they more closely correspond to the probability of a sequencing error, and do a better job of discriminating between sequencing errors and non-errors than the original quality scores. We also compare ReQON to other available recalibration tools and show that ReQON is less biased and performs favorably in terms of quality score accuracy.Conclusion: ReQON is an open source software package, written in R and available through Bioconductor, for recalibrating base quality scores for next-generation sequencing data. ReQON produces a new BAM file with more accurate quality scores, which can improve the results of downstream analysis, and produces several diagnostic plots showing the effectiveness of the recalibration.

Original languageEnglish
Article number221
JournalBMC Bioinformatics
Volume13
Issue number1
DOIs
StatePublished - 4 Sep 2012
Externally publishedYes

Keywords

  • Bioconductor
  • Bioinformatics
  • Next-generation sequencing
  • Quality score
  • Recalibration

Fingerprint

Dive into the research topics of 'ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data'. Together they form a unique fingerprint.

Cite this