SigFuge: Single gene clustering of RNA-seq reveals differential isoform usage among cancer samples

Patrick K. Kimes, Christopher R. Cabanski, Matthew D. Wilkerson, Ni Zhao, Amy R. Johnson, Charles M. Perou, Liza Makowski, Christopher A. Maher, Yufeng Liu, J. S. Marron, D. Neil Hayes*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Scopus citations


High-throughput sequencing technologies, including RNA-seq, have made it possible to move beyond gene expression analysis to study transcriptional events including alternative splicing and gene fusions. Furthermore, recent studies in cancer have suggested the importance of identifying transcriptionally altered loci as biomarkers for improved prognosis and therapy. While many statistical methods have been proposed for identifying novel transcriptional events with RNA-seq, nearly all rely on contrasting known classes of samples, such as tumor and normal. Few tools exist for the unsupervised discovery of such events without class labels. In this paper, we present SigFuge for identifying genomic loci exhibiting differential transcription patterns across many RNA-seq samples. SigFuge combines clustering with hypothesis testing to identify genes exhibiting alternative splicing, or differences in isoform expression. We apply SigFuge to RNA-seq cohorts of 177 lung and 279 head and neck squamous cell carcinoma samples from the Cancer Genome Atlas, and identify several cases of differential isoform usage including CDKN2A, a tumor suppressor gene known to be inactivated in a majority of lung squamous cell tumors. By not restricting attention to known sample stratifications, SigFuge offers a novel approach to unsupervised screening of genetic loci across RNA-seq cohorts. SigFuge is available as an R package through Bioconductor.

Original languageEnglish
Pages (from-to)e113
JournalNucleic Acids Research
Issue number14
StatePublished - 18 Aug 2014
Externally publishedYes


Dive into the research topics of 'SigFuge: Single gene clustering of RNA-seq reveals differential isoform usage among cancer samples'. Together they form a unique fingerprint.

Cite this