TY - JOUR
T1 - Integrated RNA and DNA sequencing improves mutation detection in low purity tumors
AU - Wilkerson, Matthew D.
AU - Cabanski, Christopher R.
AU - Sun, Wei
AU - Hoadley, Katherine A.
AU - Walter, Vonn
AU - Mose, Lisle E.
AU - Troester, Melissa A.
AU - Hammerman, Peter S.
AU - Parker, Joel S.
AU - Perou, Charles M.
AU - Hayes, D. Neil
N1 - Funding Information:
National Cancer Institute [F32CA142039 to M.D.W., Breast SPORE P50-CA058223 to C.M.P.]; National Institutes of Health [U24 CA143848 to C.M.P., D.N.H and U24 CA143848-02S1 to C.M.P., D.N.H.]. Funding for open access charge: National Institutes of Health. Conflict of interest statement. M.D.W. was a consultant for GeneCentric Diagnostics and Cancer Therapeutics Innovation Group. C.M.P. is an equity stock holder, and Board of Director Member, of BioClassifier LLC and GeneCentric Diagnostics. D.N.H. is equity stock holder, and Board of Director Member GeneCentric Diagnostics.
PY - 2014/7/29
Y1 - 2014/7/29
N2 - Identifying somatic mutations is critical for cancer genome characterization and for prioritizing patient treatment. DNA whole exome sequencing (DNA-WES) is currently the most popular technology; however, this yields low sensitivity in low purity tumors. RNA sequencing (RNA-seq) covers the expressed exome with depth proportional to expression. We hypothesized that integrating DNA-WES and RNA-seq would enable superior mutation detection versus DNA-WES alone. We developed a first-of-its-kind method, called UNCeqR, that detects somatic mutations by integrating patient-matched RNA-seq and DNA-WES. In simulation, the integrated DNA and RNA model outperformed the DNA-WES only model. Validation by patient-matched whole genome sequencing demonstrated superior performance of the integrated model over DNA-WES only models, including a published method and published mutation profiles. Genome-wide mutational analysis of breast and lung cancer cohorts (n = 871) revealed remarkable tumor genomics properties. Low purity tumors experienced the largest gains in mutation detection by integrating RNA-seq and DNA-WES. RNA provided greater mutation signal than DNA in expressed mutations. Compared to earlier studies on this cohort, UNCeqR increased mutation rates of driver and therapeutically targeted genes (e.g. PIK3CA, ERBB2 and FGFR2). In summary, integrating RNA-seq with DNA-WES increases mutation detection performance, especially for low purity tumors.
AB - Identifying somatic mutations is critical for cancer genome characterization and for prioritizing patient treatment. DNA whole exome sequencing (DNA-WES) is currently the most popular technology; however, this yields low sensitivity in low purity tumors. RNA sequencing (RNA-seq) covers the expressed exome with depth proportional to expression. We hypothesized that integrating DNA-WES and RNA-seq would enable superior mutation detection versus DNA-WES alone. We developed a first-of-its-kind method, called UNCeqR, that detects somatic mutations by integrating patient-matched RNA-seq and DNA-WES. In simulation, the integrated DNA and RNA model outperformed the DNA-WES only model. Validation by patient-matched whole genome sequencing demonstrated superior performance of the integrated model over DNA-WES only models, including a published method and published mutation profiles. Genome-wide mutational analysis of breast and lung cancer cohorts (n = 871) revealed remarkable tumor genomics properties. Low purity tumors experienced the largest gains in mutation detection by integrating RNA-seq and DNA-WES. RNA provided greater mutation signal than DNA in expressed mutations. Compared to earlier studies on this cohort, UNCeqR increased mutation rates of driver and therapeutically targeted genes (e.g. PIK3CA, ERBB2 and FGFR2). In summary, integrating RNA-seq with DNA-WES increases mutation detection performance, especially for low purity tumors.
UR - http://www.scopus.com/inward/record.url?scp=84905581610&partnerID=8YFLogxK
U2 - 10.1093/nar/gku489
DO - 10.1093/nar/gku489
M3 - Article
C2 - 24970867
AN - SCOPUS:84905581610
SN - 0305-1048
VL - 42
SP - e107
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 13
ER -