Caffeine intake, race, and risk of invasive breast cancer lessons learned from data mining a clinical database

Susan Maskery*, Zhang Yonghong, Hu Hai, Craig Shrivel, Jeffrey Hooke, Michael Liebman

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Over the past five years the Clinical Breast Care Project (CBCP) has amassed a significant patient database and tissue repository related to breast disease and breast cancer. We have begun mining this unique data source (i.e. life history questionnaire data, pathology reports, analysis of blood and tissue samples) to examine interactions between known risk factors for breast cancer development (i.e. menopausal status, parity, etc.) with breast disease and cancer incidence in our patient population. From these initial forays into analyzing the CBCP's data repository, we have begun to develop protocols for data mining. In particular, a crucial first step is to quantify interactions between variables of interest prior to any specific significance tests relating individual variables to risk of a clinical result. For this purpose, we find Bayesian network analysis the most useful method for exploration of data interactions. To illustrate this point, this abstract details an investigation into the effect of caffeine consumption on breast cancer incidence in our CBCP population, Based on our experience with this and other studies we strongly recommend Bayesian network analysis of all variables of interest as an initial data exploration tool.

Original languageEnglish
Title of host publicationProceedings - 19th IEEE International Symposium on Computer-Based Medical Systems, CBMS 2006
Pages714-718
Number of pages5
DOIs
StatePublished - 2006
Externally publishedYes
Event19th IEEE International Symposium on Computer-Based Medical Systems, CBMS 2006 - Salt Lake City, UT, United States
Duration: 22 Jun 200623 Jun 2006

Publication series

NameProceedings - IEEE Symposium on Computer-Based Medical Systems
Volume2006
ISSN (Print)1063-7125

Conference

Conference19th IEEE International Symposium on Computer-Based Medical Systems, CBMS 2006
Country/TerritoryUnited States
CitySalt Lake City, UT
Period22/06/0623/06/06

Fingerprint

Dive into the research topics of 'Caffeine intake, race, and risk of invasive breast cancer lessons learned from data mining a clinical database'. Together they form a unique fingerprint.

Cite this