Machine Learning Approach for Predicting Past Environmental Exposures from Molecular Profiling of Post-Exposure Human Serum Samples

Atif Khan, Thomas H. Thatcher, Collynn F. Woeller, Patricia J. Sime, Richard P. Phipps, Philip K. Hopke, Mark J. Utell, Pamela L. Krahl, Timothy M. Mallon, Juilee Thakar*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Objective:To develop an approach for a retrospective analysis of post-exposure serum samples using diverse molecular profiles.Methods:The 236 molecular profiles from 800 de-identified human serum samples from the Department of Defense Serum Repository were classified as smokers or non-smokers based on direct measurement of serum cotinine levels. A machine-learning pipeline was used to classify smokers and non-smokers from their molecular profiles.Results:The refined supervised support vector machines with recursive feature elimination predicted smokers and non-smokers with 78% accuracy on the independent held-out set. Several of the identified classifiers of smoking status have previously been reported and four additional miRNAs were validated with experimental tobacco smoke exposure in mice, supporting the computational approach.Conclusions:We developed and validated a pipeline that shows retrospective analysis of post-exposure serum samples can identify environmental exposures.

Original languageEnglish
Pages (from-to)S55-S64
JournalJournal of Occupational and Environmental Medicine
Volume61
DOIs
StatePublished - 1 Dec 2019
Externally publishedYes

Keywords

  • molecular profiling
  • support vector machines
  • tobacco use

Fingerprint

Dive into the research topics of 'Machine Learning Approach for Predicting Past Environmental Exposures from Molecular Profiling of Post-Exposure Human Serum Samples'. Together they form a unique fingerprint.

Cite this