TY - JOUR
T1 - Voice EHR
T2 - introducing multimodal audio data for health
AU - Anibal, James
AU - Huth, Hannah
AU - Li, Ming
AU - Hazen, Lindsey
AU - Daoud, Veronica
AU - Ebedes, Dominique
AU - Lam, Yen Minh
AU - Nguyen, Hang
AU - Hong, Phuc Vo
AU - Kleinman, Michael
AU - Ost, Shelley
AU - Jackson, Christopher
AU - Sprabery, Laura
AU - Elangovan, Cheran
AU - Krishnaiah, Balaji
AU - Akst, Lee
AU - Lina, Ioan
AU - Elyazar, Iqbal
AU - Ekawati, Lenny
AU - Jansen, Stefan
AU - Nduwayezu, Richard
AU - Garcia, Charisse
AU - Plum, Jeffrey
AU - Brenner, Jacqueline
AU - Song, Miranda
AU - Ricotta, Emily
AU - Clifton, David
AU - Thwaites, C. Louise
AU - Bensoussan, Yael
AU - Wood, Bradford
N1 - Publisher Copyright:
2025 Anibal, Huth, Li, Hazen, Daoud, Ebedes, Lam, Nguyen, Hong, Kleinman, Ost, Jackson, Sprabery, Elangovan, Krishnaiah, Akst, Lina, Elyazar, Ekawati, Jansen, Nduwayezu, Garcia, Plum, Brenner, Song, Ricotta, Clifton, Thwaites, Bensoussan and Wood.
PY - 2024
Y1 - 2024
N2 - Introduction: Artificial intelligence (AI) models trained on audio data may have the potential to rapidly perform clinical tasks, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets collected with expensive recording equipment in high-income countries, which challenges deployment in resource-constrained, high-volume settings where audio data may have a profound impact on health equity. Methods: This report introduces a novel protocol for audio data collection and a corresponding application that captures health information through guided questions. Results: To demonstrate the potential of Voice EHR as a biomarker of health, initial experiments on data quality and multiple case studies are presented in this report. Large language models (LLMs) were used to compare transcribed Voice EHR data with data (from the same patients) collected through conventional techniques like multiple choice questions. Information contained in the Voice EHR samples was consistently rated as equally or more relevant to a health evaluation. Discussion: The HEAR application facilitates the collection of an audio electronic health record (“Voice EHR”) that may contain complex biomarkers of health from conventional voice/respiratory features, speech patterns, and spoken language with semantic meaning and longitudinal context–potentially compensating for the typical limitations of unimodal clinical datasets.
AB - Introduction: Artificial intelligence (AI) models trained on audio data may have the potential to rapidly perform clinical tasks, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets collected with expensive recording equipment in high-income countries, which challenges deployment in resource-constrained, high-volume settings where audio data may have a profound impact on health equity. Methods: This report introduces a novel protocol for audio data collection and a corresponding application that captures health information through guided questions. Results: To demonstrate the potential of Voice EHR as a biomarker of health, initial experiments on data quality and multiple case studies are presented in this report. Large language models (LLMs) were used to compare transcribed Voice EHR data with data (from the same patients) collected through conventional techniques like multiple choice questions. Information contained in the Voice EHR samples was consistently rated as equally or more relevant to a health evaluation. Discussion: The HEAR application facilitates the collection of an audio electronic health record (“Voice EHR”) that may contain complex biomarkers of health from conventional voice/respiratory features, speech patterns, and spoken language with semantic meaning and longitudinal context–potentially compensating for the typical limitations of unimodal clinical datasets.
KW - AI for health
KW - large language models (LLM)
KW - multimodal data
KW - natural language processing
KW - voice biomarkers
UR - http://www.scopus.com/inward/record.url?scp=85217691601&partnerID=8YFLogxK
U2 - 10.3389/fdgth.2024.1448351
DO - 10.3389/fdgth.2024.1448351
M3 - Article
AN - SCOPUS:85217691601
SN - 2673-253X
VL - 6
JO - Frontiers in Digital Health
JF - Frontiers in Digital Health
M1 - 1448351
ER -