TY - JOUR
T1 - Development of a natural language processing algorithm to extract social determinants of health from clinician notes
AU - Zaribafzadeh, Hamed
AU - Henson, Jacqueline B.
AU - Chan, Norine W.
AU - Rogers, Ursula
AU - Webster, Wendy
AU - Schappe, Tyler
AU - Li, Fan
AU - Matsouaka, Roland A.
AU - Kirk, Allan D.
AU - Henao, Ricardo
AU - McElroy, Lisa M.
N1 - Publisher Copyright:
© 2025 American Society of Transplantation & American Society of Transplant Surgeons
PY - 2025
Y1 - 2025
N2 - Disparities in access to the organ transplant waitlist are well-documented, but research into modifiable factors has been limited due to a lack of access to organized prewaitlisting data. This study aimed to develop a natural language processing (NLP) algorithm to extract social determinants of health (SDOH) from free-text notes and quantify the association of SDOH with access to the transplant waitlist. We collected 261 802 clinician notes from 11 111 adults referred for kidney or liver transplants between 2016 and 2022 at the Duke University Health System. An SDOH ontology and a rule-based NLP algorithm were created to extract and organize terms. Education, transportation, and age were the most frequent terms identified. Negative sentiment and refer were the most negatively associated features with listing in both kidney and liver transplant patients. Income and employment for the kidney, and judgment and positive sentiment for liver were the most positively associated features with the listing. This study suggests that the integration of NLP tools into the transplant clinical workflow could help improve collection and organization of SDOH and inform center-level efforts at resource allocation, potentially improving access to the transplant waitlist and posttransplant outcomes.
AB - Disparities in access to the organ transplant waitlist are well-documented, but research into modifiable factors has been limited due to a lack of access to organized prewaitlisting data. This study aimed to develop a natural language processing (NLP) algorithm to extract social determinants of health (SDOH) from free-text notes and quantify the association of SDOH with access to the transplant waitlist. We collected 261 802 clinician notes from 11 111 adults referred for kidney or liver transplants between 2016 and 2022 at the Duke University Health System. An SDOH ontology and a rule-based NLP algorithm were created to extract and organize terms. Education, transportation, and age were the most frequent terms identified. Negative sentiment and refer were the most negatively associated features with listing in both kidney and liver transplant patients. Income and employment for the kidney, and judgment and positive sentiment for liver were the most positively associated features with the listing. This study suggests that the integration of NLP tools into the transplant clinical workflow could help improve collection and organization of SDOH and inform center-level efforts at resource allocation, potentially improving access to the transplant waitlist and posttransplant outcomes.
KW - natural language processing
KW - social determinants of health
UR - http://www.scopus.com/inward/record.url?scp=105000760759&partnerID=8YFLogxK
U2 - 10.1016/j.ajt.2025.02.019
DO - 10.1016/j.ajt.2025.02.019
M3 - Article
C2 - 40057196
AN - SCOPUS:105000760759
SN - 1600-6135
JO - American Journal of Transplantation
JF - American Journal of Transplantation
ER -