TY - JOUR
T1 - A machine learning prediction model for total shoulder arthroplasty procedure duration
T2 - an evaluation of surgeon, patient, and shoulder-specific factors
AU - Levin, Jay M.
AU - Zaribafzadeh, Hamed
AU - Doyle, Tom R.
AU - Adu-Kwarteng, Kwabena
AU - Lunn, Kiera
AU - Helmkamp, Joshua K.
AU - Webster, Wendy
AU - Hurley, Eoghan T.
AU - Dickens, Jonathan F.
AU - Toth, Alison
AU - Anakwenze, Oke
AU - Klifto, Christopher S.
N1 - Publisher Copyright:
© 2025 Journal of Shoulder and Elbow Surgery Board of Trustees
PY - 2025
Y1 - 2025
N2 - Background: Operating room efficiency is of paramount importance for scheduling, cost efficiency, and to allow for the high operating volume required to address the growing demand for arthroplasty. The purpose of this study was to develop a machine learning predictive model for total shoulder arthroplasty (TSA) procedure duration and to identify factors which are predictive of a prolonged procedure. Methods: A retrospective review was undertaken of all TSA between 2013 and 2021 in a large academic institution. Patient, surgeon, anesthetic, and shoulder-specific factors were assessed. The duration of time in the operating room was recorded and compared to the human scheduler and electronic health record predicted procedure duration. Two gradient-boosted decision tree regression models were created with both training and validation datasets. The mean squared logarithmic error was chosen as the loss function. The first model (M1) considered patient, surgeon, and anesthetic factors, while the second model (M2) considered shoulder anatomy and pathology specific factors in addition. Results: Human schedulers’ predicted 64.1% of cases accurately, with 26.7% underpredicted and 9.2% overpredicted. M1 successfully predicted 79.7% of cases, with 6.9% underpredicted and 13.4% overpredicted. M2 successfully predicted 82.5% of cases with 8.8% underpredicted and 8.8% overpredicted. M2 was significantly more accurate in predicting anatomic total shoulder arthroplasty compared to reverse (rTSA) (90.6% vs. 78.1%, P < .001).The feature with the greatest impact on the shoulder-specific model's prediction was the historical median procedure duration; followed by the electronic health record prediction, surgeon prediction, patient age, and a traumatic indication. Factors which were associated with underpredicting procedure duration included younger age, traumatic indication, male sex, greater body mass index, and a B2 glenoid. Conclusion: Machine learning predictive models outperformed traditional scheduling, with a model incorporating general and shoulder-specific data providing the most accurate prediction of TSA procedure duration. Integration of modeling has the potential to optimize theater utilization and improve efficiency.
AB - Background: Operating room efficiency is of paramount importance for scheduling, cost efficiency, and to allow for the high operating volume required to address the growing demand for arthroplasty. The purpose of this study was to develop a machine learning predictive model for total shoulder arthroplasty (TSA) procedure duration and to identify factors which are predictive of a prolonged procedure. Methods: A retrospective review was undertaken of all TSA between 2013 and 2021 in a large academic institution. Patient, surgeon, anesthetic, and shoulder-specific factors were assessed. The duration of time in the operating room was recorded and compared to the human scheduler and electronic health record predicted procedure duration. Two gradient-boosted decision tree regression models were created with both training and validation datasets. The mean squared logarithmic error was chosen as the loss function. The first model (M1) considered patient, surgeon, and anesthetic factors, while the second model (M2) considered shoulder anatomy and pathology specific factors in addition. Results: Human schedulers’ predicted 64.1% of cases accurately, with 26.7% underpredicted and 9.2% overpredicted. M1 successfully predicted 79.7% of cases, with 6.9% underpredicted and 13.4% overpredicted. M2 successfully predicted 82.5% of cases with 8.8% underpredicted and 8.8% overpredicted. M2 was significantly more accurate in predicting anatomic total shoulder arthroplasty compared to reverse (rTSA) (90.6% vs. 78.1%, P < .001).The feature with the greatest impact on the shoulder-specific model's prediction was the historical median procedure duration; followed by the electronic health record prediction, surgeon prediction, patient age, and a traumatic indication. Factors which were associated with underpredicting procedure duration included younger age, traumatic indication, male sex, greater body mass index, and a B2 glenoid. Conclusion: Machine learning predictive models outperformed traditional scheduling, with a model incorporating general and shoulder-specific data providing the most accurate prediction of TSA procedure duration. Integration of modeling has the potential to optimize theater utilization and improve efficiency.
KW - Basic Science Study
KW - Computer Modeling Using AI/Machine Learning
KW - Shoulder arthroplasty
KW - artificial intelligence
KW - clinical effectiveness
KW - health economics
KW - machine learning
KW - reverse shoulder arthroplasty
UR - http://www.scopus.com/inward/record.url?scp=85217885352&partnerID=8YFLogxK
U2 - 10.1016/j.jse.2024.10.028
DO - 10.1016/j.jse.2024.10.028
M3 - Article
C2 - 39716610
AN - SCOPUS:85217885352
SN - 1058-2746
JO - Journal of Shoulder and Elbow Surgery
JF - Journal of Shoulder and Elbow Surgery
ER -