Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Predicting Nurse Turnover for Highly Imbalanced Data Using the Synthetic Minority Over-Sampling Technique and Machine Learning Algorithmsopen access

Authors
Xu, YuanPark, YongshinPark, Ju DongSun, Bora
Issue Date
Dec-2023
Publisher
Multidisciplinary Digital Publishing Institute (MDPI)
Keywords
machine learning; NSSRN; nurse turnover; random forest; SMOTE; XGBoost
Citation
Healthcare (Switzerland), v.11, no.24
Indexed
SCIE
SSCI
SCOPUS
Journal Title
Healthcare (Switzerland)
Volume
11
Number
24
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/69041
DOI
10.3390/healthcare11243173
ISSN
2227-9032
2227-9032
Abstract
Predicting nurse turnover is a growing challenge within the healthcare sector, profoundly impacting healthcare quality and the nursing profession. This study employs the Synthetic Minority Over-sampling Technique (SMOTE) to address class imbalance issues in the 2018 National Sample Survey of Registered Nurses dataset and predict nurse turnover using machine learning algorithms. Four machine learning algorithms, namely logistic regression, random forests, decision tree, and extreme gradient boosting, were applied to the SMOTE-enhanced dataset. The data were split into 80% training and 20% validation sets. Eighteen carefully selected variables from the database served as predictive features, and the machine learning model identified age, working hours, electric health record/electronic medical record, individual income, and job type as important features concerning nurse turnover. The study includes a performance comparison based on accuracy, precision, recall (sensitivity), F1-score, and AUC. In summary, the results demonstrate that SMOTE-enhanced random forests exhibit the most robust predictive power in the classical approach (with all 18 predictive variables) and an optimized approach (utilizing eight key predictive variables). Extreme gradient boosting, decision tree, and logistic regression follow in performance. Notably, age emerges as the most influential factor in nurse turnover, with working hours, electric health record/electronic medical record usability, individual income, and region also playing significant roles. This research offers valuable insights for healthcare researchers and stakeholders, aiding in selecting suitable machine learning algorithms for nurse turnover prediction. © 2023 by the authors.
Files in This Item
There are no files associated with this item.
Appears in
Collections
해양과학대학 > Department of Maritime Police and Production System > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Ju Dong photo

Park, Ju Dong
해양과학대학 (해양경찰시스템학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE