Detailed Information

Cited 2 time in webofscience Cited 2 time in scopus
Metadata Downloads

Development and evaluation of a machine learning model for osteoporosis risk prediction in Korean womenopen access

Authors
Je, MinkyungHwang, SeunghyeonLee, SuwonKim, Yoona
Issue Date
Mar-2025
Publisher
BioMed Central
Keywords
Machine learning; Classification; Osteoporosis; Women; Korean genome and epidemiology study (KoGES)
Citation
BMC Women's Health, v.25, no.1
Indexed
SCIE
SSCI
SCOPUS
Journal Title
BMC Women's Health
Volume
25
Number
1
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/77972
DOI
10.1186/s12905-025-03669-4
ISSN
1472-6874
1472-6874
Abstract
BackgroundThe aim of this study was to develop a machine learning (ML) model for classifying osteoporosis in Korean women based on a large-scale population cohort study. This study also aimed to assess ML model performance compared with traditional osteoporosis screening tools. Furthermore, this study aimed to examine the factors influencing the risk of osteoporosis through variable importance.MethodsData was collected from 4199 women aged 40-69 years in the baseline survey of the Ansan and Ansung cohort of the Korean Genome and Epidemiology Study. Osteoporosis was set as the dependent variable to develop ML classification models. Independent variables included 122 factors related to osteoporosis risk, such as socio-demographic characteristics, anthropometric parameters, lifestyle factors, reproductive factors, nutrient intakes, diet quality indices, medical history, medication history, family history, biochemical parameters, and genetic factors. The six classification models were developed using ML techniques, including decision tree, random forest, multilayer perceptron, support vector machine, light gradient boosting machine, and extreme gradient boosting (XGBoost). The six ML classification models were compared with two traditional osteoporosis screening tools, including the osteoporosis risk assessment instrument (ORAI) and the osteoporosis self-assessment tool (OST). The ML model performances were evaluated and compared using the confusion matrix and area under the curve (AUC) metrics. Variable importance was assessed using the XGBoost technique to investigate osteoporosis risk factors.ResultsThe XGBoost model showed the highest performance out of the six ML classification models, with an accuracy of 0.705, precision of 0.664, recall of 0.830, and F1 score of 0.738. Moreover, the XGBoost model showed a higher performance on AUC than ORAI and OST. Variable importance scores were identified for 69 out of the 122 variables associated with osteoporosis risk factors. Age at menopause ranked first in variable importance. Variables of arthritis, physical activities, hypertension, education level, income level; alcohol intake, potassium intake, homeostatic model assessment for insulin resistance; energy intake, vitamin C intake, gout; and dietary inflammatory index ranked in the top 20 out of the 69 variables, using the XGBoost technique.ConclusionsThis study found that an XGBoost model can be utilized to classify osteoporosis in Korean women. Age at menopause is a significant factor in osteoporosis risk, followed by arthritis, physical activities, hypertension, and education level.
Files in This Item
There are no files associated with this item.
Appears in
Collections
자연과학대학 > 식품영양학과 > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Yoon A photo

Kim, Yoon A
자연과학대학 (식품영양학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE