Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency

Han, Ji Won; Oh, Dae Jong; Kim, Tae Hui; Kwak, Kyung Phil; Kim, Bong Jo; Kim, Shin Gyeom; Kim, Jeong Lan; Moon, Seok Woo; Park, Joon Hyuk; Ryu, Seung-Ho; Youn, Jong Chul; Lee, Dong Young; Lee, Dong Woo; Lee, Seok Bum; Lee, Jung Jae; Jhoo, Jin Hyeong; Kim, Ki Woong

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency

Authors: Han, Ji Won; Oh, Dae Jong; Kim, Tae Hui; Kwak, Kyung Phil; Kim, Bong Jo; Kim, Shin Gyeom; Kim, Jeong Lan; Moon, Seok Woo; Park, Joon Hyuk; Ryu, Seung-Ho; Youn, Jong Chul; Lee, Dong Young; Lee, Dong Woo; Lee, Seok Bum; Lee, Jung Jae; Jhoo, Jin Hyeong; Kim, Ki Woong

Issue Date: Jan-2026

Publisher: MDPI AG

Keywords: depression; screening; item response theory; geriatric depression scale; psychometrics

Citation: Journal of Clinical Medicine, v.15, no.2

Indexed: SCIE
SCOPUS

Journal Title: Journal of Clinical Medicine

Volume: 15

Number: 2

URI: https://scholarworks.gnu.ac.kr/handle/sw.gnu/82347

DOI: 10.3390/jcm15020473

ISSN: 2077-0383

Abstract: Background/Objective: Existing abbreviated Geriatric Depression Scales (GDSs), derived via Classical Test Theory (CTT), often sacrifice accuracy for brevity and retain non-specific items. We aimed to develop a minimum-item GDS maintaining diagnostic performance equivalent to the full 30-item scale (GDS30) using Item Response Theory (IRT). Methods: This cross-sectional study employed rigorous 5:5 split-sample cross-validation. Participants included 6525 older adults (aged >= 60 years) from community-based (Korean Longitudinal Study on Cognitive Aging and Dementia) and clinical settings (geropsychiatry clinic). Depression was diagnosed through standardized clinical interviews based on DSM-IV criteria. Two-parameter logistic IRT models estimated item discrimination and difficulty parameters. Sequential item reduction with DeLong tests identified the minimum number of items required to maintain GDS30-equivalent area under the curve (AUC). Results: The 10-item IRT-optimized scale (GDS10-IRT) achieved an AUC of 0.856 (95% CI: 0.809-0.895) in the validation set, showing no significant difference from GDS30 (AUC = 0.883; p = 0.396). Conversely, the 15-item GDS (GDS15) demonstrated significantly lower AUC than GDS30 (p < 0.001) despite having more items. GDS10-IRT achieved a 234% improvement in efficiency ratio (AUC/items) over GDS30. Notably, Item 16 ("feeling downhearted and blue"), identified as the most discriminating symptom (a = 2.53), is absent from the GDS15 but included in GDS10-IRT. Conclusions: IRT-based item selection achieves GDS30-equivalent diagnostic accuracy with only 10 items, outperforming the widely used GDS15. By recovering high-discrimination items excluded by CTT, the GDS10-IRT offers a more efficient, specific screening tool for late-life depression.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Medicine > Department of Medicine > Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Bong Cho photo

Kim, Bong Cho: 의과대학 (의학과)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

Gyeongsang National University Central Library, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea+82-55-772-0534

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE