Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Ophtimus-V2-Tx: a compact domain-specific LLM for ophthalmic diagnosis and treatment planning

Full metadata record
DC Field Value Language
dc.contributor.authorKwon, Minwook-
dc.contributor.authorJang, Kuk Jin-
dc.contributor.authorBaek, Seung Ju-
dc.contributor.authorHan, Yong Seop-
dc.contributor.authorChoi, Hyonyoung-
dc.contributor.authorLee, Insup-
dc.contributor.authorKim, Jin Hyun-
dc.date.accessioned2026-01-07T08:30:12Z-
dc.date.available2026-01-07T08:30:12Z-
dc.date.issued2025-12-
dc.identifier.issn2045-2322-
dc.identifier.issn2045-2322-
dc.identifier.urihttps://scholarworks.gnu.ac.kr/handle/sw.gnu/81635-
dc.description.abstractLarge language models (LLMs) show promise for clinical decision support but often struggle with case-specific reasoning. We present Ophtimus-V2-Tx, an 8-billion-parameter ophthalmology-specialized LLM fine-tuned on more than 10,000 case reports. Evaluation is conducted on a pre-collected dataset. Alongside text metrics (ROUGE-L, BLEU, METEOR) and a semantic similarity score, we use CliBench to map outputs to standardized codes (ICD-10-CM, ATC, ICD-10-PCS) and compute hierarchical F1 (L1–L4 and Full), with code mapping used strictly as an evaluation tool. Ophtimus-V2-Tx is competitive with a state-of-the-art general model and stronger in several settings. It improves text metrics (ROUGE-L 0.40 vs. 0.18; BLEU 0.26 vs. 0.05; METEOR 0.45 vs. 0.29) with comparable semantic similarity. On CliBench, it attains a higher full-code score for secondary diagnosis and ties or leads at selected granular levels for primary diagnosis, while medication and procedure results are close with overlapping confidence intervals. Relative to other ophthalmology-tuned baselines, it shows consistently higher text-generation scores. These findings indicate that a compact, domain-adapted model can approach-or in targeted settings, exceed-large general LLMs on clinically grounded outputs while remaining feasible for on-premise use. We also describe an auditable evaluation pipeline (frozen coding agent, identical prompts, hierarchical metrics) to support reproducibility and future benchmarking.-
dc.language영어-
dc.language.isoENG-
dc.publisherNature Publishing Group-
dc.titleOphtimus-V2-Tx: a compact domain-specific LLM for ophthalmic diagnosis and treatment planning-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.1038/s41598-025-27410-1-
dc.identifier.scopusid2-s2.0-105024318079-
dc.identifier.bibliographicCitationScientific Reports, v.15, no.1-
dc.citation.titleScientific Reports-
dc.citation.volume15-
dc.citation.number1-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
공학계열 > AI융합공학과 > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Jin Hyun photo

Kim, Jin Hyun
IT공과대학 (AI정보공학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE