Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

KL 유도 다중 업데이트 기반의 새로운 손실 인지 근접 정책 최적화 기법

Full metadata record
DC Field Value Language
dc.contributor.author반태원-
dc.date.accessioned2025-08-06T07:30:09Z-
dc.date.available2025-08-06T07:30:09Z-
dc.date.issued2025-07-
dc.identifier.issn2234-4772-
dc.identifier.issn2288-4165-
dc.identifier.urihttps://scholarworks.gnu.ac.kr/handle/sw.gnu/79639-
dc.description.abstractThis paper proposes LA-PPO-KL (Loss-Aware Proximal Policy Optimization with KL-guided Retrying), a novel reinforcement learning algorithm that improves the stability and efficiency of PPO. Unlike traditional PPO, which relies on static clipping or KL thresholds, LA-PPO-KL monitors both policy and value loss to determine when to retry policy updates. It also halts retries when KL divergence exceeds a predefined limit, preventing excessive policy shifts. Experiments in the BipedalWalker-v3 environment demonstrate that LA-PPO-KL outperforms baseline PPO by 15~20% in average return, with faster convergence and more robust learning. These results highlight the potential of adaptive retry mechanisms in improving policy optimization under complex and uncertain environments.-
dc.format.extent4-
dc.language한국어-
dc.language.isoKOR-
dc.publisher한국정보통신학회-
dc.titleKL 유도 다중 업데이트 기반의 새로운 손실 인지 근접 정책 최적화 기법-
dc.title.alternativeA New Loss-Aware Proximal Policy Optimization Based On KL-Guided Multi-Updates-
dc.typeArticle-
dc.publisher.location대한민국-
dc.identifier.bibliographicCitation한국정보통신학회논문지, v.29, no.7, pp 960 - 963-
dc.citation.title한국정보통신학회논문지-
dc.citation.volume29-
dc.citation.number7-
dc.citation.startPage960-
dc.citation.endPage963-
dc.type.docTypeY-
dc.identifier.kciidART003228585-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClasskci-
dc.subject.keywordAuthorProximal Policy Optimization-
dc.subject.keywordAuthorAdaptive Policy Update-
dc.subject.keywordAuthorReinforcement Learning Stability-
dc.subject.keywordAuthorKL Divergence Control.-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Ban, Tae Won photo

Ban, Tae Won
IT공과대학 (AI정보공학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE