Detailed Information

Cited 5 time in webofscience Cited 9 time in scopus
Metadata Downloads

Multi-Agent Reinforcement Learning for a Random Access Game

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Dongwoo-
dc.contributor.authorZhao, Yu-
dc.contributor.authorSeo, Jun-Bae-
dc.contributor.authorLee, Joohyun-
dc.date.accessioned2022-12-26T05:41:35Z-
dc.date.available2022-12-26T05:41:35Z-
dc.date.issued2022-08-
dc.identifier.issn0018-9545-
dc.identifier.issn1939-9359-
dc.identifier.urihttps://scholarworks.gnu.ac.kr/handle/sw.gnu/1011-
dc.description.abstractThis work investigates a random access (RA) game for a time-slotted RA system, where N players choose a set of slots of a frame and each frame consists of M multiple time slots. We obtain the pure strategy Nash equilibria (PNEs) of this RA game, where slots are fully utilized as in the centralized scheduling. As an algorithm to realize a PNE (Pure strategy Nash Equilibrium), we propose an Exponential-weight algorithm for Exploration and Exploitation (EXP3)-based multi-agent (MA) learning algorithm, which has the computational complexity of O(N (NmaxT)-T-2). EXP3 is a bandit algorithm designed to find an optimal strategy in a multi-armed bandit (MAB) problem that users do not know the expected payoff of each strategy. Our simulation results show that the proposed algorithm can achieve PNEs. Moreover, it can adapt to time-varying environments, where the number of players varies over time.-
dc.format.extent6-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers-
dc.titleMulti-Agent Reinforcement Learning for a Random Access Game-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/TVT.2022.3176722-
dc.identifier.scopusid2-s2.0-85130779348-
dc.identifier.wosid000846892800095-
dc.identifier.bibliographicCitationIEEE Transactions on Vehicular Technology, v.71, no.8, pp 9119 - 9124-
dc.citation.titleIEEE Transactions on Vehicular Technology-
dc.citation.volume71-
dc.citation.number8-
dc.citation.startPage9119-
dc.citation.endPage9124-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalResearchAreaTransportation-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.relation.journalWebOfScienceCategoryTransportation Science & Technology-
dc.subject.keywordPlusALOHA-
dc.subject.keywordAuthorMulti-armed bandit-
dc.subject.keywordAuthornash equilibrium-
dc.subject.keywordAuthornon-cooperative game-
dc.subject.keywordAuthorrandom access-
Files in This Item
There are no files associated with this item.
Appears in
Collections
해양과학대학 > 지능형통신공학과 > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Seo, Jun Bae photo

Seo, Jun Bae
IT공과대학 (AI정보공학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE