Multi-Agent Reinforcement Learning for a Multichannel Uplink Random Access: Congestion Game Perspective
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

3

초록

In this paper, we propose a time-slotted multichannel uplink random access (RA) game model where players do not cooperate. We first analyze its sum throughput from the congestion game (CG) perspective and obtain the pure strategy Nash equilibria (PNEs) that fully utilize each slot. Then, we propose an Upper Confidence Bound (UCB)-based multi-agent reinforcement learning (MARL) algorithm to realize the PNEs, where UCB is one of the multi-armed bandit algorithms that work by assigning a confidence level for each action. Finally, via simulation, we show that our proposed algorithm can obtain near-optimal average sum throughput in the long run. © 2023 IEEE.

키워드

congestion gameMulti-armed banditnash equilibriumrandom accessreinforcement learning
제목
Multi-Agent Reinforcement Learning for a Multichannel Uplink Random Access: Congestion Game Perspective
저자
Zhao, YuLee, JoohyunSeo, Jun-Bae
DOI
10.1109/iCAST57874.2023.10359301
발행일
2023-11
유형
Conference paper
저널명
Proceedings of 2023 12th International Conference on Awareness Science and Technology, iCAST 2023
페이지
156 ~ 160