TY - JOUR
T1 - Distributed Q-Learning Aided Uplink Grant-Free NOMA for Massive Machine-Type Communications
AU - Liu, Jiajia
AU - Shi, Zhenjiang
AU - Zhang, Shangwei
AU - Kato, Nei
N1 - Funding Information:
Manuscript received June 30, 2020; revised September 27, 2020 and November 20, 2020; accepted February 19, 2021. Date of publication May 10, 2021; date of current version June 17, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 61771374, Grant 61771373, Grant 61801360, and Grant 62001393; in part by the Natural Science Basic Research Program of Shaanxi under Grant 2020JC-15 and Grant 2020JM-109; in part by the Fundamental Research Funds for the Central Universities under Grant 31020200QD010; in part by the Xi’an Unmanned System Security and Intelligent Communications ISTC Center; and in part by the Special Funds for Central Universities Construction of World-Class Universities (Disciplines) and Special Development Guidance under Grant 0639021GH0201024. (Corresponding author: Jiajia Liu.) Jiajia Liu and Shangwei Zhang are with the National Engineering Laboratory for Integrated Aero-Space-Ground-Ocean Big Data Application Technology, Northwestern Polytechnical University, Xi’an 710072, China (e-mail: liujiajia@nwpu.edu.cn).
Publisher Copyright:
© 1983-2012 IEEE.
PY - 2021/7
Y1 - 2021/7
N2 - The explosive growth of machine-type communications (MTC) devices poses critical challenges to the existing cellular networks. Therefore, how to support massive MTC devices with limited resources is an urgent problem to be solved. Bursty traffic is an important characteristic of MTC devices, which makes it difficult for agents to learn useful experience and has a negative impact on model convergence. However, most existing reinforcement learning-based literatures assume that devices have saturate data. Towards this end, we propose two distributed Q-learning aided uplink grant-free non-orthogonal multiple access (NOMA) schemes (including all-devices distributed Q-learning (ADDQ) scheme and portion-devices distributed Q-learning (PDDQ) scheme) to maximize the number of accessible devices, where the bursty traffic of massive MTC devices is carefully considered. In order to reduce the dimension of scheduling space and mitigate the impact of bursty traffic, the idea of grouping devices as well as transmission resources and the intermittent learning mode are adopted in our schemes. Extensive numerical results demonstrate the advantages of proposed schemes from multiple perspectives.
AB - The explosive growth of machine-type communications (MTC) devices poses critical challenges to the existing cellular networks. Therefore, how to support massive MTC devices with limited resources is an urgent problem to be solved. Bursty traffic is an important characteristic of MTC devices, which makes it difficult for agents to learn useful experience and has a negative impact on model convergence. However, most existing reinforcement learning-based literatures assume that devices have saturate data. Towards this end, we propose two distributed Q-learning aided uplink grant-free non-orthogonal multiple access (NOMA) schemes (including all-devices distributed Q-learning (ADDQ) scheme and portion-devices distributed Q-learning (PDDQ) scheme) to maximize the number of accessible devices, where the bursty traffic of massive MTC devices is carefully considered. In order to reduce the dimension of scheduling space and mitigate the impact of bursty traffic, the idea of grouping devices as well as transmission resources and the intermittent learning mode are adopted in our schemes. Extensive numerical results demonstrate the advantages of proposed schemes from multiple perspectives.
KW - Bursty traffic
KW - distributed Q-learning
KW - grant-free
KW - machine-type communications
KW - non-orthogonal multiple access
UR - http://www.scopus.com/inward/record.url?scp=85105884648&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85105884648&partnerID=8YFLogxK
U2 - 10.1109/JSAC.2021.3078496
DO - 10.1109/JSAC.2021.3078496
M3 - Article
AN - SCOPUS:85105884648
SN - 0733-8716
VL - 39
SP - 2029
EP - 2041
JO - IEEE Journal on Selected Areas in Communications
JF - IEEE Journal on Selected Areas in Communications
IS - 7
M1 - 9427159
ER -