TY - GEN
T1 - Deep Reinforcement Learning Aided Online Trajectory Optimization of Cellular-Connected UAVs with Offline Map Reconstruction
AU - Hao, Qing
AU - Zhao, Haitao
AU - Huang, Hao
AU - Gui, Guan
AU - Ohtsuki, Tomoaki
AU - Adachi, Fumiyuki
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - To reduce the outage of the connection between unmanned aerial vehicles (UAVs) and cellular networks in complex real-time channel state, and reduce the energy consumption of UAV during flight mission, an online trajectory optimization scheme of UAV based on outage probability knowledge map reconstruction is proposed. The outage probability knowledge map is a database that simulates the connection between UAV and the cellular network during real hovers. The UAV first samples sparsely from the target area and calculates the outage probability of the sampling point, and then uses the Kriging algorithm to reconstruct the outage probability knowledge map. Based on the reconstructed outage probability knowledge map, with the goal of minimizing the energy consumption of UAV task execution, the UAV trajectory optimization problem is established, and a trajectory optimization algorithm based on deep reinforcement learning (DRL) is proposed to solve it. Numerical results show that the proposed online trajectory optimization scheme based on outage probability knowledge map can obtain great returns in terms of maintaining connectivity, reducing task completion time and energy consumption.
AB - To reduce the outage of the connection between unmanned aerial vehicles (UAVs) and cellular networks in complex real-time channel state, and reduce the energy consumption of UAV during flight mission, an online trajectory optimization scheme of UAV based on outage probability knowledge map reconstruction is proposed. The outage probability knowledge map is a database that simulates the connection between UAV and the cellular network during real hovers. The UAV first samples sparsely from the target area and calculates the outage probability of the sampling point, and then uses the Kriging algorithm to reconstruct the outage probability knowledge map. Based on the reconstructed outage probability knowledge map, with the goal of minimizing the energy consumption of UAV task execution, the UAV trajectory optimization problem is established, and a trajectory optimization algorithm based on deep reinforcement learning (DRL) is proposed to solve it. Numerical results show that the proposed online trajectory optimization scheme based on outage probability knowledge map can obtain great returns in terms of maintaining connectivity, reducing task completion time and energy consumption.
KW - Deep reinforcement learning
KW - Kriging
KW - cellular-connected UAV
KW - energy-efficient UAV
KW - radio map
KW - trajectory design
UR - http://www.scopus.com/inward/record.url?scp=85169790998&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85169790998&partnerID=8YFLogxK
U2 - 10.1109/VTC2023-Spring57618.2023.10200397
DO - 10.1109/VTC2023-Spring57618.2023.10200397
M3 - Conference contribution
AN - SCOPUS:85169790998
T3 - IEEE Vehicular Technology Conference
BT - 2023 IEEE 97th Vehicular Technology Conference, VTC 2023-Spring - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 97th IEEE Vehicular Technology Conference, VTC 2023-Spring
Y2 - 20 June 2023 through 23 June 2023
ER -