TY - GEN
T1 - Reinforcement Learning based Hierarchical Control for Path Tracking of a Wheeled Bipedal Robot with Sim-to-Real Framework
AU - Zhu, Wei
AU - Raza, Fahad
AU - Hayashibe, Mitsuhiro
N1 - Funding Information:
This work was supported by the Japan Society for the Promotion of Science (JSPS) Grant-in-Aid for Scientific Research (B) under Grant 18H01399.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - We propose a reinforcement learning (RL) based hierarchical control framework for path tracking of a wheeled bipedal robot. The framework consists of three control levels. 1) The high-level RL is used to obtain an optimal policy through trial and error in a simulated environment. 2) The middle-level Lyapunov-based non-linear controller is utilized to track a desired line with strong robustness and high stability. 3) The low-level PID-based controller is implemented to simultaneously achieve both balancing and velocity tracking for a physical wheeled bipedal robot in real world. Thanks to the middle-level controller, the offline trained policy in simulation can be directly employed on the physical robot in real time without tuning any parameters. Moreover, the high-level policy network is able to improve optimality and generality for the task of path tracking, as well to avoid the cumbersome process of manually tuning control gains. The experiment results in both simulation and real world demonstrate that the proposed hierarchical control framework can achieve quick, robust, and stable path tracking for a wheeled bipedal robot.
AB - We propose a reinforcement learning (RL) based hierarchical control framework for path tracking of a wheeled bipedal robot. The framework consists of three control levels. 1) The high-level RL is used to obtain an optimal policy through trial and error in a simulated environment. 2) The middle-level Lyapunov-based non-linear controller is utilized to track a desired line with strong robustness and high stability. 3) The low-level PID-based controller is implemented to simultaneously achieve both balancing and velocity tracking for a physical wheeled bipedal robot in real world. Thanks to the middle-level controller, the offline trained policy in simulation can be directly employed on the physical robot in real time without tuning any parameters. Moreover, the high-level policy network is able to improve optimality and generality for the task of path tracking, as well to avoid the cumbersome process of manually tuning control gains. The experiment results in both simulation and real world demonstrate that the proposed hierarchical control framework can achieve quick, robust, and stable path tracking for a wheeled bipedal robot.
UR - http://www.scopus.com/inward/record.url?scp=85126213632&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126213632&partnerID=8YFLogxK
U2 - 10.1109/SII52469.2022.9708882
DO - 10.1109/SII52469.2022.9708882
M3 - Conference contribution
AN - SCOPUS:85126213632
T3 - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
SP - 40
EP - 46
BT - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
Y2 - 9 January 2022 through 12 January 2022
ER -