TY - GEN
T1 - Deep Reinforcement Learning with Gait Mode Specification for Quadrupedal Trot-Gallop Energetic Analysis
AU - Chai, Jiazheng
AU - Owaki, Dai
AU - Hayashibe, Mitsuhiro
N1 - Funding Information:
This work was supported by the JSPS Grant-in-Aid for Scientific Research on Innovative Areas Hyper-Adaptability Project under Grant 20H05458.
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Quadruped system is an animal-like model which has long been analyzed in terms of energy efficiency during its various gait locomotion. The generation of certain gait modes on these systems has been achieved by classical controllers which demand highly specific domain-knowledge and empirical parameter tuning. In this paper, we propose to use deep reinforcement learning (DRL) as an alternative approach to generate certain gait modes on quadrupeds, allowing potentially the same energetic analysis without the difficulty of designing an ad hoc controller. We show that by specifying a gait mode in the process of learning, it allows faster convergence of the learning process while at the same time imposing a certain gait type on the systems as opposed to the case without any gait specification. We demonstrate the advantages of using DRL as it can exploit automatically the physical condition of the robots such as the passive spring effect between the joints during the learning process, similar to the adaptation skills of an animal. The proposed system would provide a framework for quadrupedal trot-gallop energetic analysis for different body structures, body mass distributions and joint characteristics using DRL.
AB - Quadruped system is an animal-like model which has long been analyzed in terms of energy efficiency during its various gait locomotion. The generation of certain gait modes on these systems has been achieved by classical controllers which demand highly specific domain-knowledge and empirical parameter tuning. In this paper, we propose to use deep reinforcement learning (DRL) as an alternative approach to generate certain gait modes on quadrupeds, allowing potentially the same energetic analysis without the difficulty of designing an ad hoc controller. We show that by specifying a gait mode in the process of learning, it allows faster convergence of the learning process while at the same time imposing a certain gait type on the systems as opposed to the case without any gait specification. We demonstrate the advantages of using DRL as it can exploit automatically the physical condition of the robots such as the passive spring effect between the joints during the learning process, similar to the adaptation skills of an animal. The proposed system would provide a framework for quadrupedal trot-gallop energetic analysis for different body structures, body mass distributions and joint characteristics using DRL.
UR - http://www.scopus.com/inward/record.url?scp=85122548948&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122548948&partnerID=8YFLogxK
U2 - 10.1109/EMBC46164.2021.9630547
DO - 10.1109/EMBC46164.2021.9630547
M3 - Conference contribution
C2 - 34892236
AN - SCOPUS:85122548948
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 4583
EP - 4587
BT - 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
Y2 - 1 November 2021 through 5 November 2021
ER -