PPMC Training Algorithm: A Deep Learning Based Path Planner and Motion Controller

Tamir Blum, William Jones, Kazuya Yoshida

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

In the pursuit of a fully autonomous learning agent able to interact, move, and be useful in the real world, two fundamental problems are path planning and motion control, and user-agent interaction. We address these through reinforcement learning using our Path Planning and Motion Controller (PPMC) Training Algorithm, which uses a combination of observable goals and randomization of goals during training, with a customized reward function, to teach a simulated quadruped agent to respond to user commands and to travel to designated areas. In this regard, we identified two critical components of path planning and motion control: the first is region enabled travel, or the ability to travel towards any location within a prescribed area; the second is multi-point travel, or the ability to travel to multiple points in succession. An important open ended question is how many tasks should be handled by a single policy and if a single policy can even learn to manage several tasks. We demonstrate that it is possible to contain both a maples path planner and motion controller on a single neural network, which could prove promising in future work due to their interlinked and synergistic nature. Using control group policies and various test cases and using ACKTR and PPO, we empirically validate our algorithm teaches the agent to respond to user commands as well as path planning and motion control.

Original languageEnglish
Title of host publication2020 International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages193-198
Number of pages6
ISBN (Electronic)9781728149851
DOIs
Publication statusPublished - 2020 Feb
Event2nd International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020 - Fukuoka, Japan
Duration: 2020 Feb 192020 Feb 21

Publication series

Name2020 International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020

Conference

Conference2nd International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2020
Country/TerritoryJapan
CityFukuoka
Period20/2/1920/2/21

Keywords

  • ACKTR
  • Autonomous Systems
  • Control and Decision Systems
  • Human Commanded Systems
  • Machine Learning
  • Path Planning
  • PPO
  • Reinforcement Learning
  • Robotics
  • Teleoperations
  • Training Algorithm

Fingerprint

Dive into the research topics of 'PPMC Training Algorithm: A Deep Learning Based Path Planner and Motion Controller'. Together they form a unique fingerprint.

Cite this