Repository logo
 

Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task

Date

2024-07-30

Authors

Freeman, Alec

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

This thesis describes the development of a hexapod simulator built in the MATLAB Simscape environment, with the goal of testing the potential for a designed experiment to be use in the selection of observations for a reinforcement learning controlled hexapod design. The hexapod is controlled using a novel combination of a central pattern generator consisting of six coupled Hopf oscillators, and mapping functions with parameters updated via a reinforcement learning agent. The reinforcement learning agent is trained to control the hexapod using the Deep Deterministic Policy Gradient (DDPG) algorithm on a trajectory following task. Through implementation of a designed experiment testing different combinations of observations, a model is formulated to estimate the observations required to maximize the hexapod training reward. The model is validated in the simulator and the capabilities of the hexapod are further demonstrated on more complex path following tasks.

Description

Keywords

Reinforcement learning, Hexapod, Design of experiments, Mobile robotics, Central pattern generator

Citation