Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task

Freeman, Alec

Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task

Files

AlecFreeman2024.pdf (12.15 MB)

Date

2024-07-30

Authors

Freeman, Alec

Abstract

This thesis describes the development of a hexapod simulator built in the MATLAB Simscape environment, with the goal of testing the potential for a designed experiment to be use in the selection of observations for a reinforcement learning controlled hexapod design. The hexapod is controlled using a novel combination of a central pattern generator consisting of six coupled Hopf oscillators, and mapping functions with parameters updated via a reinforcement learning agent. The reinforcement learning agent is trained to control the hexapod using the Deep Deterministic Policy Gradient (DDPG) algorithm on a trajectory following task. Through implementation of a designed experiment testing different combinations of observations, a model is formulated to estimate the observations required to maximize the hexapod training reward. The model is validated in the simulator and the capabilities of the hexapod are further demonstrated on more complex path following tasks.

Keywords

Reinforcement learning, Hexapod, Design of experiments, Mobile robotics, Central pattern generator

URI

http://hdl.handle.net/10222/84372

Collections

Faculty of Graduate Studies Online Theses

Full item page

Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections