Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task

Freeman, Alec

dc.contributor.author	Freeman, Alec
dc.contributor.copyright-release	Not Applicable	en_US
dc.contributor.degree	Master of Applied Science	en_US
dc.contributor.department	Department of Mechanical Engineering	en_US
dc.contributor.ethics-approval	Not Applicable	en_US
dc.contributor.external-examiner	n/a	en_US
dc.contributor.manuscripts	Not Applicable	en_US
dc.contributor.thesis-reader	Thomas Trappenberg	en_US
dc.contributor.thesis-reader	Ted Hubbard	en_US
dc.contributor.thesis-supervisor	Robert Bauer	en_US
dc.date.accessioned	2024-08-01T12:52:34Z
dc.date.available	2024-08-01T12:52:34Z
dc.date.defence	2024-07-26
dc.date.issued	2024-07-30
dc.description.abstract	This thesis describes the development of a hexapod simulator built in the MATLAB Simscape environment, with the goal of testing whether a designed experiment can be used to select observations for a reinforcement-learning-controlled hexapod design. The hexapod is controlled using a novel combination of a central pattern generator consisting of six coupled Hopf oscillators and mapping functions whose parameters are updated by a reinforcement learning agent. The agent is trained to control the hexapod using the Deep Deterministic Policy Gradient (DDPG) algorithm on a trajectory-following task. A designed experiment testing different combinations of observations is used to formulate a model that estimates the observations required to maximize the hexapod training reward. The model is validated in the simulator, and the capabilities of the hexapod are further demonstrated on more complex path-following tasks.	en_US
dc.identifier.uri	http://hdl.handle.net/10222/84372
dc.language.iso	en	en_US
dc.subject	Reinforcement learning	en_US
dc.subject	Hexapod	en_US
dc.subject	Design of experiments	en_US
dc.subject	Mobile robotics	en_US
dc.subject	Central pattern generator	en_US
dc.title	Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task	en_US

Files

Original bundle

Name: AlecFreeman2024.pdf
Size: 12.15 MB
Format: Adobe Portable Document Format

Collections

Faculty of Graduate Studies Online Theses