Repository logo
 

Application of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Task

dc.contributor.authorFreeman, Alec
dc.contributor.copyright-releaseNot Applicableen_US
dc.contributor.degreeMaster of Applied Scienceen_US
dc.contributor.departmentDepartment of Mechanical Engineeringen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.external-examinern/aen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.thesis-readerThomas Trappenbergen_US
dc.contributor.thesis-readerTed Hubbarden_US
dc.contributor.thesis-supervisorRobert Baueren_US
dc.date.accessioned2024-08-01T12:52:34Z
dc.date.available2024-08-01T12:52:34Z
dc.date.defence2024-07-26
dc.date.issued2024-07-30
dc.description.abstractThis thesis describes the development of a hexapod simulator built in the MATLAB Simscape environment, with the goal of testing the potential for a designed experiment to be use in the selection of observations for a reinforcement learning controlled hexapod design. The hexapod is controlled using a novel combination of a central pattern generator consisting of six coupled Hopf oscillators, and mapping functions with parameters updated via a reinforcement learning agent. The reinforcement learning agent is trained to control the hexapod using the Deep Deterministic Policy Gradient (DDPG) algorithm on a trajectory following task. Through implementation of a designed experiment testing different combinations of observations, a model is formulated to estimate the observations required to maximize the hexapod training reward. The model is validated in the simulator and the capabilities of the hexapod are further demonstrated on more complex path following tasks.en_US
dc.identifier.urihttp://hdl.handle.net/10222/84372
dc.language.isoenen_US
dc.subjectReinforcement learningen_US
dc.subjectHexapoden_US
dc.subjectDesign of experimentsen_US
dc.subjectMobile roboticsen_US
dc.subjectCentral pattern generatoren_US
dc.titleApplication of a Factorial Designed Experiment to Optimize Selection of Reinforcement Learning Observations for a Hexapod Trajectory Following Tasken_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AlecFreeman2024.pdf
Size:
12.15 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: