Repository logo
 

Providing Real-Valued Actions for Tangled Program Graphs Under the CartPole Benchmark

dc.contributor.authorWright, Matthew
dc.contributor.copyright-releaseNot Applicableen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.external-examinern/aen_US
dc.contributor.graduate-coordinatorMichael McAllisteren_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.thesis-readerNur Zincir-Heywooden_US
dc.contributor.thesis-readerGarnett Wilsonen_US
dc.contributor.thesis-supervisorMalcolm Heywooden_US
dc.date.accessioned2020-08-20T14:12:41Z
dc.date.available2020-08-20T14:12:41Z
dc.date.defence2020-08-14
dc.date.issued2020-08-20T14:12:41Z
dc.descriptionExtending TPG to perform real-valued actions.en_US
dc.description.abstractThe Tangled Program Graph framework (TPG) is a genetic programming approach to reinforcement learning. Canonical TPG is limited to performing discrete actions. This thesis investigates mechanisms by which TPG might perform real-valued actions. Two approaches are proposed. In the first, a decision-making network extracts state from TPG's internal structure. A gradient-based learning method tailors the network to this representation. In the second, TPG is modified to generate a state representation in an external matrix visible to the decision-making network. No additional learning algorithm is used to configure the decision-making network. Instead, TPG adapts to use the default configuration. This thesis applies these approaches to a modified version of the classic CartPole environment that accepts real-valued actions. This enables the comparison between discrete action configurations of the task and the real-valued formulation. Results indicate that there is no additional complexity in TPG solutions under real-valued action versus discrete action configurations.en_US
dc.identifier.urihttp://hdl.handle.net/10222/79675
dc.language.isoenen_US
dc.subjectgenetic programmingen_US
dc.subjectmachine learningen_US
dc.subjectreinforcement learningen_US
dc.titleProviding Real-Valued Actions for Tangled Program Graphs Under the CartPole Benchmarken_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Wright-Matthew-MCSc-August-2020.pdf
Size:
1.68 MB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: