Repository logo
 

EVALUATING SIMPLE REACTIVE AGENTS IN VISUAL REINFORCEMENT LEARNING TASKS

dc.contributor.authorBayer, Caleidgh Grace
dc.contributor.copyright-releaseNot Applicableen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.external-examinern/aen_US
dc.contributor.graduate-coordinatorDr. Michael McAllisteren_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.thesis-readerDr. Xiao Luoen_US
dc.contributor.thesis-readerDr. Garnett Wilsonen_US
dc.contributor.thesis-supervisorDr. Malcolm Heywooden_US
dc.date.accessioned2023-08-25T14:33:34Z
dc.date.available2023-08-25T14:33:34Z
dc.date.defence2023-08-21
dc.date.issued2023-08-25
dc.description.abstractVisual formulations of reinforcement learning tasks are potentially challenging because (1) the state space is large and composed from pixels (so unlikely to be directly correlated with actions), (2) the underlying task might be partially observable despite the high dimensionality, and (3) rewards can be sparse, so do not necessarily discriminate between useful and not useful decisions. In this thesis we compare the classic deep Q-network (a temporal difference reinforcement learning approach) with tangled program graphs (TPG) (a genetic programming approach) under complete and partially observable visual reinforcement learning tasks from ViZDoom. We demonstrate that TPG is particularly effective at imparting structure on the partially observable task (resulting in a general policy for navigating a labyrinth), but is relatively poor at solving a fully observable (aiming) task. Conversely, DQN is very effective when presented with the complete information aiming task, but is unable to discover general solutions to the partially observable navigation task. We attribute these preferences to the different approaches TPG and DQN assume for addressing representation/feature construction versus credit assignment.en_US
dc.identifier.urihttp://hdl.handle.net/10222/82837
dc.language.isoenen_US
dc.subjectreinforcement learningen_US
dc.subjectgenetic programmingen_US
dc.titleEVALUATING SIMPLE REACTIVE AGENTS IN VISUAL REINFORCEMENT LEARNING TASKSen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CaleidghGraceBayer2023.pdf
Size:
145.86 MB
Format:
Adobe Portable Document Format
Description:
Main Article

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: