A Comparison of Traversal Strategies for Tangled Program Graphs under the Arcade Learning Environment

Ianta, Alexandru

A Comparison of Traversal Strategies for Tangled Program Graphs under the Arcade Learning Environment

Files

AlexandruIanta2021.pdf (4.51 MB)

Date

2021-04-27T16:24:15Z

Authors

Ianta, Alexandru

Abstract

Tangled program graphs provides a framework for constructing modular genetic programming solutions to visual reinforcement learning tasks. In order to guard against the development of cycles within the resulting graph, and therefore introduce the halting problem, a traversal strategy forbidding the revisiting of vertices was originally assumed. In this thesis an alternative traversal strategy wherein vertex revisits are allowed but edge revisits are not is explored. An empirical study is performed using 20 game titles from the Arcade Learning Environment in order to assess the relative impact of the different traversal strategies on the resulting agent behaviours and underlying graph characteristics. Ultimately both strategies appear to result in behaviours that are statistically very similar. The most notable differences appear in distributions of actions used to reach the same performance.

Keywords

Tangled Program Graphs, Reinforcement Learning, Scalable Research, Genetic Programming, Arcade Learning Environment, Atari

URI

http://hdl.handle.net/10222/80428

Collections

Faculty of Graduate Studies Online Theses

Full item page

A Comparison of Traversal Strategies for Tangled Program Graphs under the Arcade Learning Environment

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections