Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

Maree, Charl; Omlin, Christian Walter Peter

Journal article, Peer reviewed

Published version

URI

https://hdl.handle.net/11250/3046910

Date

2022

Original version

Maree, C. & Omlin, C. W. P. (2022). Reinforcement Learning Your Way: Agent Characterization through Policy Regularization. AI, 3(2), 250-259. https://doi.org/10.3390/ai3020015

Abstract

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract information from learned policies, thus aiding explainability. These methods rely on empirical observations of the policy, and thus aim to generalize a characterization of agents’ behaviour. In this study, we have instead developed a method to imbue agents’ policies with a characteristic behaviour through regularization of their objective functions. Our method guides the agents’ behaviour during learning, which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers’ investment portfolios based on their spending personalities.
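The idea of regularizing an agent's objective function can be illustrated with a minimal sketch. The abstract does not state the form of the regularization term, so everything below is an assumption made for illustration: a discrete action space, a hypothetical "prior" action distribution that encodes the desired characteristic behaviour, a KL-divergence penalty, and a hypothetical weight `lam`. It is not the authors' formulation.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same actions.

    Assumes q is strictly positive wherever p is positive.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def regularized_objective(expected_return, policy_probs, prior_probs, lam=1.0):
    """Task return minus a penalty for deviating from the desired behaviour.

    expected_return: the agent's ordinary RL objective (assumed given).
    policy_probs:    the policy's action distribution in some state.
    prior_probs:     hypothetical distribution encoding the desired character.
    lam:             hypothetical weight trading return against character.
    """
    return expected_return - lam * kl_divergence(policy_probs, prior_probs)


# A policy that matches the prior pays no penalty; one that deviates does.
prior = [0.7, 0.2, 0.1]
matching = regularized_objective(1.0, [0.7, 0.2, 0.1], prior)
deviating = regularized_objective(1.0, [0.1, 0.2, 0.7], prior)
```

Under these assumptions, maximizing the regularized objective pulls the policy toward the prior during learning, so the resulting behaviour is characterized by construction rather than inferred post hoc.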

Publisher

MDPI

Journal

AI

Copyright

Except where otherwise noted, this item's license is described as Attribution 4.0 International (CC BY 4.0)