Reinforcement Learning Your Way : Agent Characterization through Policy Regularization

Maree, Charl; Omlin, Christian Walter Peter

dc.contributor.author	Maree, Charl
dc.contributor.author	Omlin, Christian Walter Peter
dc.date.accessioned	2023-01-27T14:10:46Z
dc.date.available	2023-01-27T14:10:46Z
dc.date.created	2022-03-29T11:22:20Z
dc.date.issued	2022
dc.identifier.citation	Maree, C. & Omlin, C. W. P. (2022). Reinforcement Learning Your Way : Agent Characterization through Policy Regularization. AI, 3(2), 250-259.	en_US
dc.identifier.issn	2673-2688
dc.identifier.uri	https://hdl.handle.net/11250/3046910
dc.description.abstract	The increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract information from learned policies, thus aiding explainability. These methods rely on empirical observations of the policy, and thus aim to generalize a characterization of agents’ behaviour. In this study, we have instead developed a method to imbue agents’ policies with a characteristic behaviour through regularization of their objective functions. Our method guides the agents’ behaviour during learning, which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers’ investment portfolios based on their spending personalities.	en_US
dc.language.iso	eng	en_US
dc.publisher	MDPI	en_US
dc.rights	Navngivelse 4.0 Internasjonal	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/deed.no	*
dc.title	Reinforcement Learning Your Way : Agent Characterization through Policy Regularization	en_US
dc.type	Journal article	en_US
dc.type	Peer reviewed	en_US
dc.description.version	publishedVersion	en_US
dc.rights.holder	© 2022 The Author(s)	en_US
dc.subject.nsi	VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550	en_US
dc.source.pagenumber	250-259	en_US
dc.source.volume	3	en_US
dc.source.journal	AI	en_US
dc.source.issue	2	en_US
dc.identifier.doi	https://doi.org/10.3390/ai3020015
dc.identifier.cristin	2013258
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: Article.pdf
Størrelse:: 348.4Kb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal