Vis enkel innførsel

dc.contributor.authorMaree, Charl
dc.contributor.authorOmlin, Christian Walter Peter
dc.date.accessioned2023-01-27T14:10:46Z
dc.date.available2023-01-27T14:10:46Z
dc.date.created2022-03-29T11:22:20Z
dc.date.issued2022
dc.identifier.citationMaree, C. & Omlin, C. W. P. (2022). Reinforcement Learning Your Way : Agent Characterization through Policy Regularization. AI, 3(2), 250-259.en_US
dc.identifier.issn2673-2688
dc.identifier.urihttps://hdl.handle.net/11250/3046910
dc.description.abstractThe increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract information from learned policies, thus aiding explainability. These methods rely on empirical observations of the policy, and thus aim to generalize a characterization of agents’ behaviour. In this study, we have instead developed a method to imbue agents’ policies with a characteristic behaviour through regularization of their objective functions. Our method guides the agents’ behaviour during learning, which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers’ investment portfolios based on their spending personalities.en_US
dc.language.isoengen_US
dc.publisherMDPIen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleReinforcement Learning Your Way : Agent Characterization through Policy Regularizationen_US
dc.typeJournal articleen_US
dc.typePeer revieweden_US
dc.description.versionpublishedVersionen_US
dc.rights.holder© 2022 The Author(s)en_US
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550en_US
dc.source.pagenumber250-259en_US
dc.source.volume3en_US
dc.source.journalAIen_US
dc.source.issue2en_US
dc.identifier.doihttps://doi.org/10.3390/ai3020015
dc.identifier.cristin2013258
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal