Unsupervised State Representation Learning in Partially Observable Atari Games

Meng, Li; Goodwin, Morten; Yazidi, Anis; Engelstad, Paal E.

dc.contributor.author	Meng, Li
dc.contributor.author	Goodwin, Morten
dc.contributor.author	Yazidi, Anis
dc.contributor.author	Engelstad, Paal E.
dc.date.accessioned	2024-07-30T16:43:53Z
dc.date.available	2024-07-30T16:43:53Z
dc.date.created	2023-11-29T17:58:12Z
dc.date.issued	2023
dc.identifier.citation	Meng, L., Goodwin, M., Yazidi, A. & Engelstad, P. (2023). Unsupervised State Representation Learning in Partially Observable Atari Games. Lecture Notes in Computer Science, 14185, 212-222.	en_US
dc.identifier.isbn	978-3-031-44239-1
dc.identifier.issn	1611-3349
dc.identifier.uri	https://hdl.handle.net/11250/3143771
dc.description	Author's accepted manuscript.	en_US
dc.description	Available from 21/09/2024.
dc.description.abstract	State representation learning aims to capture latent factors of an environment. Although some researchers realize the connections between masked image modeling and contrastive representation learning, the effort is focused on using masks as an augmentation technique to represent the latent generative factors better. Partially observable environments in reinforcement learning have not yet been carefully studied using unsupervised state representation learning methods. In this article, we create an unsupervised state representation learning scheme for partially observable states. We conducted our experiment on a previous Atari 2600 framework designed to evaluate representation learning models. A contrastive method called Spatiotemporal DeepInfomax (ST-DIM) has shown state-of-the-art performance on this benchmark but remains inferior to its supervised counterpart. Our approach improves ST-DIM when the environment is not fully observable and achieves higher F1 scores and accuracy scores than the supervised learning counterpart. The mean accuracy score averaged over categories of our approach is 66%, compared to 38% of supervised learning. The mean F1 score is 64% to 33%. The code can be found on https://github.com/mengli11235/MST_DIM.	en_US
dc.language.iso	eng	en_US
dc.publisher	Springer	en_US
dc.relation.ispartofseries	Lecture Notes in Computer Science; no. 14185
dc.title	Unsupervised State Representation Learning in Partially Observable Atari Games	en_US
dc.type	Journal article	en_US
dc.type	Peer reviewed	en_US
dc.description.version	acceptedVersion	en_US
dc.rights.holder	© 2023 The Author(s)	en_US
dc.subject.nsi	VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550	en_US
dc.source.pagenumber	212-222	en_US
dc.source.volume	14185	en_US
dc.source.journal	Lecture Notes in Computer Science	en_US
dc.identifier.doi	https://doi.org/10.1007/978-3-031-44240-7_21
dc.identifier.cristin	2205633
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: Article.pdf
Størrelse:: 573.7Kb
Format:: PDF

Låst

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel