The Hierarchical Discrete Learning Automaton Suitable for Environments with Many Actions and High Accuracy Requirements

Omslandseter, Rebekka Olsson; Jiao, Lei; Zhang, Xuan; Yazidi, Anis; Oommen, John

dc.contributor.author	Omslandseter, Rebekka Olsson
dc.contributor.author	Jiao, Lei
dc.contributor.author	Zhang, Xuan
dc.contributor.author	Yazidi, Anis
dc.contributor.author	Oommen, John
dc.date.accessioned	2023-05-25T12:25:37Z
dc.date.available	2023-05-25T12:25:37Z
dc.date.created	2023-01-10T12:32:54Z
dc.date.issued	2022
dc.identifier.citation	Omslandseter, R. O., Jiao, L., Zhang, X., Yazidi, A. & Oommen, J. (2022). The Hierarchical Discrete Learning Automaton Suitable for Environments with Many Actions and High Accuracy Requirements. Lecture Notes in Computer Science (LNCS), 13151, 507-518.	en_US
dc.identifier.issn	1611-3349
dc.identifier.uri	https://hdl.handle.net/11250/3069016
dc.description	Author's accepted manuscript	en_US
dc.description.abstract	Since its early beginning, the paradigm of Learning Automata (LA), has attracted much interest. Over the last decades, new concepts and various improvements have been introduced to increase the LA’s speed and accuracy, including employing probability updating functions, discretizing the probability space, and implementing the “Pursuit” concept. The concept of incorporating “structure” into the ordering of the LA’s actions is one of the latest advancements to the field, leading to the ϵ-optimal Hierarchical Continuous Pursuit LA (HCPA) that has superior performance to other LA variants when the number of actions is large. Although the previously proposed HCPA is powerful, its speed has a handicap when the required action probability of an action is approaching unity. The reason for this slow convergence is that the learning parameter operates in a multiplicative manner within the probability space, making the increment of the action probability smaller as its probability becomes close to unity. Therefore, we propose the novel Hierarchical Discrete Learning Automata (HDPA) in this paper, which does not possess the same impediment as the HCPA. The proposed machine infuse the principle of discretization into the action probability vector’s updating functionality, where this type of updating is invoked recursively at every depth within a hierarchical tree structure and we pursue the best estimated action in all iterations through utilization of the Estimator phenomenon. The proposed machine is ϵ-optimal, and our experimental results demonstrate that the number of iterations required before convergence is significantly reduced for the HDPA, when compared with the HCPA.	en_US
dc.language.iso	eng	en_US
dc.publisher	Springer	en_US
dc.title	The Hierarchical Discrete Learning Automaton Suitable for Environments with Many Actions and High Accuracy Requirements	en_US
dc.type	Peer reviewed	en_US
dc.type	Journal article	en_US
dc.description.version	acceptedVersion	en_US
dc.rights.holder	© 2022 Springer Nature Switzerland AG	en_US
dc.subject.nsi	VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550	en_US
dc.source.pagenumber	507-518	en_US
dc.source.volume	13151	en_US
dc.source.journal	Lecture Notes in Computer Science (LNCS)	en_US
dc.identifier.doi	https://doi.org/10.1007/978-3-030-97546-3_41
dc.identifier.cristin	2104034
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: Article.pdf
Størrelse:: 527.4Kb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel