On using the theory of regular functions to prove the ε-Optimality of the Continuous Pursuit Learning Automaton

Zhang, Xuan; Granmo, Ole-Christoffer; Oommen, B. John; Jiao, Lei

dc.contributor.author	Zhang, Xuan
dc.contributor.author	Granmo, Ole-Christoffer
dc.contributor.author	Oommen, B. John
dc.contributor.author	Jiao, Lei
dc.date.accessioned	2014-01-17T09:44:49Z
dc.date.available	2014-01-17T09:44:49Z
dc.date.issued	2013
dc.identifier.citation	Zhang, X., Granmo, O.-C., Oommen, B. J., & Jiao, L. (2013). On using the theory of regular functions to prove the ε-Optimality of the Continuous Pursuit Learning Automaton. In M. Ali, T. Bosse, K. Hindriks, M. Hoogendoorn, C. Jonker & J. Treur (Eds.), Recent Trends in Applied Artificial Intelligence (Vol. 7906, pp. 262-271): Springer.	no_NO
dc.identifier.isbn	978-3-642-38576-6
dc.identifier.uri	http://hdl.handle.net/11250/138018
dc.description	Published version of a chapter in the book: Recent Trends in Applied Artificial Intelligence. Also available from the publisher at: http://dx.doi.org/10.1007/978-3-642-38577-3_27	no_NO
dc.description.abstract	There are various families of Learning Automata (LA) such as Fixed Structure, Variable Structure, Discretized etc. Informally, if the environment is stationary, their ε-optimality is defined as their ability to converge to the optimal action with an arbitrarily large probability, if the learning parameter is sufficiently small/large. Of these LA families, Estimator Algorithms (EAs) are certainly the fastest, and within this family, the set of Pursuit algorithms have been considered to be the pioneering schemes. The existing proofs of the ε-optimality of all the reported EAs follow the same fundamental principles. Recently, it has been reported that the previous proofs for the ε-optimality of all the reported EAs have a common flaw. In other words, people have worked with this flawed reasoning for almost three decades. The flaw lies in the condition which apparently supports the so-called “monotonicity” property of the probability of selecting the optimal action, explained in the paper. In this paper, we provide a new method to prove the ε-optimality of the Continuous Pursuit Algorithm (CPA), which was the pioneering EA. The new proof follows the same outline of the previous proofs, but instead of examining the monotonicity property of the action probabilities, it rather examines their submartingale property, and then, unlike the traditional approach, invokes the theory of Regular functions to prove the ε-optimality. We believe that the proof is both unique and pioneering, and that it can form the basis for formally demonstrating the ε-optimality of other EAs.	no_NO
dc.language.iso	eng	no_NO
dc.publisher	Springer	no_NO
dc.relation.ispartofseries	Lecture Notes in Computer Science;7906
dc.subject	pursuit algorithms	no_NO
dc.subject	Continuous Pursuit Algorithm	no_NO
dc.subject	ε-optimality	no_NO
dc.title	On using the theory of regular functions to prove the ε-Optimality of the Continuous Pursuit Learning Automaton	no_NO
dc.type	Chapter	no_NO
dc.type	Peer reviewed	no_NO
dc.subject.nsi	VDP::Mathematics and natural science: 400::Information and communication science: 420::Algorithms and computability theory: 422	no_NO
dc.source.pagenumber	262-271	no_NO
dc.identifier.doi	10.1007/978-3-642-38577-3_27

Tilhørende fil(er)

Filnavn:: Zhang_2013_On.pdf
Størrelse:: 201.4Kb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Scientific Publications in Information and Communication Technology [687]

Vis enkel innførsel