Vis enkel innførsel

dc.contributor.authorZhang, Xuan
dc.contributor.authorGranmo, Ole-Christoffer
dc.contributor.authorOommen, B. John
dc.contributor.authorJiao, Lei
dc.date.accessioned2014-01-17T09:44:49Z
dc.date.available2014-01-17T09:44:49Z
dc.date.issued2013
dc.identifier.citationZhang, X., Granmo, O.-C., Oommen, B. J., & Jiao, L. (2013). On using the theory of regular functions to prove the ε-Optimality of the Continuous Pursuit Learning Automaton. In M. Ali, T. Bosse, K. Hindriks, M. Hoogendoorn, C. Jonker & J. Treur (Eds.), Recent Trends in Applied Artificial Intelligence (Vol. 7906, pp. 262-271): Springer.no_NO
dc.identifier.isbn978-3-642-38576-6
dc.identifier.urihttp://hdl.handle.net/11250/138018
dc.descriptionPublished version of a chapter in the book: Recent Trends in Applied Artificial Intelligence. Also available from the publisher at: http://dx.doi.org/10.1007/978-3-642-38577-3_27no_NO
dc.description.abstractThere are various families of Learning Automata (LA) such as Fixed Structure, Variable Structure, Discretized etc. Informally, if the environment is stationary, their ε-optimality is defined as their ability to converge to the optimal action with an arbitrarily large probability, if the learning parameter is sufficiently small/large. Of these LA families, Estimator Algorithms (EAs) are certainly the fastest, and within this family, the set of Pursuit algorithms have been considered to be the pioneering schemes. The existing proofs of the ε-optimality of all the reported EAs follow the same fundamental principles. Recently, it has been reported that the previous proofs for the ε-optimality of all the reported EAs have a common flaw. In other words, people have worked with this flawed reasoning for almost three decades. The flaw lies in the condition which apparently supports the so-called “monotonicity” property of the probability of selecting the optimal action, explained in the paper. In this paper, we provide a new method to prove the ε-optimality of the Continuous Pursuit Algorithm (CPA), which was the pioneering EA. The new proof follows the same outline of the previous proofs, but instead of examining the monotonicity property of the action probabilities, it rather examines their submartingale property, and then, unlike the traditional approach, invokes the theory of Regular functions to prove the ε-optimality. We believe that the proof is both unique and pioneering, and that it can form the basis for formally demonstrating the ε-optimality of other EAs.no_NO
dc.language.isoengno_NO
dc.publisherSpringerno_NO
dc.relation.ispartofseriesLecture Notes in Computer Science;7906
dc.subjectpursuit algorithmsno_NO
dc.subjectContinuous Pursuit Algorithmno_NO
dc.subjectε-optimalityno_NO
dc.titleOn using the theory of regular functions to prove the ε-Optimality of the Continuous Pursuit Learning Automatonno_NO
dc.typeChapterno_NO
dc.typePeer reviewedno_NO
dc.subject.nsiVDP::Mathematics and natural science: 400::Information and communication science: 420::Algorithms and computability theory: 422no_NO
dc.source.pagenumber262-271no_NO
dc.identifier.doi10.1007/978-3-642-38577-3_27


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel