Vis enkel innførsel

dc.contributor.advisorGranmo, Ole-Christoffer
dc.contributor.advisorNyberg, Svein Olav Glesaaen
dc.contributor.authorGlimsdal, Sondre
dc.date.accessioned2020-05-20T21:21:53Z
dc.date.available2020-05-20T21:21:53Z
dc.date.created2020-05-13T15:32:17Z
dc.date.issued2020
dc.identifier.citationGlimsdal, S. (2020). Towards Thompson Sampling for Complex Bayesian Reasoning (Doctoral thesis). University of Agder, Oslo.en_US
dc.identifier.isbn978-82-7117-976-2
dc.identifier.issn1504-9272
dc.identifier.urihttps://hdl.handle.net/11250/2655240
dc.descriptionPaper III, IV, and VI are not available as a part of the dissertation due to the copyright.
dc.description.abstractThompson Sampling (TS) is a state-of-art algorithm for bandit problems set in a Bayesian framework. Both the theoretical foundation and the empirical efficiency of TS is wellexplored for plain bandit problems. However, the Bayesian underpinning of TS means that TS could potentially be applied to other, more complex, problems as well, beyond the bandit problem, if suitable Bayesian structures can be found. The objective of this thesis is the development and analysis of TS-based schemes for more complex optimization problems, founded on Bayesian reasoning. We address several complex optimization problems where the previous state-of-art relies on a relatively myopic perspective on the problem. These includes stochastic searching on the line, the Goore game, the knapsack problem, travel time estimation, and equipartitioning. Instead of employing Bayesian reasoning to obtain a solution, they rely on carefully engineered rules. In all brevity, we recast each of these optimization problems in a Bayesian framework, introducing dedicated TS based solution schemes. For all of the addressed problems, the results show that besides being more effective, the TS based approaches we introduce are also capable of solving more adverse versions of the problems, such as dealing with stochastic liars.en_US
dc.language.isoengen_US
dc.publisherWittusen & Jensenen_US
dc.relation.ispartofseriesDoctoral Dissertation at the University of Agder; no. 275
dc.relation.haspartPaper I: Glimsdal, S. & Granmo, O.-C. (2018). A Bayesian network based solution scheme for the constrained Stochastic On-line Equi-Partitioning Problem. Applied Intelligence, 48, 3735–3747. doi: https://doi.org/10.1007/s10489-018-1172-8. Author´s accepted manuscript. Full-text is not available in AURA as a separate file.en_US
dc.relation.haspartPaper II: Glimsdal, S. & Granmo, O.-C. (2019). Thompson Sampling Guided Stochastic Searching on the Line for Deceptive Environments with Applications to Root-Finding Problems. Journal of Machine Learning Research, 20(52). http://jmlr.org/papers/v20/18-263.html Published version. Full-text is not available in AURA as a separate file.en_US
dc.relation.haspartPaper III: Glimsdal, S. & Granmo, O.-C. (2015). Thompson Sampling Guided Stochastic Searching on the Line for Non-Stationary Adversarial Learning. In 2015 IEEE 14th International Conference on Machine Learning and Applications, 687-692. doi: https://doi.org/10.1109/ICMLA.2015.203. Published version. Full-text is not available in AURA as a separate file.en_US
dc.relation.haspartPaper IV: Glimsdal, S. & Granmo, O.-C. (2013). Accelerated Bayesian learning for decentralized two-armed bandit based decision making with applications to the Goore Game. Applied Intelligence, 38, 479–488. doi: https://doi.org/10.1007/s10489-012-0346-z. Published version. Full-text is not available in AURA as a separate file.en_US
dc.relation.haspartPaper V: Glimsdal, S. & Granmo, O.-C. (2013). Gaussian Process Based Optimistic Knapsack Sampling with Applications to Stochastic Resource Allocation. In M. Glass, S. Hettiarachchi & R. Finkbine (Eds.), Proceedings of the 24th Midwest Artificial Intelligence and Cognitive Science Conference (1348, p. 43-50). http://ceur-ws.org/Vol-1348/. Published version. Full-text is not available in AURA as a separate file.en_US
dc.relation.haspartPaper VI: Glimsdal, S. & Granmo, O.-C. (2019). Thompson Sampling Based Active Learning in Probabilistic Programs with Application to Travel Time Estimation. In F. Wotawa, G. Friedrich, I. Pill, R. Koitz-Hristov & M. Ali (Eds), Advances and Trends in Artificial Intelligence. From Theory to Practice (11606, p. 71-78). Cham: Springer. doi: https://doi.org/10.1007/978-3-030-22999-3_7. Published version. Full-text is not available in AURA as a separate file.en_US
dc.titleTowards Thompson Sampling for Complex Bayesian Reasoningen_US
dc.typeDoctoral thesisen_US
dc.description.versionpublishedVersionen_US
dc.rights.holder© 2020 Sondre Glimsdalen_US
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550en_US
dc.source.pagenumber169en_US
dc.source.issue275en_US
dc.identifier.cristin1810830


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel