dc.contributor.advisor	Granmo, Ole-Christoffer
dc.contributor.advisor	Nyberg, Svein Olav Glesaaen
dc.contributor.author	Glimsdal, Sondre
dc.date.accessioned	2020-05-20T21:21:53Z
dc.date.available	2020-05-20T21:21:53Z
dc.date.created	2020-05-13T15:32:17Z
dc.date.issued	2020
dc.identifier.citation	Glimsdal, S. (2020). Towards Thompson Sampling for Complex Bayesian Reasoning (Doctoral thesis). University of Agder, Oslo.	en_US
dc.identifier.isbn	978-82-7117-976-2
dc.identifier.issn	1504-9272
dc.identifier.uri	https://hdl.handle.net/11250/2655240
dc.description	Papers III, IV, and VI are not available as part of the dissertation due to copyright.
dc.description.abstract	Thompson Sampling (TS) is a state-of-the-art algorithm for bandit problems set in a Bayesian framework. Both the theoretical foundation and the empirical efficiency of TS are well explored for plain bandit problems. However, the Bayesian underpinning of TS means that it could potentially be applied to other, more complex problems beyond the bandit problem, provided that suitable Bayesian structures can be found. The objective of this thesis is the development and analysis of TS-based schemes for more complex optimization problems, founded on Bayesian reasoning. We address several complex optimization problems where the previous state of the art relies on a relatively myopic perspective on the problem. These include stochastic searching on the line, the Goore game, the knapsack problem, travel time estimation, and equipartitioning. Instead of employing Bayesian reasoning to obtain a solution, the existing schemes rely on carefully engineered rules. In brief, we recast each of these optimization problems in a Bayesian framework and introduce dedicated TS-based solution schemes. For all of the addressed problems, the results show that, besides being more effective, the TS-based approaches we introduce are also capable of solving more adverse versions of the problems, such as those involving stochastic liars.	en_US
dc.language.iso	eng	en_US
dc.publisher	Wittusen & Jensen	en_US
dc.relation.ispartofseries	Doctoral Dissertation at the University of Agder; no. 275
dc.relation.haspart	Paper I: Glimsdal, S. & Granmo, O.-C. (2018). A Bayesian network based solution scheme for the constrained Stochastic On-line Equi-Partitioning Problem. Applied Intelligence, 48, 3735–3747. doi: https://doi.org/10.1007/s10489-018-1172-8. Author's accepted manuscript. Full-text is not available in AURA as a separate file.	en_US
dc.relation.haspart	Paper II: Glimsdal, S. & Granmo, O.-C. (2019). Thompson Sampling Guided Stochastic Searching on the Line for Deceptive Environments with Applications to Root-Finding Problems. Journal of Machine Learning Research, 20(52). http://jmlr.org/papers/v20/18-263.html Published version. Full-text is not available in AURA as a separate file.	en_US
dc.relation.haspart	Paper III: Glimsdal, S. & Granmo, O.-C. (2015). Thompson Sampling Guided Stochastic Searching on the Line for Non-Stationary Adversarial Learning. In 2015 IEEE 14th International Conference on Machine Learning and Applications, 687-692. doi: https://doi.org/10.1109/ICMLA.2015.203. Published version. Full-text is not available in AURA as a separate file.	en_US
dc.relation.haspart	Paper IV: Glimsdal, S. & Granmo, O.-C. (2013). Accelerated Bayesian learning for decentralized two-armed bandit based decision making with applications to the Goore Game. Applied Intelligence, 38, 479–488. doi: https://doi.org/10.1007/s10489-012-0346-z. Published version. Full-text is not available in AURA as a separate file.	en_US
dc.relation.haspart	Paper V: Glimsdal, S. & Granmo, O.-C. (2013). Gaussian Process Based Optimistic Knapsack Sampling with Applications to Stochastic Resource Allocation. In M. Glass, S. Hettiarachchi & R. Finkbine (Eds.), Proceedings of the 24th Midwest Artificial Intelligence and Cognitive Science Conference (1348, p. 43-50). http://ceur-ws.org/Vol-1348/. Published version. Full-text is not available in AURA as a separate file.	en_US
dc.relation.haspart	Paper VI: Glimsdal, S. & Granmo, O.-C. (2019). Thompson Sampling Based Active Learning in Probabilistic Programs with Application to Travel Time Estimation. In F. Wotawa, G. Friedrich, I. Pill, R. Koitz-Hristov & M. Ali (Eds.), Advances and Trends in Artificial Intelligence. From Theory to Practice (11606, p. 71-78). Cham: Springer. doi: https://doi.org/10.1007/978-3-030-22999-3_7. Published version. Full-text is not available in AURA as a separate file.	en_US
dc.title	Towards Thompson Sampling for Complex Bayesian Reasoning	en_US
dc.type	Doctoral thesis	en_US
dc.description.version	publishedVersion	en_US
dc.rights.holder	© 2020 Sondre Glimsdal	en_US
dc.subject.nsi	VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550	en_US
dc.source.pagenumber	169	en_US
dc.source.issue	275	en_US
dc.identifier.cristin	1810830

Towards Thompson Sampling for Complex Bayesian Reasoning
