Brådland, Terje; Norheim, Thomas (Master thesis, 2009)
The two-armed bandit problem is a classical optimization problem where a player sequentially
selects and pulls one of two arms attached to a gambling machine, and each arm pull results in
either a reward or penalty to ...