Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in a wealth of domains, including recommendation systems, dynamic pricing, and investment management. On the one hand, the current MAB literature largely seems to focus on the setting where each arm is available to play at each time step, and ignores how agents move between the arms. On the other hand, there is work that takes the movement between arms into account, but this work models the problem as a Markov decision process and applies generic reinforcement learning (RL) algorithms, like Q-learning. This thesis examines an extension of the MAB problem to a setting where the set of available arms at each round depends on which arm was played i...
International audienceWe consider the problem of finding the best arm in a stochastic multi-armed ba...
In the advent of Big Data and Machine Learning, there is a demand for improved decision making in un...
The aim of the research presented in this dissertation is to construct a model for personalised item...
Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in ...
The multi-armed bandit(MAB) problem is a simple yet powerful framework that has been extensively stu...
This thesis investigates a new method to estimate the system norm using reinforcement learning. Give...
This thesis studies several extensions of multi-armed bandit problem, where a learner sequentially s...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
A Multi-Armed Bandits (MAB) is a learning problem where an agent sequentially chooses an action amon...
The multi-armed bandit (MAB) problem is a mathematical formulation of the exploration-exploitation t...
Cataloged from PDF version of article.Thesis (M.S.): Bilkent University, Department of Industrial En...
In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlate...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
In this paper we explore the field of reinforcement learning which has proven to be successful at so...
We study, to the best of our knowledge, the first Bayesian algorithm for unimodal Multi-Armed Bandit...
International audienceWe consider the problem of finding the best arm in a stochastic multi-armed ba...
In the advent of Big Data and Machine Learning, there is a demand for improved decision making in un...
The aim of the research presented in this dissertation is to construct a model for personalised item...
Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in ...
The multi-armed bandit(MAB) problem is a simple yet powerful framework that has been extensively stu...
This thesis investigates a new method to estimate the system norm using reinforcement learning. Give...
This thesis studies several extensions of multi-armed bandit problem, where a learner sequentially s...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
A Multi-Armed Bandits (MAB) is a learning problem where an agent sequentially chooses an action amon...
The multi-armed bandit (MAB) problem is a mathematical formulation of the exploration-exploitation t...
Cataloged from PDF version of article.Thesis (M.S.): Bilkent University, Department of Industrial En...
In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlate...
Abstract—We present a formal model of human decision-making in explore-exploit tasks using the conte...
In this paper we explore the field of reinforcement learning which has proven to be successful at so...
We study, to the best of our knowledge, the first Bayesian algorithm for unimodal Multi-Armed Bandit...
International audienceWe consider the problem of finding the best arm in a stochastic multi-armed ba...
In the advent of Big Data and Machine Learning, there is a demand for improved decision making in un...
The aim of the research presented in this dissertation is to construct a model for personalised item...