This thesis is concerned with the study of sequential decision problems motivated by the challenge of selecting questions to give to students in an online educational environment. In online education there is the potential to develop personalized and adaptive learning environments, where students can receive individualized sequences of questions which update as the student is observed to be struggling or flourishing. In order to achieve this personalization, we must learn about how good each question is, while simultaneously giving students good questions. Multi-armed bandits are a popular technique for sequential decision making under uncertainty. Due to their online nature and their ability to balance the trade-off between exploitation an...
In this thesis we study sequential decision making through the lens of Online Learning. Online Learn...
International audienceLearning from prior tasks and transferring that experience to improve future p...
In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlate...
Sequential decision-making is a natural model for machine learning applications where the learner mu...
University of Technology Sydney. Faculty of Engineering and Information Technology.The sequential de...
This dissertation focuses on sequential learning and inference under unknown models. In this class o...
One of the goals of Artificial Intelligence (AI) is to enable multiple agents to interact, co-ordina...
In a bandit problem there is a set of arms, each of which when played by an agent yields some reward...
We present an approach to Intelligent Tutoring Systems which adaptively personalizes sequences of le...
Many applications can be modeled as follows: an agent is given access to several distributions and s...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this dissertation, we study several Markovian problems of optimal sequential decisions by focusin...
Bayes sequential decision problems are an extensive problem class with wide application. They involv...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Designing an adaptive sequence of exercises in Intelligent Tutoring Systems (ITS) requiresto charact...
In this thesis we study sequential decision making through the lens of Online Learning. Online Learn...
International audienceLearning from prior tasks and transferring that experience to improve future p...
In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlate...
Sequential decision-making is a natural model for machine learning applications where the learner mu...
University of Technology Sydney. Faculty of Engineering and Information Technology.The sequential de...
This dissertation focuses on sequential learning and inference under unknown models. In this class o...
One of the goals of Artificial Intelligence (AI) is to enable multiple agents to interact, co-ordina...
In a bandit problem there is a set of arms, each of which when played by an agent yields some reward...
We present an approach to Intelligent Tutoring Systems which adaptively personalizes sequences of le...
Many applications can be modeled as follows: an agent is given access to several distributions and s...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this dissertation, we study several Markovian problems of optimal sequential decisions by focusin...
Bayes sequential decision problems are an extensive problem class with wide application. They involv...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Designing an adaptive sequence of exercises in Intelligent Tutoring Systems (ITS) requiresto charact...
In this thesis we study sequential decision making through the lens of Online Learning. Online Learn...
International audienceLearning from prior tasks and transferring that experience to improve future p...
In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlate...