Many well-studied online decision-making and learning models rely on the assumption that the environment remains intact and unaffected by the agent’s actions. This assumption, however, can be considered somewhat unrealistic for several real-life applications. In recommendation systems, for instance, the preferences of a particular user (and, thus, the “best” products to recommend) can change over time. This change can occur either naturally (e.g., the user discovers a new category of desired products) or as a response to past recommendations (e.g., incessantly recommended products can be perceived as “spam” and, thus, be ignored in the future). In this dissertation, we consider generalizations of multi-armed bandit and optimal-stopping sett...
We consider a learning problem where the decision maker interacts with a standard Markov decision pr...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
We study the online learning model: a widely applicable model for making repeated choices in an inte...
Many well-studied online decision-making and learning models rely on the assumption that the environ...
We describe and study a model for an Automated Online Recommendation System (AORS) in which a user's...
This dissertation considers a problem of online learning and online decision making where an agent o...
We introduce and study a partial-information model of online learning, where a decision maker repeat...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
Abstract. We present and study a partial-information model of online learning, where a decision make...
We introduce and study a partial-information model of online learning, where a decision maker repeat...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In a bandit problem there is a set of arms, each of which when played by an agent yields some reward...
AI and machine learning methods are increasingly interacting with and seeking information from peopl...
Recommender system has been a persistent research goal for decades, which aims at recommending suita...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
We consider a learning problem where the decision maker interacts with a standard Markov decision pr...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
We study the online learning model: a widely applicable model for making repeated choices in an inte...
Many well-studied online decision-making and learning models rely on the assumption that the environ...
We describe and study a model for an Automated Online Recommendation System (AORS) in which a user's...
This dissertation considers a problem of online learning and online decision making where an agent o...
We introduce and study a partial-information model of online learning, where a decision maker repeat...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
Abstract. We present and study a partial-information model of online learning, where a decision make...
We introduce and study a partial-information model of online learning, where a decision maker repeat...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In a bandit problem there is a set of arms, each of which when played by an agent yields some reward...
AI and machine learning methods are increasingly interacting with and seeking information from peopl...
Recommender system has been a persistent research goal for decades, which aims at recommending suita...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
We consider a learning problem where the decision maker interacts with a standard Markov decision pr...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
We study the online learning model: a widely applicable model for making repeated choices in an inte...