We study the problem of using causal models to improve the rate at which good interventions can be learned online in a stochastic environment. Our formalism combines multi-arm bandits and causal inference to model a novel type of bandit feedback that is not exploited by existing approaches. We propose a new algorithm that exploits the causal feedback and prove a bound on its simple regret that is strictly better (in all quantities) than algorithms that do not use the additional causal informatio
Interactive systems that interact with and learn from user behavior are ubiquitous today. Machine le...
We consider a collaborative online learning paradigm, wherein a group of agents connected through a ...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
Modern learning systems like recommendation engines, computational advertising systems, online param...
Bandits and Markov Decision Processes are powerful sequential decision making paradigms that have be...
In combinatorial causal bandits (CCB), the learning agent chooses at most K variables in each round ...
Causal inference provides a set of principles and tools that allows one to combine data and knowledg...
In this thesis, we propose to use Causal Models, which play a central role in dealing with uncertain...
Causal knowledge is sought after throughout data-driven fields due to its explanatory power and pote...
We study the problem of determining the best intervention in a Causal Bayesian Network (CBN) specifi...
Many open problems in machine learning are intrinsically related to causality, however, the use of c...
Recent work has discussed the limitations of counterfactual explanations to recommend actions for al...
We explore relationships between machine learning (ML) and causal inference. We focus on improvement...
Interactive systems that interact with and learn from user behavior are ubiquitous today. Machine le...
We consider the problem of learning to play a repeated multi-agent game with an unknown reward funct...
Interactive systems that interact with and learn from user behavior are ubiquitous today. Machine le...
We consider a collaborative online learning paradigm, wherein a group of agents connected through a ...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
Modern learning systems like recommendation engines, computational advertising systems, online param...
Bandits and Markov Decision Processes are powerful sequential decision making paradigms that have be...
In combinatorial causal bandits (CCB), the learning agent chooses at most K variables in each round ...
Causal inference provides a set of principles and tools that allows one to combine data and knowledg...
In this thesis, we propose to use Causal Models, which play a central role in dealing with uncertain...
Causal knowledge is sought after throughout data-driven fields due to its explanatory power and pote...
We study the problem of determining the best intervention in a Causal Bayesian Network (CBN) specifi...
Many open problems in machine learning are intrinsically related to causality, however, the use of c...
Recent work has discussed the limitations of counterfactual explanations to recommend actions for al...
We explore relationships between machine learning (ML) and causal inference. We focus on improvement...
Interactive systems that interact with and learn from user behavior are ubiquitous today. Machine le...
We consider the problem of learning to play a repeated multi-agent game with an unknown reward funct...
Interactive systems that interact with and learn from user behavior are ubiquitous today. Machine le...
We consider a collaborative online learning paradigm, wherein a group of agents connected through a ...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...