Contextual combinatorial cascading bandit ( $C^{3}$ -bandit) is a powerful multi-armed bandit framework that balances the tradeoff between exploration and exploitation in the learning process. It well captures users’ click behavior and has been applied in a broad spectrum of real-world applications such as recommender systems and search engines. However, such a framework does not provide a performance guarantee of the initial exploration phase. To that end, we propose conservative contextual combinatorial cascading bandit ( $C^{4}$ -bandit) model, aiming to address the aforementioned crucial modeling issues. In this problem, the learning agent is given some contexts and recommends a list of items not worse than the baseline strategy,...
A standard assumption in contextual multi-arm bandit is that the true context is perfectly known bef...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
The data explosion and development of artificial intelligence (AI) has fueled the demand for recomme...
In many fields such as digital marketing, healthcare, finance, and robotics, it is common to have a ...
In this paper, we study a family of conservative bandit problems (CBPs) with sample-path reward cons...
We present a new algorithm for the contextual bandit learning problem, where the learner repeat-edly...
We present a new algorithm for the contextual bandit learning problem, where the learner repeatedly ...
Upper confidence bound (UCB) based contextual bandit algorithms require one to know the tail propert...
Cataloged from PDF version of article.Thesis (M.S.): Bilkent University, Department of Industrial En...
International audienceContextual bandit algorithms are essential for solving many real-world interac...
Abstract—The contextual bandit problem is typically used to model online applications such as articl...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
The bandit problem models a sequential decision process between a player and an environment. In the ...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
© 2019 Neural information processing systems foundation. All rights reserved. In the classical conte...
A standard assumption in contextual multi-arm bandit is that the true context is perfectly known bef...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
The data explosion and development of artificial intelligence (AI) has fueled the demand for recomme...
In many fields such as digital marketing, healthcare, finance, and robotics, it is common to have a ...
In this paper, we study a family of conservative bandit problems (CBPs) with sample-path reward cons...
We present a new algorithm for the contextual bandit learning problem, where the learner repeat-edly...
We present a new algorithm for the contextual bandit learning problem, where the learner repeatedly ...
Upper confidence bound (UCB) based contextual bandit algorithms require one to know the tail propert...
Cataloged from PDF version of article.Thesis (M.S.): Bilkent University, Department of Industrial En...
International audienceContextual bandit algorithms are essential for solving many real-world interac...
Abstract—The contextual bandit problem is typically used to model online applications such as articl...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
The bandit problem models a sequential decision process between a player and an environment. In the ...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
© 2019 Neural information processing systems foundation. All rights reserved. In the classical conte...
A standard assumption in contextual multi-arm bandit is that the true context is perfectly known bef...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
The data explosion and development of artificial intelligence (AI) has fueled the demand for recomme...