The cold-start problem has attracted extensive attention among various online services that provide personalized rec-ommendation. Many online vendors employ contextual ban-dit strategies to tackle the so-called exploration/exploitation dilemma rooted from the cold-start problem. However, due to high-dimensional user/item features and the underlying characteristics of bandit policies, it is often difficult for ser-vice providers to obtain and deploy an appropriate algorithm to achieve acceptable and robust economic profit. In this paper, we explore ensemble strategies of contextual bandit algorithms to obtain robust predicted click-through rate (CTR) of web objects. The ensemble is acquired by aggregating different pulling policies of bandit...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
We perform online interactive recommendation via a factorization-based bandit algorithm. Low-rank ma...
Classical collaborative filtering, and content-based filtering methods try to learn a static recomme...
Machine Learning algorithms play an active role in modern day business activities and have been put ...
This work presents an extension of Thompson Sampling bandit policy for orchestrating the collection ...
News recommendation is a field different from traditional recommendation fields. News articles are c...
Multi-armed bandit (MAB) problem is derived from slot machines in the casino. It is about how a gamb...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
In our ongoing work we extend the Thompson Sampling (TS) bandit policy for orchestrating the collect...
This work presents an extension of Thompson Sampling bandit policy for orchestrating the collection ...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Machine and Statistical Learning techniques are used in almost all online advertisement systems. The...
Recommender system has been a persistent research goal for decades, which aims at recommending suita...
Traditional recommendation systems (RecSys) suffer from two problems: the exploitation-exploration d...
We study recommendation in scenarios where there's no prior information about the quality of content...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
We perform online interactive recommendation via a factorization-based bandit algorithm. Low-rank ma...
Classical collaborative filtering, and content-based filtering methods try to learn a static recomme...
Machine Learning algorithms play an active role in modern day business activities and have been put ...
This work presents an extension of Thompson Sampling bandit policy for orchestrating the collection ...
News recommendation is a field different from traditional recommendation fields. News articles are c...
Multi-armed bandit (MAB) problem is derived from slot machines in the casino. It is about how a gamb...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
In our ongoing work we extend the Thompson Sampling (TS) bandit policy for orchestrating the collect...
This work presents an extension of Thompson Sampling bandit policy for orchestrating the collection ...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Machine and Statistical Learning techniques are used in almost all online advertisement systems. The...
Recommender system has been a persistent research goal for decades, which aims at recommending suita...
Traditional recommendation systems (RecSys) suffer from two problems: the exploitation-exploration d...
We study recommendation in scenarios where there's no prior information about the quality of content...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
We perform online interactive recommendation via a factorization-based bandit algorithm. Low-rank ma...
Classical collaborative filtering, and content-based filtering methods try to learn a static recomme...