Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation

Wang, Yingfei
Ouyang, Hua
Wang, Chu
Chen, Jianhui
Asamov, Tsvetan
Chang, Yi

Publication date

February 2017

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Abstract

Multi-Armed Bandit (MAB) framework has been successfully applied in many web applications. However, many complex real-world applications that involve multiple content recommendations cannot fit into the traditional MAB setting. To address this issue, we consider an ordered combinatorial semi-bandit problem where the learner recommends S actions from a base set of K actions, and displays the results in S (out of M) different positions. The aim is to maximize the cumulative reward with respect to the best possible subset and positions in hindsight. By the adaptation of a minimum-cost maximum-flow network, a practical algorithm based on Thompson sampling is derived for the (contextual) combinatorial problem, thus resolving the problem of compu...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation

Abstract

Extracted data

Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation

Abstract

Extracted data

Related items

Related items