The design of effective bandit algorithms to learn the optimal price is a task of extraordinary importance in all the settings in which the demand curve is not a priori known and the estimation process takes a long time, as customary, e.g., in e-commerce scenarios. In particular, the adoption of effective pricing algorithms may allow companies to increase their profits dramatically. In this paper, we exploit the structure of the pricing problem in online scenarios to improve the performance of state-of-the-art general-purpose bandit algorithms. More specifically, we make use of the monotonicity of the customer demand curve, which suggests the same behavior of the conversion rates, and we exploit the fact that, in many scenarios, companies h...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this work, we explore an online reinforcement learning problem called the multi-armed bandit for ...
We present a general framework for stochastic online maximization problems with combinatorial feasib...
The design of effective bandit algorithms to learn the optimal price is a task of extraordinary impo...
A lot of software systems today need to make real-time decisions to optimize an objective of interes...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
We consider revenue maximization in online auctions and pricing. A seller sells an identical item in...
According to the main international reports, more pervasive industrial and business-process automati...
We investigate a number of multi-armed bandit problems that model different aspects of online advert...
We consider a price-based network revenue management problem in which a retailer aims to maximize re...
International audienceWe consider online bandit learning in which at every time step, an algorithm h...
In several e-commerce scenarios, pricing long-tail products effectively is a central task for the co...
Online search is a basic online problem. The fact that its optimal deterministic/randomized solution...
We study the multi-armed bandit problems with budget constraint and variable costs (MAB-BV). In this...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this work, we explore an online reinforcement learning problem called the multi-armed bandit for ...
We present a general framework for stochastic online maximization problems with combinatorial feasib...
The design of effective bandit algorithms to learn the optimal price is a task of extraordinary impo...
A lot of software systems today need to make real-time decisions to optimize an objective of interes...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
We consider revenue maximization in online auctions and pricing. A seller sells an identical item in...
According to the main international reports, more pervasive industrial and business-process automati...
We investigate a number of multi-armed bandit problems that model different aspects of online advert...
We consider a price-based network revenue management problem in which a retailer aims to maximize re...
International audienceWe consider online bandit learning in which at every time step, an algorithm h...
In several e-commerce scenarios, pricing long-tail products effectively is a central task for the co...
Online search is a basic online problem. The fact that its optimal deterministic/randomized solution...
We study the multi-armed bandit problems with budget constraint and variable costs (MAB-BV). In this...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
In this work, we explore an online reinforcement learning problem called the multi-armed bandit for ...
We present a general framework for stochastic online maximization problems with combinatorial feasib...