Rapid development of data science technologies have enabled data-driven algorithms for many important operational problems. Existing data-driven solutions often requires the operational environments being stationary. However, recent examples have shown that the operational environments can change dynamically. It is thus imperative to design data-driven algorithms that is capable of working in time-varying environments. We first introduce data-driven decision-making algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary bandit and reinforcement learning settings. These settings capture applications such as advertisement allocation, dynamic pricing, and inventory control in changing environments. Our main contrib...
In several e-commerce scenarios, pricing long-tail products effectively is a central task for the co...
Many well-studied online decision-making and learning models rely on the assumption that the environ...
Traditionally, stochastic models in operations research use specific probabilistic assumptions to mo...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Companies such as Zara and World Co. have recently implemented novel product development processes a...
A lot of software systems today need to make real-time decisions to optimize an objective of interes...
We consider a price-based network revenue management problem in which a retailer aims to maximize re...
Thesis (Ph.D.)--University of Washington, 2018The objective of this study is to propose novel dynami...
For a seller operating in a nonstationary demand setting, a key question is how to collect and filte...
This thesis investigates how sellers in e-commerce can maximize revenue by utilizing dynamic pricing...
The design of effective bandit algorithms to learn the optimal price is a task of extraordinary impo...
Decision makers rely on observations to make better decisions. Hence mastering the interplay between...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
We consider here a single-item lot sizing problem with fixed costs, lead time, and both backorders a...
In several e-commerce scenarios, pricing long-tail products effectively is a central task for the co...
Many well-studied online decision-making and learning models rely on the assumption that the environ...
Traditionally, stochastic models in operations research use specific probabilistic assumptions to mo...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Companies such as Zara and World Co. have recently implemented novel product development processes a...
A lot of software systems today need to make real-time decisions to optimize an objective of interes...
We consider a price-based network revenue management problem in which a retailer aims to maximize re...
Thesis (Ph.D.)--University of Washington, 2018The objective of this study is to propose novel dynami...
For a seller operating in a nonstationary demand setting, a key question is how to collect and filte...
This thesis investigates how sellers in e-commerce can maximize revenue by utilizing dynamic pricing...
The design of effective bandit algorithms to learn the optimal price is a task of extraordinary impo...
Decision makers rely on observations to make better decisions. Hence mastering the interplay between...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
We develop a learning principle and an efficient algorithm for batch learning from logged bandit fee...
We consider here a single-item lot sizing problem with fixed costs, lead time, and both backorders a...
In several e-commerce scenarios, pricing long-tail products effectively is a central task for the co...
Many well-studied online decision-making and learning models rely on the assumption that the environ...
Traditionally, stochastic models in operations research use specific probabilistic assumptions to mo...