Inspired by advertising markets, we consider large-scale sequential decision making problems in which a learner must deploy an algorithm to behave optimally under uncertainty. Although many of these problems can be modeled as contextual bandit problems, we argue that the tools and techniques for analyzing bandit problems with large numbers of actions and contexts can be greatly expanded. While convexity and metric-similarity assumptions on the process generating rewards have yielded some algorithms in existing literature, certain types of assumptions that have been fruitful in offline supervised learning settings have yet to even be considered. Notably missing, for example, is any kind of graphical model approach to assuming structured rewa...
Cascading bandits is a natural and popular model that frames the task of learning to rank from Berno...
A lot of software systems today need to make real-time decisions to optimize an objective of interes...
This manuscript deals with the estimation of the optimal rule and its mean reward in a simple ban...
This dissertation focuses on sequential learning and inference under unknown models. In this class o...
We consider online bandit learning in which at every time step, an algorithm h...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
We demonstrate a modification of the algorithm of Dani et al. for the online linear optimization prob...
We study structured multi-armed bandits, which is the problem of online decision-making under uncert...
This thesis considers the multi-armed bandit (MAB) problem, both the traditional bandit feedback and...
The multi-armed bandit (MAB) problem is a widely studied problem in machine learning literature in t...
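The exploration–exploitation tradeoff at the heart of the MAB problem can be illustrated with a minimal sketch of the classic UCB1 index policy. This is not the method of any one abstract above; the arm means and horizon are hypothetical, chosen only to make the behavior visible.

```python
import math
import random

def ucb1(arm_means, horizon, seed=0):
    """Minimal UCB1 sketch on Bernoulli arms with hypothetical true
    means `arm_means`. Pull each arm once, then at step t pick the arm
    maximizing empirical mean + sqrt(2 ln t / n_i)."""
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k      # pulls per arm
    sums = [0.0] * k      # cumulative reward per arm
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1   # initialization: try every arm once
        else:
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    return counts

# With a clear gap between arms, the best arm ends up pulled most often.
counts = ucb1([0.2, 0.5, 0.8], horizon=2000)
```

The index term shrinks as an arm is pulled more, so under-explored arms keep getting sampled until their optimism is exhausted; this is the mechanism behind the logarithmic regret guarantees the abstracts above refer to.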
In performative prediction, the deployment of a predictive model triggers a shift in the data distri...
We study the attainable regret for online linear optimization problems with bandit feedback, where u...
University of Technology Sydney, Faculty of Engineering and Information Technology. The sequential de...