We are interested in the problem of utilizing collected data to inform and direct learning towards a stated goal. In this work, a controller is presented with a finite set of actions that may be sequentially (and repeatedly) taken towards the achievement of some goal. While the outcome of any action is stochastic, the result provides information about future results of that action, and potentially others. By following a rule or control policy, the controller wishes to sequentially take actions, collect information, and utilize it towards future action decisions, in such a way as to approach the stated goal. In the first model, at least one action is `best', and the goal is to identify and take such an action as frequently as possible. Th...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
Summary The paper outlines some basic principles of geometric and nonasymp-totic theory of learning ...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
The principles of statistical mechanics and information theory play an important role in learning an...
Sequential decision-making is an iterative process between a learning agent and an environment. We s...
Institute of Perception, Action and BehaviourRecently there has been a good deal of interest in usin...
We introduce a class of learning problems where the agent is presented with a series of tasks. Intui...
Data-driven approaches to the design of control policies for robotic systems have the potential to r...
Learning to make good choices in a probabilistic environment requires that the Decision Maker resolv...
This paper considers a problem of optimal learning by experimentation by a single decisionmaker. Mos...
The MDP formalism and its variants are usually used to control the state of a system through an agen...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
The MDP formalism and its variants are usually used to control the state of a system through an agen...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
Summary The paper outlines some basic principles of geometric and nonasymp-totic theory of learning ...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
The principles of statistical mechanics and information theory play an important role in learning an...
Sequential decision-making is an iterative process between a learning agent and an environment. We s...
Institute of Perception, Action and BehaviourRecently there has been a good deal of interest in usin...
We introduce a class of learning problems where the agent is presented with a series of tasks. Intui...
Data-driven approaches to the design of control policies for robotic systems have the potential to r...
Learning to make good choices in a probabilistic environment requires that the Decision Maker resolv...
This paper considers a problem of optimal learning by experimentation by a single decisionmaker. Mos...
The MDP formalism and its variants are usually used to control the state of a system through an agen...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
The MDP formalism and its variants are usually used to control the state of a system through an agen...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
We consider the problem of sequential estimation of a random parameter under a controlled setting. U...
Summary The paper outlines some basic principles of geometric and nonasymp-totic theory of learning ...