The idea of temporal abstraction, i.e. learning, planning and representing the world at multiple time scales, has been a constant thread in AI research, spanning sub-fields from classical planning and search to control and reinforcement learning. For example, programming a robot typically involves making decisions over a set of controllers, rather than working at the level of motor torques. While temporal abstraction is a very natural concept, learning such abstractions with no human input has proved quite daunting. In this paper, we present a general architecture, called option-critic, which allows learning temporal abstractions automatically, end-to-end, simply from the agent’s experience. This approach allows continual learning and provi...
Humans use prior knowledge to efficiently solve novel tasks, but how they structure past knowledge d...
We present a new method for automatically creating useful temporal abstractions in reinforcement lea...
We present a new method for automatically creating useful temporal abstractions in reinforcement lea...
Temporal abstraction is a key idea in decision making that is seen as crucial for creating artificia...
Throughout this thesis, I develop the idea that the problem of learning good temporal abstractions i...
Decision making usually involves choosing among different courses of action over a broad range of ti...
The ability to create and to use abstractions in complex environments, that is, to systematically ig...
Learning temporal abstractions which are partial solutions to a task and could be reused for other s...
Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While pla...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
Reasoning at multiple levels of temporal abstraction is one of the key attributes of intelligence. I...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
Humans use prior knowledge to efficiently solve novel tasks, but how they structure past knowledge d...
We present a new method for automatically creating useful temporal abstractions in reinforcement lea...
We present a new method for automatically creating useful temporal abstractions in reinforcement lea...
Temporal abstraction is a key idea in decision making that is seen as crucial for creating artificia...
Throughout this thesis, I develop the idea that the problem of learning good temporal abstractions i...
Decision making usually involves choosing among different courses of action over a broad range of ti...
The ability to create and to use abstractions in complex environments, that is, to systematically ig...
Learning temporal abstractions which are partial solutions to a task and could be reused for other s...
Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While pla...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, l...
Reasoning at multiple levels of temporal abstraction is one of the key attributes of intelligence. I...
AbstractLearning, planning, and representing knowledge at multiple levels of temporal abstraction ar...
Humans use prior knowledge to efficiently solve novel tasks, but how they structure past knowledge d...
We present a new method for automatically creating useful temporal abstractions in reinforcement lea...
We present a new method for automatically creating useful temporal abstractions in reinforcement lea...