Recent work has shown the promise of creating generalist, transformer-based, models for language, vision, and sequential decision-making problems. To create such models, we generally require centralized training objectives, data, and compute. It is of interest if we can more flexibly create generalist policies by merging together multiple, task-specific, individually trained policies. In this work, we take a preliminary step in this direction through merging, or averaging, subsets of Decision Transformers in parameter space trained on different MuJoCo locomotion problems, forming multi-task models without centralized training. We also demonstrate the importance of various methodological choices when merging policies, such as utilizing commo...
The central contribution of this thesis is providing a reliable framework and algorithms to make ro...
Our situated environment is full of uncertainty and highly dynamic, thus hindering the widespread ad...
The final publication is available at link.springer.comIn the context of assistive robotics, robots ...
Humans utilise a large diversity of control and reasoning methods to solve different robot manipula...
Policy gradient algorithms have shown consider-able recent success in solving high-dimensional seque...
This work focuses on generating multiple coordinated motor skills for intelligent systems and studie...
Large scale machine learning lies at the core of many artificial intelligence’s recent successes exe...
Abstract — Learning policies that generalize across multiple tasks is an important and challenging r...
The grand aim of having a single robot that can manipulate arbitrary objects in diverse settings is ...
Moving from narrow robots specializing in specific tasks to generalist robots excelling in multiple ...
Learning provides a powerful tool for vision-based navigation, but the capabilities of learning-base...
Recent work suggests that transformer models are capable of multi-task learning on diverse NLP tasks...
Dynamical System (DS) based Learning from Demonstration (LfD) allows learning of reactive motion pol...
Averaging the parameters of models that have the same architecture and initialization can provide a ...
In this work, we study rapid, step-wise improvements of the loss in transformers when being confront...
The central contribution of this thesis is providing a reliable framework and algorithms to make ro...
Our situated environment is full of uncertainty and highly dynamic, thus hindering the widespread ad...
The final publication is available at link.springer.comIn the context of assistive robotics, robots ...
Humans utilise a large diversity of control and reasoning methods to solve different robot manipula...
Policy gradient algorithms have shown consider-able recent success in solving high-dimensional seque...
This work focuses on generating multiple coordinated motor skills for intelligent systems and studie...
Large scale machine learning lies at the core of many artificial intelligence’s recent successes exe...
Abstract — Learning policies that generalize across multiple tasks is an important and challenging r...
The grand aim of having a single robot that can manipulate arbitrary objects in diverse settings is ...
Moving from narrow robots specializing in specific tasks to generalist robots excelling in multiple ...
Learning provides a powerful tool for vision-based navigation, but the capabilities of learning-base...
Recent work suggests that transformer models are capable of multi-task learning on diverse NLP tasks...
Dynamical System (DS) based Learning from Demonstration (LfD) allows learning of reactive motion pol...
Averaging the parameters of models that have the same architecture and initialization can provide a ...
In this work, we study rapid, step-wise improvements of the loss in transformers when being confront...
The central contribution of this thesis is providing a reliable framework and algorithms to make ro...
Our situated environment is full of uncertainty and highly dynamic, thus hindering the widespread ad...
The final publication is available at link.springer.comIn the context of assistive robotics, robots ...