A novel reinforcement learning-based sensor scan optimisation scheme is presented for the purpose of multi-target tracking and threat evaluation from helicopter platforms. Reinforcement learn-ing is an unsupervised learning technique that has been shown to be effective in highly dynamic and noisy environments. The problem is made suitable for the use of reinforcement learning by its casting into a “sensor scheduling ” framework. An innovative action ex-ploration policy utilising a Gibbs distribution is shown to improve agent performance over a more conventional random action selec-tion policy. The efficiency of the proposed architecture in terms of the prioritisation of targets is illustrated via simulation examples. 1
Searching indoor environments in the presence of unknown obstacles with multiple UAV agents remains ...
International audienceOwing to the advantages of Unmanned Aerial Vehicle (UAV), such as the extendib...
The main purpose of this paper is to explore and investigate the role of deep reinforcement learning...
Publication in the conference proceedings of EUSIPCO, Lausanne, Switzerland, 200
Cyber-physical systems, such as mobile robots, must respond adaptively to dynamic operating conditio...
In this thesis we present the implementation of a coordinated decision-making agent for emergency re...
Deep reinforcement learning has advanced signifi-cantly in recent years, and it is now used in embed...
This work addresses the problem of scheduling the resources of agile sensors. We advocate an informa...
Reinforcement learning is the problem of autonomously learning a policy guided only by a reward func...
This thesis formulates a stochastic scheduler for use in adaptive resource management of a single Gr...
While autonomous mobile robots used to be built for domain specific tasks in factories or similar sa...
In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversar...
Due to the flexibility and ease of control, unmanned aerial vehicles (UAVs) have been increasingly u...
In this paper, we present an algorithm called Learning-based Preferential Surveillance Algorithm (LP...
International audienceThis paper considers the problem of multi-target detection for massive multipl...
Searching indoor environments in the presence of unknown obstacles with multiple UAV agents remains ...
International audienceOwing to the advantages of Unmanned Aerial Vehicle (UAV), such as the extendib...
The main purpose of this paper is to explore and investigate the role of deep reinforcement learning...
Publication in the conference proceedings of EUSIPCO, Lausanne, Switzerland, 200
Cyber-physical systems, such as mobile robots, must respond adaptively to dynamic operating conditio...
In this thesis we present the implementation of a coordinated decision-making agent for emergency re...
Deep reinforcement learning has advanced signifi-cantly in recent years, and it is now used in embed...
This work addresses the problem of scheduling the resources of agile sensors. We advocate an informa...
Reinforcement learning is the problem of autonomously learning a policy guided only by a reward func...
This thesis formulates a stochastic scheduler for use in adaptive resource management of a single Gr...
While autonomous mobile robots used to be built for domain specific tasks in factories or similar sa...
In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversar...
Due to the flexibility and ease of control, unmanned aerial vehicles (UAVs) have been increasingly u...
In this paper, we present an algorithm called Learning-based Preferential Surveillance Algorithm (LP...
International audienceThis paper considers the problem of multi-target detection for massive multipl...
Searching indoor environments in the presence of unknown obstacles with multiple UAV agents remains ...
International audienceOwing to the advantages of Unmanned Aerial Vehicle (UAV), such as the extendib...
The main purpose of this paper is to explore and investigate the role of deep reinforcement learning...