This paper proposes a general framework to detect unsafe states of a system whose basic realtime param-eters are captured by multi-sensors. Our approach is to learn a danger level function which can be used to alert the users in advance of dangerous situations. The main challenge to this learning problem is the labelling issue, i.e., it is difficult to assign an objective danger level at each time step to the training data, except at the collapse points where a penalty can be assigned and at the successful ends where a certain reward can be assigned. In this paper, we treat the danger level as ex-pected future reward (penalty is regarded as negative reward) and use temporal difference (TD) learning [2] to learn a function to approximate the...
Nowadays, information control systems based on databases develop dynamically worldwide. These system...
To achieve robust autonomy, robots must avoid getting stuck in states from which they cannot recove...
Abstract. We present and analyze monitoring algorithms for a safety fragment of metric temporal logi...
The increased complexity of modern systems necessitates automated anomaly detection methods to detec...
We propose a new approach to value function approximation which combines linear temporal difference ...
We propose a new approach to value function approximation which combines lin-ear temporal difference...
Signal Temporal Logic is a linear-time temporal logic designed for classifying the time-dependent si...
In online monitoring of critical systems, it is important to detect an abnormal behavior as early as...
Accident prevention and system safety are important considerations for many industries, especially l...
Self-learning approaches, such as reinforcement learning, offer new possibilities for autonomous con...
Emerging evidence shows that safety-critical systems are evolving towards operating in uncertain con...
Self-learning approaches, such as reinforcement learning, offer new possibilities for autonomous con...
in Computer Science There has been recent interest in using a class of incremental learning algorith...
Reinforcement learning algorithms discover policies that maximize reward, but do not necessarily gua...
To achieve robust autonomy, robots must avoid getting stuck in states from which they cannot recover...
Nowadays, information control systems based on databases develop dynamically worldwide. These system...
To achieve robust autonomy, robots must avoid getting stuck in states from which they cannot recove...
Abstract. We present and analyze monitoring algorithms for a safety fragment of metric temporal logi...
The increased complexity of modern systems necessitates automated anomaly detection methods to detec...
We propose a new approach to value function approximation which combines linear temporal difference ...
We propose a new approach to value function approximation which combines lin-ear temporal difference...
Signal Temporal Logic is a linear-time temporal logic designed for classifying the time-dependent si...
In online monitoring of critical systems, it is important to detect an abnormal behavior as early as...
Accident prevention and system safety are important considerations for many industries, especially l...
Self-learning approaches, such as reinforcement learning, offer new possibilities for autonomous con...
Emerging evidence shows that safety-critical systems are evolving towards operating in uncertain con...
Self-learning approaches, such as reinforcement learning, offer new possibilities for autonomous con...
in Computer Science There has been recent interest in using a class of incremental learning algorith...
Reinforcement learning algorithms discover policies that maximize reward, but do not necessarily gua...
To achieve robust autonomy, robots must avoid getting stuck in states from which they cannot recover...
Nowadays, information control systems based on databases develop dynamically worldwide. These system...
To achieve robust autonomy, robots must avoid getting stuck in states from which they cannot recove...
Abstract. We present and analyze monitoring algorithms for a safety fragment of metric temporal logi...