In this paper we present an event aggregation strategy to convert the output of an event camera into frames processable by traditional Computer Vision algorithms. The proposed method first generates sequences of intermediate binary representations, which are then losslessly transformed into a compact format by simply applying a binary-to-decimal conversion. This strategy allows us to encode temporal information directly into pixel values, which are then interpreted by deep learning models. We apply our strategy, called Temporal Binary Representation, to the task of Gesture Recognition, obtaining state of the art results on the popular DVS128 Gesture Dataset. To underline the effectiveness of the proposed method compared to existing ones, we...
Recently developed event cameras demonstrate increasing potential in computer vision applications. T...
As an important branch of video analysis, human action recognition has attracted extensive research ...
As research in computer vision has shifted from only processing sin-gle, static images to the manipu...
In this paper we present an event aggregation strategy to convert the output of an event camera into...
International audienceThe interest in action and gesture recognition has grown considerably in the l...
A reduced version of this paper appeared appeared in the Proceedings of 12th IEEE International Conf...
© 2015 IEEE. In this paper we present a method to capture video-wide temporal information for action...
Automatic action and gesture recognition research field has growth in interest over the last few yea...
MasterThis thesis proposes the mixed temporal kernel depthwise-separable convolution network that ex...
Recent studies have demonstrated the power of recurrent neural networks for machine translation, ima...
Today, a frame-based camera is the sensor of choice for machine vision applications. However, these ...
This paper contributes a new boosting paradigm to achieve detection of events in video. Previous boo...
From wearable devices to depth cameras, researchers have exploited various multimodal data to recogn...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
The use of pose estimation for human action recognition has seen a resurgence in previous years, due...
Recently developed event cameras demonstrate increasing potential in computer vision applications. T...
As an important branch of video analysis, human action recognition has attracted extensive research ...
As research in computer vision has shifted from only processing sin-gle, static images to the manipu...
In this paper we present an event aggregation strategy to convert the output of an event camera into...
International audienceThe interest in action and gesture recognition has grown considerably in the l...
A reduced version of this paper appeared appeared in the Proceedings of 12th IEEE International Conf...
© 2015 IEEE. In this paper we present a method to capture video-wide temporal information for action...
Automatic action and gesture recognition research field has growth in interest over the last few yea...
MasterThis thesis proposes the mixed temporal kernel depthwise-separable convolution network that ex...
Recent studies have demonstrated the power of recurrent neural networks for machine translation, ima...
Today, a frame-based camera is the sensor of choice for machine vision applications. However, these ...
This paper contributes a new boosting paradigm to achieve detection of events in video. Previous boo...
From wearable devices to depth cameras, researchers have exploited various multimodal data to recogn...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
The use of pose estimation for human action recognition has seen a resurgence in previous years, due...
Recently developed event cameras demonstrate increasing potential in computer vision applications. T...
As an important branch of video analysis, human action recognition has attracted extensive research ...
As research in computer vision has shifted from only processing sin-gle, static images to the manipu...