Video image recognition has been extensively studied with rapid progress recently. However, most methods focus on short-term rather than long-term (contextual) video recognition. Convolutional recurrent neural networks (ConvRNNs) provide robust spatio-temporal information processing capabilities for contextual video recognition, but require extensive computation that slows down training. Inspired by normalization and detrending methods, in this paper we propose "adaptive detrending" (AD) for temporal normalization in order to accelerate the training of ConvRNNs, especially of convolutional gated recurrent unit (ConvGRU). For each neuron in a recurrent neural network (RNN), AD identifies the trending change within a sequence and subtracts it...
Convolutional Neural Networks (CNNs) have been es-tablished as a powerful class of models for image ...
Human action recognition in videos is an important task with a broad range of applications. In this ...
Scene labeling is a technique that consist on giving a label to every pixel in an im-age according t...
The temporal events in video sequences often have long-term dependencies which are difficult to be h...
Applying convolutional neural networks to large images is computationally ex-pensive because the amo...
The amount of video content generated increases daily, three hundred hours of video content is uploa...
Recurrent neural network (RNN) telah mencapai kesuksesan dalam memproses data sekuensial dan menjadi...
Convolutional neural network(CNN) models have been extensively used in recent years to solve the pro...
© 2016 IEEE. Recently, deep learning approach, especially deep Convolutional Neural Networks (ConvNe...
This work proposes and compare two different approaches for real-time human action recognition (HAR)...
This paper addresses the moving objects segmentation in videos, i.e. Background Subtraction (BGS) us...
In this paper, we present contextual relationship-based learning model using deep neural network for...
Violent behavior recognition is an important direction of behavior recognition research. For traditi...
Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-r...
In this paper, we propose a novel temporal spiking recurrent neural network (TSRNN) to perform robus...
Convolutional Neural Networks (CNNs) have been es-tablished as a powerful class of models for image ...
Human action recognition in videos is an important task with a broad range of applications. In this ...
Scene labeling is a technique that consist on giving a label to every pixel in an im-age according t...
The temporal events in video sequences often have long-term dependencies which are difficult to be h...
Applying convolutional neural networks to large images is computationally ex-pensive because the amo...
The amount of video content generated increases daily, three hundred hours of video content is uploa...
Recurrent neural network (RNN) telah mencapai kesuksesan dalam memproses data sekuensial dan menjadi...
Convolutional neural network(CNN) models have been extensively used in recent years to solve the pro...
© 2016 IEEE. Recently, deep learning approach, especially deep Convolutional Neural Networks (ConvNe...
This work proposes and compare two different approaches for real-time human action recognition (HAR)...
This paper addresses the moving objects segmentation in videos, i.e. Background Subtraction (BGS) us...
In this paper, we present contextual relationship-based learning model using deep neural network for...
Violent behavior recognition is an important direction of behavior recognition research. For traditi...
Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-r...
In this paper, we propose a novel temporal spiking recurrent neural network (TSRNN) to perform robus...
Convolutional Neural Networks (CNNs) have been es-tablished as a powerful class of models for image ...
Human action recognition in videos is an important task with a broad range of applications. In this ...
Scene labeling is a technique that consist on giving a label to every pixel in an im-age according t...