© 2020, Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature. Scene text recognition has recently been widely treated as a sequence-to-sequence prediction problem, where traditional fully-connected-LSTM (FC-LSTM) has played a critical role. Owing to the limitation of FC-LSTM, existing methods have to convert 2-D feature maps into 1-D sequential feature vectors, resulting in severe damages of the valuable spatial and structural information of text images. In this paper, we argue that scene text recognition is essentially a spatiotemporal prediction problem for its 2-D image inputs, and propose a convolution LSTM (ConvLSTM)-based scene text recognizer, namely, FACLSTM, i.e., focused attention ConvLSTM, where the spati...
Recognizing texts in images plays an important role in many applications, such as industrial intelli...
Humans have a remarkable ability to quickly discern regions con-taining text from other noisy region...
Text detection and recognition in natural images have long been considered as two separate tasks tha...
The real-life scene images exhibit a range of variations in text appearances, including complex shap...
Inspired by speech recognition, recent state-of-the-art algorithms mostly consider scene text recogn...
Connectionist temporal classification (CTC) is a favored decoder in scene text recognition (STR) for...
In spite of significant research efforts, the existing scene text detection methods fall short of th...
Connectionist Temporal Classification (CTC) and attention mechanism are two main approaches used in ...
Driven by deep learning and a large volume of data, scene text recognition has evolved rapidly in re...
We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labell...
Dominant scene text recognition models commonly contain two building blocks, a visual model for feat...
In image scene, text contains high-level of important information that helps to analyze and consider...
The scene Text Recognition process has become a hot research topic and a challenging task owing to t...
Scene Text Detection (STD) is critical for obtaining textual information from natural scenes, servin...
International audienceThe current trend in object detection and localization is to learn predictions...
Recognizing texts in images plays an important role in many applications, such as industrial intelli...
Humans have a remarkable ability to quickly discern regions con-taining text from other noisy region...
Text detection and recognition in natural images have long been considered as two separate tasks tha...
The real-life scene images exhibit a range of variations in text appearances, including complex shap...
Inspired by speech recognition, recent state-of-the-art algorithms mostly consider scene text recogn...
Connectionist temporal classification (CTC) is a favored decoder in scene text recognition (STR) for...
In spite of significant research efforts, the existing scene text detection methods fall short of th...
Connectionist Temporal Classification (CTC) and attention mechanism are two main approaches used in ...
Driven by deep learning and a large volume of data, scene text recognition has evolved rapidly in re...
We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labell...
Dominant scene text recognition models commonly contain two building blocks, a visual model for feat...
In image scene, text contains high-level of important information that helps to analyze and consider...
The scene Text Recognition process has become a hot research topic and a challenging task owing to t...
Scene Text Detection (STD) is critical for obtaining textual information from natural scenes, servin...
International audienceThe current trend in object detection and localization is to learn predictions...
Recognizing texts in images plays an important role in many applications, such as industrial intelli...
Humans have a remarkable ability to quickly discern regions con-taining text from other noisy region...
Text detection and recognition in natural images have long been considered as two separate tasks tha...