Scene Text Recognition (STR) is the problem of recognizing the correct word or character sequence in a cropped word image. To obtain more robust output sequences, the notion of bidirectional STR has been introduced. So far, bidirectional STR has been implemented with two separate decoders: one for left-to-right decoding and one for right-to-left decoding. Having two separate decoders for almost the same task with the same output space is undesirable from a computational and optimization point of view. We introduce the bidirectional Scene Text Transformer (Bi-STET), a novel bidirectional STR method with a single decoder for bidirectional text decoding. With its single decoder, Bi-STET outperforms methods that apply bidirectional decoding by usi...
Scene text recognition is the task of recognizing character sequences in images of natural scenes. T...
Driven by deep learning and a large volume of data, scene text recognition has evolved rapidly in re...
Scene text recognition has inspired great interest from the computer vision community in recent yea...
Leveraging the advances of natural language processing, most recent scene text recognizers adopt an ...
A new approach for real-time scene text recognition is proposed in this paper. A novel binary convol...
Transformers have become popular in building end-to-end automatic speech recognition (ASR) systems. ...
The area of scene text recognition focuses on the problem of recognizing arbitrary text in images of...
Generating sentence descriptions for images is an area of research combining computer vision and nat...
Dominant scene text recognition models commonly contain two building blocks, a visual model for feat...
This paper focuses on improving the recognition of text in images of natural scenes, such as storefr...
Image captioning involves generating a sentence that describes an image. More recently, it has been ...
Although an automated reader for the blind first appeared nearly two hundred years ago, computers ca...
Inspired by speech recognition, recent state-of-the-art algorithms mostly consider scene text recogn...
The real-life scene images exhibit a range of variations in text appearances, including complex shap...
Scene text recognition has brought various new challenges in recent years. Detecting and recognizi...