The low quality of scanned documents prevents one from recognizing the contents of them. However, often there are only a few important words that need to be distinguished. Spotting these words is a complicated task, which can be accomplished by utilizing the underlying document structure. The structure of the documents enables one to tackle the problem with the help of machine learning. This thesis investigates the boundaries of the object detection problem; however, unlike the previous research, we apply the detection pipeline to discover the words in structured documents
Neural networks are a powerful technology for classification of visual inputs arising from documents...
The computer vision and object detection techniques developed in recent years are dominating the sta...
In spite of significant research efforts, the existing scene text detection methods fall short of th...
Manually supervising a million documents yearly becomes an exhaustive task. A step towards automatin...
International audiencePattern spotting consists of searching in a collection of historical document ...
The technology for obtaining information from big data has broad application prospects. Among them, ...
This research contributes to the problem of classifying document images. The main addition of this t...
International audienceDespite recent achievements in handwritten text recognition due to major advan...
Deep learning-based object detection method has been applied in various fields, such as ITS (intelli...
Structured content such as figures, tables, graphs, captions, and other graphical material often cap...
This work researches named entity recognition (NER) with respect to images of documents with a domai...
This work will focus on named entity recognition on documents with a strong layout using deep recurr...
Understanding the contents of handwritten texts from document images has long been a traditional fie...
In the last few years, deep convolutional neural networks have become ubiquitous in computer vision,...
The segmentation of individual words is a crucial step in several data mining methods for historical...
Neural networks are a powerful technology for classification of visual inputs arising from documents...
The computer vision and object detection techniques developed in recent years are dominating the sta...
In spite of significant research efforts, the existing scene text detection methods fall short of th...
Manually supervising a million documents yearly becomes an exhaustive task. A step towards automatin...
International audiencePattern spotting consists of searching in a collection of historical document ...
The technology for obtaining information from big data has broad application prospects. Among them, ...
This research contributes to the problem of classifying document images. The main addition of this t...
International audienceDespite recent achievements in handwritten text recognition due to major advan...
Deep learning-based object detection method has been applied in various fields, such as ITS (intelli...
Structured content such as figures, tables, graphs, captions, and other graphical material often cap...
This work researches named entity recognition (NER) with respect to images of documents with a domai...
This work will focus on named entity recognition on documents with a strong layout using deep recurr...
Understanding the contents of handwritten texts from document images has long been a traditional fie...
In the last few years, deep convolutional neural networks have become ubiquitous in computer vision,...
The segmentation of individual words is a crucial step in several data mining methods for historical...
Neural networks are a powerful technology for classification of visual inputs arising from documents...
The computer vision and object detection techniques developed in recent years are dominating the sta...
In spite of significant research efforts, the existing scene text detection methods fall short of th...