The segmentation of individual words is a crucial step in several data mining methods for historical handwritten documents. Examples of applications include visual searching for query words (word spotting) and character-by-character text recognition. In this paper, we present a novel method for word segmentation that is adapted from recent advances in computer vision, deep learning and generic object detection. Our method has unique capabilities and has found practical use in our current research project. It can easily be trained for different kinds of historical documents, uses full grayscale information, and requires neither binarization as pre-processing nor prior segmentation of individual text lines. We evaluate its performance using es...
Today, work with historical manuscripts is nearly exclusively done manually, by researchers in the h...
Computerized analysis of handwritten documents is an active research area in image analysis and comp...
An algorithm for segmenting unconstrained printed and cursive words is proposed. The algorithm initi...
The aim of this thesis is to build and evaluate how a word segmentation algorithm performs when extr...
Historical manuscripts are the main source of information about the past. In recent years, digitization ...
Word spotting on degraded and noisy historical documents can become a challenging task considering t...
Indexing large archives of historical manuscripts, like the papers of George Washington, is required...
This paper presents novel results for word spotting based on dynamic time warping applied to medieva...
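As a rough illustration of the dynamic time warping idea named in the abstract above, the following Python sketch aligns two one-dimensional feature sequences and uses the resulting distance to rank candidate words against a query. It is a textbook formulation, not the cited paper's method: the function names `dtw_distance` and `rank_candidates`, the choice of one scalar feature per image column, and the sample profiles are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Textbook dynamic time warping distance between two 1-D sequences."""
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b[j-1] is reused
                cost[i][j - 1],      # b advances, a[i-1] is reused
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def rank_candidates(query, candidates):
    """Return candidate names sorted by increasing DTW distance to the query.

    `candidates` maps a (hypothetical) word label to its feature sequence.
    """
    return sorted(candidates, key=lambda name: dtw_distance(query, candidates[name]))


if __name__ == "__main__":
    # Hypothetical column-wise profiles (e.g. ink density per pixel column).
    query = [0, 2, 5, 2, 0]
    candidates = {
        "stretched_match": [0, 2, 2, 5, 5, 2, 0],
        "different_word": [4, 4, 0, 0, 4, 4],
    }
    print(rank_candidates(query, candidates))
```

Because the warping path may repeat elements of either sequence, a horizontally stretched rendering of the same profile can still reach a low distance, which is the property that makes this kind of alignment attractive for comparing handwritten words of varying width.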
Indexing and searching collections of handwritten archival documents and manuscripts has always been...
Many libraries, museums, and other organizations contain large collections of handwritten historical...
“Character recognition” refers to the procedure of ‘reading’ text using a computer, taki...
A new intelligent segmentation technique is proposed that can be used in conjunction with a neural c...