In this paper, as a first step to an easy and convenient way to access the manuscripts of Atatürk with a word based search engine, the preprocessing of digitalized documents and their line and word segmentation is studied. The techniques that are applied on printed documents may not yield satisfactory results. Due to this fact, more developed techniques are decided to be applied consisting of a technique based on Hough transform [1] for line segmentation and a technique that is based on dealing with skewness of lines for word segmentation. The results, which are acquired through studies that are conducted on the documents provided by Afet İnan and consisting of 30 pages [2], prove to be highly accurate and promising for future researches. ©...
Indexing and searching collections of handwritten archival documents and manuscripts has always been...
In this paper, a word-spotting approach is presented that can help in reading handwritten Arabic Arc...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...
Many researches and historians from all around the world are interested in historical Ottoman archiv...
Indexing large archives of historical manuscripts, like the papers of George Washington, is required...
International audienceWe present in this paper a digitization project of cultural heritage manuscrip...
Abstract. “Character recognition ” refers to the procedure of ‘reading ’ text using a computer, taki...
ACM Computing Classification System (1998): I.7, I.7.5.In this paper an approach to document line se...
In this paper, as a pre-processing part of character segmentation system, an automatic document skew...
International audienceIn this paper we propose a new approach to improve electronic editions of huma...
Alternating horizontal and vertical projection profiles are extracted from nested sub-blocks of scan...
The aim of this thesis is to build and evaluate how a word segmentation algorithm performs when extr...
Line and word segmentation is a key-step in any document image analysis system. It can be used for i...
Intelligent document segmentation can bring electronic browsing within the reach of most users. The ...
The main objective of this thesis is to develop a system to automatically segment and label a variet...
Indexing and searching collections of handwritten archival documents and manuscripts has always been...
In this paper, a word-spotting approach is presented that can help in reading handwritten Arabic Arc...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...
Many researches and historians from all around the world are interested in historical Ottoman archiv...
Indexing large archives of historical manuscripts, like the papers of George Washington, is required...
International audienceWe present in this paper a digitization project of cultural heritage manuscrip...
Abstract. “Character recognition ” refers to the procedure of ‘reading ’ text using a computer, taki...
ACM Computing Classification System (1998): I.7, I.7.5.In this paper an approach to document line se...
In this paper, as a pre-processing part of character segmentation system, an automatic document skew...
International audienceIn this paper we propose a new approach to improve electronic editions of huma...
Alternating horizontal and vertical projection profiles are extracted from nested sub-blocks of scan...
The aim of this thesis is to build and evaluate how a word segmentation algorithm performs when extr...
Line and word segmentation is a key-step in any document image analysis system. It can be used for i...
Intelligent document segmentation can bring electronic browsing within the reach of most users. The ...
The main objective of this thesis is to develop a system to automatically segment and label a variet...
Indexing and searching collections of handwritten archival documents and manuscripts has always been...
In this paper, a word-spotting approach is presented that can help in reading handwritten Arabic Arc...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...