Document binarization is a fundamental processing step toward Optical Character Recognition (OCR). It aims to separate the foreground text from the document background. In this article, we propose a novel binarization technique that combines local and global approaches using the K-means clustering algorithm. The proposed Hybrid Binarization based on K-means (HBK) performs robust binarization of scanned documents. Several experiments demonstrate that HBK improves binarization quality while minimizing distortion. Moreover, it outperforms several well-known state-of-the-art methods in the OCR evaluation.
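The abstract does not spell out how HBK combines its local and global steps, so the following is only a minimal sketch of the general idea and not the authors' algorithm. It assumes a two-cluster, one-dimensional K-means on pixel intensities, computed once over the whole image (global) and once per tile (local). The names `kmeans_threshold` and `hybrid_binarize`, the tile size `block`, and the `min_contrast` fallback rule are all hypothetical choices made for illustration.

```python
from typing import List, Sequence

Image = List[List[int]]  # grayscale rows, intensities in 0..255


def kmeans_threshold(values: Sequence[int], iters: int = 20) -> float:
    """Two-cluster 1-D K-means; returns the midpoint between the centroids."""
    lo, hi = float(min(values)), float(max(values))
    if lo == hi:  # flat region: there is nothing to separate
        return lo
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        dark = [v for v in values if v <= mid]
        bright = [v for v in values if v > mid]
        new_lo = sum(dark) / len(dark)
        new_hi = sum(bright) / len(bright)
        if new_lo == lo and new_hi == hi:  # centroids stopped moving
            break
        lo, hi = new_lo, new_hi
    return (lo + hi) / 2.0


def hybrid_binarize(img: Image, block: int = 4, min_contrast: int = 40) -> Image:
    """Return a mask with 1 for text (dark) pixels and 0 for background.

    Assumed combination rule (not taken from the paper): a tile with enough
    contrast uses its own local K-means threshold, while a low-contrast tile
    falls back to the global K-means threshold.
    """
    h, w = len(img), len(img[0])
    global_t = kmeans_threshold([p for row in img for p in row])
    out = [[0] * w for _ in range(h)]
    for by in range(0, h, block):
        for bx in range(0, w, block):
            ys = range(by, min(by + block, h))
            xs = range(bx, min(bx + block, w))
            tile = [img[y][x] for y in ys for x in xs]
            if max(tile) - min(tile) >= min_contrast:
                t = kmeans_threshold(tile)
            else:
                t = global_t
            for y in ys:
                for x in xs:
                    out[y][x] = 1 if img[y][x] <= t else 0
    return out
```

On a page with an unevenly lit background, the global threshold alone tends to mark a shaded region entirely as text, whereas the per-tile threshold can still isolate the dark strokes inside it. The fallback to the global threshold keeps flat tiles from being split arbitrarily.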
In this paper, we propose a novel binarization method for document images produced by ca...
In this dissertation, we introduce a set of algorithms for document image processing, which are in...
Several approaches were proposed in order to extract text from scanned documen...
Nowadays, more and more scanned documents are converted into editable electron...
Binarization methods play a central role in document image processing. It is usually performed in th...
Optical Character Recognition (OCR) from document photos taken by cell phones is a challenging task....
One of the key operations during the image preprocessing step in Optical Character Recognition (OCR)...
Optical Character Recognition (OCR) is a process that converts characters ...
Binarization is one of the sub-phases of the preprocessing step of optical character recognition (OCR). Binari...
Document images are usually degraded in the course of photocopying, faxing, printing, or scanning. D...
Binarization of gray scale document images is one of the most important steps in automatic document ...
This paper presents a new adaptive binarization technique for degraded hand-held camera-captured doc...
In document binarization, text is segmented from the background. This is an important step, since th...
In this paper, we propose a novel technique for unsupervised text binarization in handwritten histor...