Analysis and recognition of historical documents faces many challenges, one of which is the scarcity of the ground truth data needed for most machine learning techniques, deep learning in particular. In this paper, we present a novel approach which significantly augments the word image samples generated from an existing dataset of Khmer ancient palm leaf manuscripts. Instead of segmenting real Khmer words, we combine the annotated glyphs into groups called sub-syllabes. A new text recognition method is also proposed to take into account the spatially complex structure of Khmer writing. The proposed method is compoused of two main modules: a feature generator and a decoder. The generator utilizes convolutional blocks, inception blocks, and a...
Abstract: Palm leaf manuscripts were one of the earliest forms of written media and were used in So...
Text line segmentation is one of the most essential pre-processing steps in character recognition an...
The paper addresses the automation of the task of an epigraphist in reading and deciphering inscript...
This paper presents methods for two historical document analysis tasks on digitized Khmer palm leaf ...
Analysis of ancient Khmer documents can be quite challenging due to the elaborated shape of Khmer ha...
Palm leaves have been used as one of the major sources of writing and painting in many Southeast Asi...
This paper presents a comprehensive test of the principal tasks in document image analysis (DIA), st...
Khmer inscriptions are primary sources of information on the history of Cambodia. Weathering,polluti...
The motivation of this study is to develop a compact offline recognition model for Khmer handwritten...
Images of Historical Vietnamese stone engravings provide historians with a unique opportunity to stu...
Historical manuscripts are the main source of information about past. In recent years, digitization ...
Images of historical Vietnamese steles allow historians to discover invaluable information regarding...
The collection of palm leaf manuscripts is an important part of Southeast Asian people’s culture and...
The current state of the art for automatic transcription of historical manuscripts is typically limi...
Recognizing the content in the ancient inscriptions unlocks many gateways to the undiscovered histo...
Abstract: Palm leaf manuscripts were one of the earliest forms of written media and were used in So...
Text line segmentation is one of the most essential pre-processing steps in character recognition an...
The paper addresses the automation of the task of an epigraphist in reading and deciphering inscript...
This paper presents methods for two historical document analysis tasks on digitized Khmer palm leaf ...
Analysis of ancient Khmer documents can be quite challenging due to the elaborated shape of Khmer ha...
Palm leaves have been used as one of the major sources of writing and painting in many Southeast Asi...
This paper presents a comprehensive test of the principal tasks in document image analysis (DIA), st...
Khmer inscriptions are primary sources of information on the history of Cambodia. Weathering,polluti...
The motivation of this study is to develop a compact offline recognition model for Khmer handwritten...
Images of Historical Vietnamese stone engravings provide historians with a unique opportunity to stu...
Historical manuscripts are the main source of information about past. In recent years, digitization ...
Images of historical Vietnamese steles allow historians to discover invaluable information regarding...
The collection of palm leaf manuscripts is an important part of Southeast Asian people’s culture and...
The current state of the art for automatic transcription of historical manuscripts is typically limi...
Recognizing the content in the ancient inscriptions unlocks many gateways to the undiscovered histo...
Abstract: Palm leaf manuscripts were one of the earliest forms of written media and were used in So...
Text line segmentation is one of the most essential pre-processing steps in character recognition an...
The paper addresses the automation of the task of an epigraphist in reading and deciphering inscript...