This package provides an evaluation framework, training and test data for semi-automatic recognition of sections of historical diplomatic manuscripts. The data collection consists of 57 Latin charters issued by the Royal Chancellery of 7 different types. Documents were created in the era of John the Blind, King of Bohemia (1310–1346) and Count of Luxembourg. Manuscripts were digitized, transcribed, and typical sections of medieval charters ('corroboratio', 'datatio', 'dispositio', 'inscriptio', 'intitulatio', 'narratio', and 'publicatio') were manually tagged. Manuscripts also contain additional metadata, such as manually marked named entities and short Czech abstracts. Recognition models are first trained using manually marked sections in...
The poster will present the project "Between Composition and Reception: the Authority of Medieval Ch...
This paper addresses the question of objective categories of medieval scripts and their elaboration ...
In Codice Ratio is a research project to study tools and techniques for analyzing the contents of hi...
This package provides an evaluation framework, training and test data for semi-automatic recognition...
International audienceThis paper presents a model aiming to automatically detect sections in medieva...
In this thesis, we present two computer models to structure textual information for large databases ...
TEI XML edition of the original charters and coeval copies written in Tuscany between AD 714 and 774...
Nous présentons dans cette thèse deux modèles informatiques développés pour délivrer de l'informatio...
Annotated dataset for training named entities recognition models for medieval charters in Latin, Fre...
International audienceThe work on the named entity recognition (NER) in databases of historical text...
This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents fr...
This paper seeks to develop a digital diplomatic approach for reducing medieval charters’ documentar...
The objective of this paper is to present a meta-corpus of diplomatic documents entitled Cartae Euro...
This paper present a novel segmentation and handwritten text recognition dataset for Medieval Latin,...
This is the best HTR model for documentary Latin and French manuscripts presented in the paper: Serg...
The poster will present the project "Between Composition and Reception: the Authority of Medieval Ch...
This paper addresses the question of objective categories of medieval scripts and their elaboration ...
In Codice Ratio is a research project to study tools and techniques for analyzing the contents of hi...
This package provides an evaluation framework, training and test data for semi-automatic recognition...
International audienceThis paper presents a model aiming to automatically detect sections in medieva...
In this thesis, we present two computer models to structure textual information for large databases ...
TEI XML edition of the original charters and coeval copies written in Tuscany between AD 714 and 774...
Nous présentons dans cette thèse deux modèles informatiques développés pour délivrer de l'informatio...
Annotated dataset for training named entities recognition models for medieval charters in Latin, Fre...
International audienceThe work on the named entity recognition (NER) in databases of historical text...
This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents fr...
This paper seeks to develop a digital diplomatic approach for reducing medieval charters’ documentar...
The objective of this paper is to present a meta-corpus of diplomatic documents entitled Cartae Euro...
This paper present a novel segmentation and handwritten text recognition dataset for Medieval Latin,...
This is the best HTR model for documentary Latin and French manuscripts presented in the paper: Serg...
The poster will present the project "Between Composition and Reception: the Authority of Medieval Ch...
This paper addresses the question of objective categories of medieval scripts and their elaboration ...
In Codice Ratio is a research project to study tools and techniques for analyzing the contents of hi...