This dataset is a subset of 596 documents from the Registre d'Hipoteques de Girona of 1769 collection, guarded by the Arxiu Històric de Girona. This collection, is composed by hundreds of thousands of notarial deeds from the XVIII-XIX century (1768-1862). Sales, redemption of censuses, inheritance and matrimonial chapters are among the most common documentary typologies in the collection. This dataset is composed of more than 23700 text lines written by a single hand, covering more that 50 different topics (documentary typologies) and a vocabulary of more than 2400 different words. The documents are transcribed using the so-called diplomatic criteria. Additionally, transcripts were tagged with extra enriching/complementary information (e....
Este trabajo constituye la primera parte de uno más amplio cuyo objetivo es analizar lingüísticament...
This repository contains the dataset of the article "Towards a general open dataset and models for l...
This package provides an evaluation framework, training and test data for semi-automatic recognition...
Annotation of digitized pages from historical document collections is very important to research on ...
The transformation of social, political and administrative models all along the Mediterranean medie...
Este trabajo es una colección diplomática de los documentos relativos a la población de Portugalete ...
The Rodrigo corpus was obtained from the digitisation of the book “Historia de España del arçobispo ...
2 data files; 1 documentation fileThe author of this dataset will provide a detailed treatment of th...
""© Owner/Author 2014. This is the author's version of the work. It is posted here for your personal...
The National Archives of the Netherlands and the Noord-Hollands Archief started a colloboration with...
Version 0.0.1 Full Changelog: https://github.com/HTRomance-Project/middle-ages-in-spain/commits/v0.0...
© ACM 2013. This is the author's version of the work. It is posted here for your personal use. Not ...
This training dataset includes a total of 34,913 manually transcribed text segments. It is dedicated...
613 p.Podemos considerar Ponferrada (León), villa de realengo y cabeza de partido judicial del mismo...
[EN] In this final degree project, the realization of both a historical and technical study and the...
Este trabajo constituye la primera parte de uno más amplio cuyo objetivo es analizar lingüísticament...
This repository contains the dataset of the article "Towards a general open dataset and models for l...
This package provides an evaluation framework, training and test data for semi-automatic recognition...
Annotation of digitized pages from historical document collections is very important to research on ...
The transformation of social, political and administrative models all along the Mediterranean medie...
Este trabajo es una colección diplomática de los documentos relativos a la población de Portugalete ...
The Rodrigo corpus was obtained from the digitisation of the book “Historia de España del arçobispo ...
2 data files; 1 documentation fileThe author of this dataset will provide a detailed treatment of th...
""© Owner/Author 2014. This is the author's version of the work. It is posted here for your personal...
The National Archives of the Netherlands and the Noord-Hollands Archief started a colloboration with...
Version 0.0.1 Full Changelog: https://github.com/HTRomance-Project/middle-ages-in-spain/commits/v0.0...
© ACM 2013. This is the author's version of the work. It is posted here for your personal use. Not ...
This training dataset includes a total of 34,913 manually transcribed text segments. It is dedicated...
613 p.Podemos considerar Ponferrada (León), villa de realengo y cabeza de partido judicial del mismo...
[EN] In this final degree project, the realization of both a historical and technical study and the...
Este trabajo constituye la primera parte de uno más amplio cuyo objetivo es analizar lingüísticament...
This repository contains the dataset of the article "Towards a general open dataset and models for l...
This package provides an evaluation framework, training and test data for semi-automatic recognition...