This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection where two researchers annotated a portion of it manually for evaluation purposes. In this dataset, we provide the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians, allowing, for instance, comparative analyses between parishes and regions or to calculate the area of influence of a locality. The paper describes the creation and evaluation of the corpus, discusses its applications and limitati...
After decades of massive digitisation, an unprecedented amount of historical documents is available ...
This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places...
Recent years have seen an important increase of digitization projects in the cultural heritage domai...
This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese h...
The Parish Memories with Named Entities dataset consists of 366 transcribed texts from the original ...
This paper reviews a stage of the process of annotating named entities in 18th-century texts to enri...
Extracting data and knowledge dispersed along Portuguese old medical records is important especially...
The amount of information preserved in Portuguese archives has increased over the years. These docum...
Although spanning thousands of years and genres as diverse as liturgy, historiography, lyric and oth...
To accelerate the annotation of named entities (NEs) in historical newspapers like Sarawak Gazette,...
International audienceThe work on the named entity recognition (NER) in databases of historical text...
Recognition and identification of real-world entities is at the core of virtually any text mining ap...
The amount of information present in Portuguese archives has been increasing exponentially over the...
The field of Spatial Humanities has advanced substantially in the past years. The identification and...
At the moment, the vast majority of Portuguese archives with an online presence use a software solu...
After decades of massive digitisation, an unprecedented amount of historical documents is available ...
This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places...
Recent years have seen an important increase of digitization projects in the cultural heritage domai...
This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese h...
The Parish Memories with Named Entities dataset consists of 366 transcribed texts from the original ...
This paper reviews a stage of the process of annotating named entities in 18th-century texts to enri...
Extracting data and knowledge dispersed along Portuguese old medical records is important especially...
The amount of information preserved in Portuguese archives has increased over the years. These docum...
Although spanning thousands of years and genres as diverse as liturgy, historiography, lyric and oth...
To accelerate the annotation of named entities (NEs) in historical newspapers like Sarawak Gazette,...
International audienceThe work on the named entity recognition (NER) in databases of historical text...
Recognition and identification of real-world entities is at the core of virtually any text mining ap...
The amount of information present in Portuguese archives has been increasing exponentially over the...
The field of Spatial Humanities has advanced substantially in the past years. The identification and...
At the moment, the vast majority of Portuguese archives with an online presence use a software solu...
After decades of massive digitisation, an unprecedented amount of historical documents is available ...
This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places...
Recent years have seen an important increase of digitization projects in the cultural heritage domai...