This record contains the datasets and models used and produced for the work reported in the paper "Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers" (link). Please cite this paper if you are using the models/datasets or find it relevant to your research: @article{barman_combining_2020, title = {{Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers}}, author = {Raphaël Barman and Maud Ehrmann and Simon Clematide and Sofia Ares Oliveira and Frédéric Kaplan}, journal= {Journal of Data Mining \& Digital Humanities}, volume= {HistoInformatics} DOI = {10.5281/zenodo.4065271}, year = {2021}, url = {https://jdmdh.episciences.org/7097}, } Plea...
Historical newspapers are mirrors of past societies, keeping track of the small and great history an...
A collection of Swedish diachronic word embedding models trained on historical newspaper data Simon...
Embeddins built on 19th century Finnish and Swedish newspapers from Finalnd. Used in the following ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
Mass digitization and the opening of digital libraries gave access to a huge amount of historical ne...
Digitisation projects preserve and make available vast quantities of historical text. Among these, n...
Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for ...
These are the slides from the 2021 Workshop ‘Historical Newspaper Content Mining: findings from the ...
In 2022, it is a common place that digital historical newspapers (DHN) have become increasingly avai...
Paper on mapping texts and combining text-mining and geo-visualization to unlock the research potent...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
impresso. Media Monitoring of the Past is an interdisciplinary research project in which a team of c...
In this age of Big Data this paper describes how digital libraries can apply at large scale innovati...
International audienceNewspapers are documents made of news item and informative articles. They are ...
We introduce the development of the NewsEye resource, a multilingual dataset for named entity recogn...
Historical newspapers are mirrors of past societies, keeping track of the small and great history an...
A collection of Swedish diachronic word embedding models trained on historical newspaper data Simon...
Embeddins built on 19th century Finnish and Swedish newspapers from Finalnd. Used in the following ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
Mass digitization and the opening of digital libraries gave access to a huge amount of historical ne...
Digitisation projects preserve and make available vast quantities of historical text. Among these, n...
Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for ...
These are the slides from the 2021 Workshop ‘Historical Newspaper Content Mining: findings from the ...
In 2022, it is a common place that digital historical newspapers (DHN) have become increasingly avai...
Paper on mapping texts and combining text-mining and geo-visualization to unlock the research potent...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
impresso. Media Monitoring of the Past is an interdisciplinary research project in which a team of c...
In this age of Big Data this paper describes how digital libraries can apply at large scale innovati...
International audienceNewspapers are documents made of news item and informative articles. They are ...
We introduce the development of the NewsEye resource, a multilingual dataset for named entity recogn...
Historical newspapers are mirrors of past societies, keeping track of the small and great history an...
A collection of Swedish diachronic word embedding models trained on historical newspaper data Simon...
Embeddins built on 19th century Finnish and Swedish newspapers from Finalnd. Used in the following ...