This paper presents work aimed at the realization of a gold standard for cross-document coreference resolution of person entities in a corpus of Italian news. The gold standard has been created selecting a number of person names occurring in Adige-500K, a corpus composed of all the news stories published by the local newspaper `L`Adige` from 1999 to 2006. The corpus consists of 535,000 news stories, for a total of around 200 million tokens.To sample the person names in the corpus, we identified two dimensions, corresponding to two phenomena we intended to study, namely (i) the fame of the person entities and (ii) the ambiguity of person names. The first version of the gold standard is composed of 209 person names corresponding to 709 entiti...
This paper describes the ICoN corpus, a corpus of academic written Italian, some of the directions o...
none6noopenAntici, Francesco; Bolognini, Luca; Inajetovic, Matteo Antonio; Ivasiuk, Bogdan; Galassi,...
This paper describes a newly created text corpus of news articles that has been annotated for cross-...
This paper describes the News People Search (NePS) Task organized as part of EVALITA 2011. The NePS ...
This paper describes the News People Search (NePS) Task orga-nized as part of EVALITA 2011. The NePS...
This paper presents a scheme for annotating coreference across news articles, extending beyond tradi...
English. This paper presents a system-atic evaluation of linguistic components required to build a c...
In this paper we present work in progress for the creation of the Italian Content Annotation Bank (I...
Nowadays, surfing the Web and looking for persons seems to be one of the most common activities of I...
In this paper we present KIND, an Italian dataset for Named-entity recognition. It contains more tha...
This paper reports on the development and evaluation of an Italian broadcast news corpus at ITC-irst...
none5Interfaccia Web al corpus la Repubblica/SSLMIT, al momento il piu' grande corpus di italiano sc...
Currently, big corpora, coming from Open Government Data projects, in several research areas, allow ...
In this paper we address the problem of first name and last name identification in a news collectio...
A corpus of the Italian local press. This paper introduces CoSIL, a corpus of articles from Italian ...
This paper describes the ICoN corpus, a corpus of academic written Italian, some of the directions o...
none6noopenAntici, Francesco; Bolognini, Luca; Inajetovic, Matteo Antonio; Ivasiuk, Bogdan; Galassi,...
This paper describes a newly created text corpus of news articles that has been annotated for cross-...
This paper describes the News People Search (NePS) Task organized as part of EVALITA 2011. The NePS ...
This paper describes the News People Search (NePS) Task orga-nized as part of EVALITA 2011. The NePS...
This paper presents a scheme for annotating coreference across news articles, extending beyond tradi...
English. This paper presents a system-atic evaluation of linguistic components required to build a c...
In this paper we present work in progress for the creation of the Italian Content Annotation Bank (I...
Nowadays, surfing the Web and looking for persons seems to be one of the most common activities of I...
In this paper we present KIND, an Italian dataset for Named-entity recognition. It contains more tha...
This paper reports on the development and evaluation of an Italian broadcast news corpus at ITC-irst...
none5Interfaccia Web al corpus la Repubblica/SSLMIT, al momento il piu' grande corpus di italiano sc...
Currently, big corpora, coming from Open Government Data projects, in several research areas, allow ...
In this paper we address the problem of first name and last name identification in a news collectio...
A corpus of the Italian local press. This paper introduces CoSIL, a corpus of articles from Italian ...
This paper describes the ICoN corpus, a corpus of academic written Italian, some of the directions o...
none6noopenAntici, Francesco; Bolognini, Luca; Inajetovic, Matteo Antonio; Ivasiuk, Bogdan; Galassi,...
This paper describes a newly created text corpus of news articles that has been annotated for cross-...