We address large-scale multilingual multi-word entity (MWEntity) recognition and variant matching. Firstly, we recognise MWEntities in 22 different languages, iden- tify monolingual variant spellings and link equivalent groups of variants across all languages. We then use the previously recognised MWEntities to learn new recog- nition rules based on distributional patterns. Not requiring any linguistic tools, the method is suitable for our highly multilingual environment. When adding the new rules to the original rule-based NER system, F1 performance for Spanish increases from 42.4% to 50% (18% increase) and for English from 43.4% to 44.5% (2.5% in- crease). Besides aiming at turning free text into semi-structured data for search and for ma...
The increasing diversity of languages used on the web introduces a new level of complexity to Inform...
Named Entity Recognition and Classification (NERC) is an important component of applications like Opi...
Recent advances in generating monolingual word embeddings based on word co-occurrence for universal ...
In this chapter, we present our contribution in addressing multi-word entity (MWEntity) recognition ...
This paper reports on an approach and experiments to automatically build a cross-lingual multi-word ...
This paper presents HITS ’ system for mono-lingual and cross-lingual entity linking at TAC 2012. We ...
This paper explores a very basic linguis-tic phenomenon in multilingualism: the lexicalizations of e...
The Entity Linking (EL) task is concerned with linking entity mentions in a text collection with the...
A major challenge in Entity Linking (EL) is making effective use of contextual information to disamb...
We are presenting a mature text analysis application that relies heavily on multilingual Named Entit...
In this paper, we study direct transfer methods for multilingual named entity recognition. Specifica...
Abstract. We present a named-entity recognition (NER) system for parallel multilingual text. Our sys...
Building named entity recognition (NER) models for languages that do not have much training data is ...
To stimulate research in cross-language entity linking, we present a new test collection for evaluat...
International audienceThis paper presents a multilingual system designed to recognize named entities...
The increasing diversity of languages used on the web introduces a new level of complexity to Inform...
Named Entity Recognition and Classification (NERC) is an important component of applications like Opi...
Recent advances in generating monolingual word embeddings based on word co-occurrence for universal ...
In this chapter, we present our contribution in addressing multi-word entity (MWEntity) recognition ...
This paper reports on an approach and experiments to automatically build a cross-lingual multi-word ...
This paper presents HITS ’ system for mono-lingual and cross-lingual entity linking at TAC 2012. We ...
This paper explores a very basic linguis-tic phenomenon in multilingualism: the lexicalizations of e...
The Entity Linking (EL) task is concerned with linking entity mentions in a text collection with the...
A major challenge in Entity Linking (EL) is making effective use of contextual information to disamb...
We are presenting a mature text analysis application that relies heavily on multilingual Named Entit...
In this paper, we study direct transfer methods for multilingual named entity recognition. Specifica...
Abstract. We present a named-entity recognition (NER) system for parallel multilingual text. Our sys...
Building named entity recognition (NER) models for languages that do not have much training data is ...
To stimulate research in cross-language entity linking, we present a new test collection for evaluat...
International audienceThis paper presents a multilingual system designed to recognize named entities...
The increasing diversity of languages used on the web introduces a new level of complexity to Inform...
Named Entity Recognition and Classification (NERC) is an important component of applications like Opi...
Recent advances in generating monolingual word embeddings based on word co-occurrence for universal ...