in this paper we present a novel clustering technique for compound words. By mapping compounds onto their semantic heads, the technique is able to estimate n-gram probabilities for unseen compounds. We argue that compounds are well represented by their heads which allows the clustering of rare words and reduces the risk of over-generalization. The semantic heads arc obtained by a two-step process which consists of constituent generation and best head selection based on corpus statistics. Experiments on Dutch read speech show that our technique is capable of correctly identifying compounds and their semantic heads with a precision of 80.25% and a recall of 85.97%. A class-based language model with compound-head clusters achieves a significan...
This article describes the first attempt to semantically analyse Dutch noun-noun compounds using the...
In order to achieve the long-range goal of semantic interpretation of noun compounds, it is often ...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...
in this paper we present a novel clustering technique for compound words. By mapping compounds onto ...
© 2015 IEEE. Compounding is one of the most productive word formation processes in many languages an...
Compounding is one of the most productive word formation processes in many languages and is therefor...
Abstract. We present an approach for knowledge-free and unsuper-vised recognition of compound nouns ...
This research addresses the problem of clustering the results of brainstorm sessions. Going through ...
Pelemans J., Van hamme H., Wambacq P., ''Translation-based word clustering for language models'', Bo...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurr...
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
Coreference resolution is the task of determining whether two noun phrases refer to the same entity ...
In this paper, we introduce a new similarity measure between words, and a graph-based word clusterin...
This article describes the first attempt to semantically analyse Dutch noun-noun compounds using the...
In order to achieve the long-range goal of semantic interpretation of noun compounds, it is often ...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...
in this paper we present a novel clustering technique for compound words. By mapping compounds onto ...
© 2015 IEEE. Compounding is one of the most productive word formation processes in many languages an...
Compounding is one of the most productive word formation processes in many languages and is therefor...
Abstract. We present an approach for knowledge-free and unsuper-vised recognition of compound nouns ...
This research addresses the problem of clustering the results of brainstorm sessions. Going through ...
Pelemans J., Van hamme H., Wambacq P., ''Translation-based word clustering for language models'', Bo...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurr...
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
Coreference resolution is the task of determining whether two noun phrases refer to the same entity ...
In this paper, we introduce a new similarity measure between words, and a graph-based word clusterin...
This article describes the first attempt to semantically analyse Dutch noun-noun compounds using the...
In order to achieve the long-range goal of semantic interpretation of noun compounds, it is often ...
International audienceWe applied different clustering algorithms to the task of clus- tering multi-w...