The paper presents a large-scale computational subcategorisation lexicon for several thousand German verbs. The lexical entries were obtained by unsupervised learning in a statistical grammar framework: a German context-free grammar containing frame-predicting grammar rules and information about lexical heads was trained on 18.7 million words of a large German newspaper corpus. We developed a simple methodology to utilise frequency distributions in the lexicalised version of the probabilistic grammar for inducing syntactic verb frame descriptions. The frame definition is variable with respect to the inclusion of prepositional phrase refinement. An evaluation against a manual dictionary justifies the utilisation of the machine-readable lexic...
TreeLex is a subcategorization lexicon of French verbs, automatically extracte...
This article describes a lexical substitution dataset for German. The whole dataset contains 2,040 s...
We present a phonological probabilistic context-free grammar, which describes the word and syllable...
We describe a state-of-the-art automatic system that can acquire subcategorisation frames from raw t...
We present a method of applying a broad-coverage LFG grammar of German in the process of semi-automa...
The paper describes an experiment in inside-outside estimation of a lexicalized probabilistic contex...
This paper presents LexSchem – the first large, fully automatically acquired subcategorization lexic...
This paper describes the integration of corpus-based syntactic subcategorization frames and correlat...
Traditionally, deep, wide-coverage linguistic resources are hand-crafted and their creation is time-...
Manual development of deep linguistic resources is time-consuming and costly and therefore often des...
This paper introduces LexFr, a corpus-based French lexical resource built by a...
In this paper, we reported experiments of unsupervised automatic acquisition of Italian and English ...