We report a number of computational experiments in supervised learning whose goal is to automatically classify a set of verbs into lexical semantic classes, based on frequency distribution approximations of grammatical features extracted from a very large annotated corpus. Distributions of five syntactic features that approximate transitivity alternations and thematic role assignments are sufficient to reduce error rate by 56% over chance. We conclude that corpus data is a usable repository of verb class information, and that corpusdriven extraction of grammatical features is a promising methodology for automatic lexical acquisition. 1 Introduction Recent years have witnessed a shift in grammar development methodology, from...
Ambiguity resolution in the parsing of natural language requires a vast repository of knowledge to g...
We cluster verbs into lexical semantic classes, using a general set of noisy features that cap-ture ...
Adult knowledge of a language involves correctly balancing lexically-based and more language-general...
We apply machine learning techniques to classify automatically a set of verbs into lexical semanti...
We give a report on a detailed study of automatic lexical disambiguation of 30 sample English verbs....
The paper presents a large-scale computational subcategorisation lexicon for several thousand German...
This paper demonstrates how unsupervised techniques can be used to learn models of deep linguistic s...
Lexical semantic classes of verbs play an important role in structuring complex predicate informatio...
grantor: University of TorontoLexical semantic classes incorporate both syntactic and sema...
Database ' The coding is based on the Bank of English corpus at COBUILD and uses a simple notat...
Abstract We present an unsupervised method for inducing verb classes from verb uses in gigaword corp...
Advances in representation learning have enabled natural language processing models to derive non-ne...
In this paper, I discuss some methodological problems arising from the use of corpus data for semant...
We use children's noun learning as a probe into the nature of their syntactic prediction mechanism a...
Recognising the grammatical categories of words is a necessary skill for the acquisition of syntax a...
Ambiguity resolution in the parsing of natural language requires a vast repository of knowledge to g...
We cluster verbs into lexical semantic classes, using a general set of noisy features that cap-ture ...
Adult knowledge of a language involves correctly balancing lexically-based and more language-general...
We apply machine learning techniques to classify automatically a set of verbs into lexical semanti...
We give a report on a detailed study of automatic lexical disambiguation of 30 sample English verbs....
The paper presents a large-scale computational subcategorisation lexicon for several thousand German...
This paper demonstrates how unsupervised techniques can be used to learn models of deep linguistic s...
Lexical semantic classes of verbs play an important role in structuring complex predicate informatio...
grantor: University of TorontoLexical semantic classes incorporate both syntactic and sema...
Database ' The coding is based on the Bank of English corpus at COBUILD and uses a simple notat...
Abstract We present an unsupervised method for inducing verb classes from verb uses in gigaword corp...
Advances in representation learning have enabled natural language processing models to derive non-ne...
In this paper, I discuss some methodological problems arising from the use of corpus data for semant...
We use children's noun learning as a probe into the nature of their syntactic prediction mechanism a...
Recognising the grammatical categories of words is a necessary skill for the acquisition of syntax a...
Ambiguity resolution in the parsing of natural language requires a vast repository of knowledge to g...
We cluster verbs into lexical semantic classes, using a general set of noisy features that cap-ture ...
Adult knowledge of a language involves correctly balancing lexically-based and more language-general...