The LE-2111 SPARKLE (Shallow Parsing and Knowledge extraction for Language Engineering) project is aimed at the automatic extraction of lexical and semantic information from textual corpora in order to improve the performances of NLP systems. In this paper we describe an algorithm for the extraction of subcategorization patterns for Italian verbs. The extraction procedure is carried out on the basis of an efficient and accurate analogy-based engine and pre- and post-filters based on simple linguistic constraints. Despite the simplicity of the analogy-based algorithm the amount of lost information is negligible, and precision and recall over a set of hand-crafted subcategorization patterns (namely those produced within the L...
The presented work investigates methods for semi-automatic extraction of lexico-syntactic informatio...
The work presented in this paper reviews in depth the computational methods and tecniques developed ...
The goal of this paper is to introduce T-PAS, a resource of typed predicate argument structures for ...
Lexica of predicate-argument structures constitute a useful tool for several tasks in NLP. This pape...
This paper reports on the experience of developing and applyinga shallow parsing scheme (chunking) t...
The aim of this paper is to introduce LexIt, a computational framework for the automatic acquisition...
In this paper, we reported experiments of unsupervised automatic acquisition of Italian and English ...
We describe a novel technique and implemented system for constructing a subcategorization dictionary...
Subcategorization is a kind of knowledge which can be considered as crucial in several NLP tasks, su...
We describe a state-of-the-art automatic system that can acquire subcategorisation frames from raw t...
The paper describes a system for extracting subcategorization frames of verbs not found in existing ...
This paper reports on work, carried out in the framework of the CombiNet project, focusing on the au...
The theoretical characterisation of multiword expressions (MWEs) is tightlyconnected to their actual...
This paper reports on work, carried out in the framework of the CombiNet project, focusing on the au...
In this paper, we outline the methodology we adopted to develop a FrameNet for Italian. The main ele...
The presented work investigates methods for semi-automatic extraction of lexico-syntactic informatio...
The work presented in this paper reviews in depth the computational methods and tecniques developed ...
The goal of this paper is to introduce T-PAS, a resource of typed predicate argument structures for ...
Lexica of predicate-argument structures constitute a useful tool for several tasks in NLP. This pape...
This paper reports on the experience of developing and applyinga shallow parsing scheme (chunking) t...
The aim of this paper is to introduce LexIt, a computational framework for the automatic acquisition...
In this paper, we reported experiments of unsupervised automatic acquisition of Italian and English ...
We describe a novel technique and implemented system for constructing a subcategorization dictionary...
Subcategorization is a kind of knowledge which can be considered as crucial in several NLP tasks, su...
We describe a state-of-the-art automatic system that can acquire subcategorisation frames from raw t...
The paper describes a system for extracting subcategorization frames of verbs not found in existing ...
This paper reports on work, carried out in the framework of the CombiNet project, focusing on the au...
The theoretical characterisation of multiword expressions (MWEs) is tightlyconnected to their actual...
This paper reports on work, carried out in the framework of the CombiNet project, focusing on the au...
In this paper, we outline the methodology we adopted to develop a FrameNet for Italian. The main ele...
The presented work investigates methods for semi-automatic extraction of lexico-syntactic informatio...
The work presented in this paper reviews in depth the computational methods and tecniques developed ...
The goal of this paper is to introduce T-PAS, a resource of typed predicate argument structures for ...