15 pagesThis paper focuses on the design and the development of a text processing architecture exploiting specialized NLP tools, to produce linguistically annotated documents. This architecture is instanciated using existing NLP modules and resources which need to be tuned to specific domains. Taking as an example the biological domain, we show how a syntactic analyser can be adapted to this domain. We focus on parsing since it exhibits various kinds of adaptation, ranging from unknown words analysis to specific vocabulary (terms, named entities) and structure identification
Abstract. Due to the inherent difficulties associated with manual ontology building, knowledge acqui...
The paper describes the ALVIS annotation format designed for the indexing of large collections of do...
In the information society large amounts of information are being generated and transmitted constant...
International audienceWeb semantic access in specific domains calls for specialized search engines w...
This paper proposes a simple mechanism for supporting multiple overlapping layers of annotations for...
This chapter describes perspectives for utilizing natural language processing (NLP) to analyze artif...
12 pagesInternational audienceWeb semantic access in specific domains calls for specialised search e...
Annotated corpora are sets of structured text used to enable Natural Language Pro-cessing (NLP) task...
Abstract. The use of simple and easy to understand language is an essential requirement for web docu...
In this paper we present a user friendly approach to annotate websites with machine-processable info...
Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount...
In this paper, we describe Langforia, a multilingual processing pipeline to annotate texts with mult...
Due to the inherent difficulties associated with manual ontology building, knowledge acquisition app...
A large portion of the useful information on the web is in the form of unstructured natural language...
Many experiments have shown that traditional approaches to both Natural Language Processing (NLP) an...
Abstract. Due to the inherent difficulties associated with manual ontology building, knowledge acqui...
The paper describes the ALVIS annotation format designed for the indexing of large collections of do...
In the information society large amounts of information are being generated and transmitted constant...
International audienceWeb semantic access in specific domains calls for specialized search engines w...
This paper proposes a simple mechanism for supporting multiple overlapping layers of annotations for...
This chapter describes perspectives for utilizing natural language processing (NLP) to analyze artif...
12 pagesInternational audienceWeb semantic access in specific domains calls for specialised search e...
Annotated corpora are sets of structured text used to enable Natural Language Pro-cessing (NLP) task...
Abstract. The use of simple and easy to understand language is an essential requirement for web docu...
In this paper we present a user friendly approach to annotate websites with machine-processable info...
Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount...
In this paper, we describe Langforia, a multilingual processing pipeline to annotate texts with mult...
Due to the inherent difficulties associated with manual ontology building, knowledge acquisition app...
A large portion of the useful information on the web is in the form of unstructured natural language...
Many experiments have shown that traditional approaches to both Natural Language Processing (NLP) an...
Abstract. Due to the inherent difficulties associated with manual ontology building, knowledge acqui...
The paper describes the ALVIS annotation format designed for the indexing of large collections of do...
In the information society large amounts of information are being generated and transmitted constant...