Corpora, which are text collections selected for specific purposes, are playing an increasing role in Linguistics and Natural Language Processing (NLP). They are conceived as knowledge sources on natural language use, as much as knowledge on the entities designated by linguistic expressions, and they are used in particular to evaluate NLP application performances. The criteria prevailing on their constitution have an obvious, though still delicate to characterize, impact on (i) the major linguistic structures they contain, (ii) the knowledge conveyed, and, (iii) computational systems' success on a give task. This thesis studies methodologies of automatic extraction of semantic relations on written text corpora. Such a topic calls for a deta...
National audienceWhich type of mathematical tools must be used to represent the meaning of a linguis...
The major part of the information available on the web is provided in textual form, i.e. in unstruct...
International audienceThe diversity of applications met today under the term "language industry" cov...
Extracting information from linguistic data has gain more and more attention in the last decades inr...
Why is it so difficult to automatically understand a language even when what is targeted is only a l...
This thesis focuses on the formalisms that make it possible to mathematically represent not only the...
By using theorical and practical advances in computer sciences and linguistics, contemporary lexicog...
Computer systems are more and more important in everyday life, and errors into those systems can mak...
The past decade witnessed significant advances in the field of relation extraction from text, facili...
This study takes dialogue as a kind of discourse and focuses on its coherence. We consider coherence...
For a long time, polysemy used to be considered as a marginal or accidental phenomenon in language. ...
The proliferation of digital data has enabled scientific and practitioner communities to createnew d...
The omnipresence of polysemy in natural languages compels us to consider the comprehension of langua...
It is not possible for a science computing system to process a text when sequences, like words or se...
Translation techniques constitute an important subject in translation studies and in linguistics. Wh...
National audienceWhich type of mathematical tools must be used to represent the meaning of a linguis...
The major part of the information available on the web is provided in textual form, i.e. in unstruct...
International audienceThe diversity of applications met today under the term "language industry" cov...
Extracting information from linguistic data has gain more and more attention in the last decades inr...
Why is it so difficult to automatically understand a language even when what is targeted is only a l...
This thesis focuses on the formalisms that make it possible to mathematically represent not only the...
By using theorical and practical advances in computer sciences and linguistics, contemporary lexicog...
Computer systems are more and more important in everyday life, and errors into those systems can mak...
The past decade witnessed significant advances in the field of relation extraction from text, facili...
This study takes dialogue as a kind of discourse and focuses on its coherence. We consider coherence...
For a long time, polysemy used to be considered as a marginal or accidental phenomenon in language. ...
The proliferation of digital data has enabled scientific and practitioner communities to createnew d...
The omnipresence of polysemy in natural languages compels us to consider the comprehension of langua...
It is not possible for a science computing system to process a text when sequences, like words or se...
Translation techniques constitute an important subject in translation studies and in linguistics. Wh...
National audienceWhich type of mathematical tools must be used to represent the meaning of a linguis...
The major part of the information available on the web is provided in textual form, i.e. in unstruct...
International audienceThe diversity of applications met today under the term "language industry" cov...