Discovering relevant knowledge out of unstructured text in not a trivial task. Search engines relying on full-text indexing of content reach their limits when confronted to poor quality, ambiguity, or multiple languages. Some of these shortcomings can be addressed by information extraction and related natural language processing techniques, but it still falls short of adequate knowledge representation. In this thesis, we defend a generic approach striving to be as language-independent, domain-independent, and content-independent as possible. To reach this goal, we offer to disambiguate terms with their corresponding identifiers in Linked Data knowledge bases, paving the way for full-scale semantic enrichment of textual content. The added va...
In recent years, the ever-increasing quantities of entities in large knowledge bases on the Web, suc...
Recent years have witnessed a surge in the amount of semantic information published on the Web. Inde...
Given a document collection, Document Retrieval is the task of returning the most relevant documents...
Discovering relevant knowledge out of unstructured text in not a trivial task. Search engines relyin...
This paper introduces MERCKX, a Multilingual Entity/Resource Combiner & Knowledge eXtractor. A case ...
Abstract. Valuable local information is often available on the web, but encoded in a foreign languag...
The administration of electronic publication in the Information Era congregates old and new problems...
The administration of electronic publication in the Information Era congregates old and new problems...
Abstract The growth of multilingual web content and increasing internationaliza-tion portends the ne...
International audienceThe Web bears the potential of being the world's greatest encyclopedic source,...
TutorialInternational audienceThe Web bears the potential of being the world's greatest encyclopedic...
My research gravitates around digital documents: knowledge extraction, knowledge exploitation, and s...
The World Wide Web (WWW) is a huge information network within which searching for relevant quality c...
The task of Relation Extraction (RE) is concerned with creating extractors that automatically find s...
Extracting knowledge from texts is highly contextual and depends on the domain and on the task. We s...
In recent years, the ever-increasing quantities of entities in large knowledge bases on the Web, suc...
Recent years have witnessed a surge in the amount of semantic information published on the Web. Inde...
Given a document collection, Document Retrieval is the task of returning the most relevant documents...
Discovering relevant knowledge out of unstructured text in not a trivial task. Search engines relyin...
This paper introduces MERCKX, a Multilingual Entity/Resource Combiner & Knowledge eXtractor. A case ...
Abstract. Valuable local information is often available on the web, but encoded in a foreign languag...
The administration of electronic publication in the Information Era congregates old and new problems...
The administration of electronic publication in the Information Era congregates old and new problems...
Abstract The growth of multilingual web content and increasing internationaliza-tion portends the ne...
International audienceThe Web bears the potential of being the world's greatest encyclopedic source,...
TutorialInternational audienceThe Web bears the potential of being the world's greatest encyclopedic...
My research gravitates around digital documents: knowledge extraction, knowledge exploitation, and s...
The World Wide Web (WWW) is a huge information network within which searching for relevant quality c...
The task of Relation Extraction (RE) is concerned with creating extractors that automatically find s...
Extracting knowledge from texts is highly contextual and depends on the domain and on the task. We s...
In recent years, the ever-increasing quantities of entities in large knowledge bases on the Web, suc...
Recent years have witnessed a surge in the amount of semantic information published on the Web. Inde...
Given a document collection, Document Retrieval is the task of returning the most relevant documents...