In order to address increasing demands of real-world applications, the research for knowledge-intensive NLP (KI-NLP) should advance by capturing the challenges of a truly open-domain environment: web-scale knowledge, lack of structure, inconsistent quality and noise. To this end, we propose a new setup for evaluating existing knowledge intensive tasks in which we generalize the background corpus to a universal web snapshot. We investigate a slate of NLP tasks which rely on knowledge - either factual or common sense, and ask systems to use a subset of CCNet - the Sphere corpus - as a knowledge source. In contrast to Wikipedia, otherwise a common background corpus in KI-NLP, Sphere is orders of magnitude larger and better reflects the full di...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing...
This paper describes SW1, the first version of a semantically annotated snapshot of the EnglishWikip...
The Web has evolved into a huge mine of knowledge carved in different forms, the predominant one sti...
TutorialInternational audienceThe Web bears the potential of being the world's greatest encyclopedic...
International audienceThe Web bears the potential of being the world's greatest encyclopedic source,...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount...
Recent advents in the machine learning community, driven by larger datasets and novel algorithmic ap...
Information retrieval and data interpretation on the web, for the purpose of gaining knowledgeable i...
AAAI 2010 Workshop on Collaboratively-built Knowledge Sources and Artificial Intelligence (WikiAI 20...
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
Natural Language Processing systems crucially depend on the availability of lexical and conceptual k...
Wikipedia provides a semantic network for computing semantic relatedness in a more structured fashio...
{zesch,gurevych,max} (at) tk.informatik.tu-darmstadt.de Abstract. We analyze Wikipedia as a lexical ...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing...
This paper describes SW1, the first version of a semantically annotated snapshot of the EnglishWikip...
The Web has evolved into a huge mine of knowledge carved in different forms, the predominant one sti...
TutorialInternational audienceThe Web bears the potential of being the world's greatest encyclopedic...
International audienceThe Web bears the potential of being the world's greatest encyclopedic source,...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount...
Recent advents in the machine learning community, driven by larger datasets and novel algorithmic ap...
Information retrieval and data interpretation on the web, for the purpose of gaining knowledgeable i...
AAAI 2010 Workshop on Collaboratively-built Knowledge Sources and Artificial Intelligence (WikiAI 20...
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
Natural Language Processing systems crucially depend on the availability of lexical and conceptual k...
Wikipedia provides a semantic network for computing semantic relatedness in a more structured fashio...
{zesch,gurevych,max} (at) tk.informatik.tu-darmstadt.de Abstract. We analyze Wikipedia as a lexical ...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing...
This paper describes SW1, the first version of a semantically annotated snapshot of the EnglishWikip...