AbstractThe online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. For developers and researchers it represents a giant multilingual database of concepts and semantic relations, a potential resource for natural language processing and many other research areas. This paper introduces the Wikipedia Miner toolkit, an open-source software system that allows researchers and developers to integrate Wikipediaʼs rich semantics into their own applications. The toolkit creates databases that contain summarized versions of Wikipediaʼs content and structure, and includes a Java API to provide access to them. Wikipediaʼs articles, categories and redirects are represented as classes, and can be efficiently searched...
We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling...
{zesch,gurevych,max} (at) tk.informatik.tu-darmstadt.de Abstract. We analyze Wikipedia as a lexical ...
Wikipedia is not only a large encyclopedia, but lately also a source of linguistic data for various ...
The online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. F...
AbstractThe online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked art...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling W...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling...
We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling...
{zesch,gurevych,max} (at) tk.informatik.tu-darmstadt.de Abstract. We analyze Wikipedia as a lexical ...
Wikipedia is not only a large encyclopedia, but lately also a source of linguistic data for various ...
The online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. F...
AbstractThe online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked art...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling W...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling...
We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling...
{zesch,gurevych,max} (at) tk.informatik.tu-darmstadt.de Abstract. We analyze Wikipedia as a lexical ...
Wikipedia is not only a large encyclopedia, but lately also a source of linguistic data for various ...