We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers acquire an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). The ability to build such a corpus is important since domain-specific datasets are scarce. WikiDoMiner generates the corpus by first extracting a set of domain-specific keywords from the RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles that are relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering essential ...
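The two-step pipeline described above (keyword extraction from the RS, then querying Wikipedia) can be sketched roughly as follows. This is an illustration only, not WikiDoMiner's implementation: the abstract does not say how keywords are selected, so a plain frequency count stands in for that step, and the names `extract_keywords`, `build_search_url`, and `STOPWORDS` are hypothetical. The sketch builds search URLs for the public MediaWiki Action API but does not send any requests.

```python
import re
from collections import Counter
from urllib.parse import urlencode

# Hypothetical, deliberately tiny stopword list (assumption for illustration).
STOPWORDS = {"the", "a", "an", "of", "to", "and", "shall", "be", "is", "in", "for"}

# Public MediaWiki Action API endpoint for English Wikipedia.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def extract_keywords(rs_text, top_k=2):
    """Return the top_k most frequent non-stopword tokens in the RS text.

    Assumption: raw term frequency is a stand-in for whatever
    keyword-extraction method the actual tool uses.
    """
    tokens = re.findall(r"[a-z]+", rs_text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS and len(t) > 2)
    return [word for word, _ in counts.most_common(top_k)]


def build_search_url(keyword, limit=5):
    """Build a full-text search URL for one keyword (no request is sent)."""
    params = {
        "action": "query",
        "list": "search",
        "srsearch": keyword,
        "srlimit": limit,
        "format": "json",
    }
    return f"{API_ENDPOINT}?{urlencode(params)}"


# Toy requirements specification (invented for this example).
rs = (
    "The satellite shall transmit telemetry to the ground station. "
    "The satellite shall store telemetry. "
    "The satellite shall reboot."
)

keywords = extract_keywords(rs)
urls = [build_search_url(k) for k in keywords]
```

Fetching each URL would return a JSON list of matching article titles, and collecting the text of those articles would form the domain-specific corpus.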
Wikipedia is not only a large encyclopedia, but lately also a source of linguistic data for various ...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
The Web bears the potential of being the world’s greatest encyclopedic source, but we are far from f...
The online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. F...
Wikipedia is a goldmine of information; not just for its many readers, but also for the growing comm...
We present a simple but effective method of automatically extracting domain-specific terms using Wik...
Domain terms are a useful resource for tuning both resources and NLP processors to domain specific t...
Domain-specific thesauri are high-cost, high-maintenance, high-value knowledge structures. We show h...