Many articles in the online encyclopedia Wikipedia have hyperlinks to ambigu-ous article titles. To improve the reader experience, any link to an ambiguous title should be replaced with a link to one of the unambiguous meanings. We propose a novel statistical topic model, which we refer to as the Link Text Topic Model (lttm), that can suggest new link targets for existing ambiguous links in Wikipedia articles. For evaluation, we develop a method for extracting ground truth from snapshots of Wikipedia at different points in time. We evaluate lttm on this ground truth, and demonstrate its superiority over existing link- and content-based approaches. Finally, we build a web service that uses lttm to suggest unambiguous articles for human edito...
One of the valuable features of any collaboratively constructed semantic resource (CSR) is its abili...
We investigate the task of finding links from Wikipedia pages to external web pages. Such external l...
Wikipedia is a goldmine of information. Each article describes a single concept, and together they c...
Many articles in the online encyclopedia Wikipedia have hyperlinks to ambiguous article titles. To i...
Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierar...
Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierar...
Automatically linking Wikipedia pages is done mostly by two strategies: (i) a content based strategy...
International audienceWikipedia, the largest open-collaborative online encyclopedia, is a corpus of ...
Automatically linking Wikipedia pages can be done either content based by exploiting word similariti...
International audienceNetworks of documents connected by hyperlinks, such as Wikipedia, are ubiquito...
International audienceMany Wikipedia articles that cover the same topic in different language editio...
AbstractIntroductionThe ambiguity of biomedical abbreviations is one of the challenges in biomedical...
This paper contains a description of experiments for the 2008 INEX XML-mining track. Our goal for th...
Wikipedia article names can be utilized as a controlled vocabulary for identifying the main topics i...
Wikification, commonly referred to as Disam-biguation to Wikipedia (D2W), is the task of identifying...
One of the valuable features of any collaboratively constructed semantic resource (CSR) is its abili...
We investigate the task of finding links from Wikipedia pages to external web pages. Such external l...
Wikipedia is a goldmine of information. Each article describes a single concept, and together they c...
Many articles in the online encyclopedia Wikipedia have hyperlinks to ambiguous article titles. To i...
Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierar...
Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierar...
Automatically linking Wikipedia pages is done mostly by two strategies: (i) a content based strategy...
International audienceWikipedia, the largest open-collaborative online encyclopedia, is a corpus of ...
Automatically linking Wikipedia pages can be done either content based by exploiting word similariti...
International audienceNetworks of documents connected by hyperlinks, such as Wikipedia, are ubiquito...
International audienceMany Wikipedia articles that cover the same topic in different language editio...
AbstractIntroductionThe ambiguity of biomedical abbreviations is one of the challenges in biomedical...
This paper contains a description of experiments for the 2008 INEX XML-mining track. Our goal for th...
Wikipedia article names can be utilized as a controlled vocabulary for identifying the main topics i...
Wikification, commonly referred to as Disam-biguation to Wikipedia (D2W), is the task of identifying...
One of the valuable features of any collaboratively constructed semantic resource (CSR) is its abili...
We investigate the task of finding links from Wikipedia pages to external web pages. Such external l...
Wikipedia is a goldmine of information. Each article describes a single concept, and together they c...