As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many Natural Language Processing (NLP) tasks, such as information retrieval, knowledge base building, machine translation, text classification, and text summarization. In this paper, we introduce WikiDes, a novel dataset to generate short descriptions of Wikipedia articles for the problem of text summarization. The dataset consists of over 80k English samples on 6987 topics. We set up a two-phase summarization method — description generation (Phase I) and candidate ranking (Phase II) — as a strong approach that relies on transfer and contrastive learning. For description generation, T5 and BART show their superiority compared to other small-scale...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many...
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many...
Nowadays natural language generation (NLG) is used in everything from news reporting and chatbots to...
While Wikipedia exists in 287 languages, its content is unevenly distributed among them. It is there...
WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikiped...
While Wikipedia exists in 287 languages, its content is unevenly distributed among them. In this wor...
When humans approach the task of text categorization, they interpret the specific wording of the doc...
We propose a language-independent graph-based method to build a-la-carte article collections on user...
The Web has evolved into a huge mine of knowledge carved in different forms, the predominant one sti...
Fast-developing fields such as Artificial Intelligence (AI) often outpace the efforts of encyclopedi...
We propose a language-independent graph-based method to build a-la-carte article collections on user...
Wikipedia is one of the richest knowledge sources on the Web today. In order to facilitate navigatin...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many...
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many...
Nowadays natural language generation (NLG) is used in everything from news reporting and chatbots to...
While Wikipedia exists in 287 languages, its content is unevenly distributed among them. It is there...
WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikiped...
While Wikipedia exists in 287 languages, its content is unevenly distributed among them. In this wor...
When humans approach the task of text categorization, they interpret the specific wording of the doc...
We propose a language-independent graph-based method to build a-la-carte article collections on user...
The Web has evolved into a huge mine of knowledge carved in different forms, the predominant one sti...
Fast-developing fields such as Artificial Intelligence (AI) often outpace the efforts of encyclopedi...
We propose a language-independent graph-based method to build a-la-carte article collections on user...
Wikipedia is one of the richest knowledge sources on the Web today. In order to facilitate navigatin...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...
Thesauri are useful knowledge structures for assisting information retrieval. Yet their production i...