Although neural information retrieval has witnessed great improvements, recent works showed that the generalization ability of dense retrieval models on target domains with different distributions is limited, which contrasts with the results obtained with interaction-based models. To address this issue, researchers have resorted to adversarial learning and query generation approaches; both approaches nevertheless resulted in limited improvements. In this paper, we propose to use a self-supervision approach in which pseudo-relevance labels are automatically generated on the target domain. To do so, we first use the standard BM25 model on the target domain to obtain a first ranking of documents, and then use the interaction-based model T53B t...
Most of the existing open-source search engines, utilize keyword or tf-idf based techniques to find ...
A lack of reliable relevance labels for training ranking functions is a significant problem for many...
This paper studies multi-task training of retrieval-augmented generation models for knowledge-intens...
Dense retrieval approaches can overcome the lexical gap and lead to significantly improved search re...
Pseudo Relevance Feedback (PRF) is known to improve the effectiveness of bag-of-words retrievers. At...
Deep learning for Information Retrieval (IR) requires a large amount of high-quality query-document ...
Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulnes...
One challenge with neural ranking is the need for a large amount of manually-labeled relevance judgm...
Many information retrieval tasks require large labeled datasets for fine-tuning. However, such datas...
Recent developments of machine learning models, and in particular deep neural networks, have yielded...
Recent developments in neural information retrieval models have been promising, but a problem remain...
Pseudo-relevance feedback (PRF) is a classical technique to improve search engine retrieval effectiv...
The limited availability of ground truth relevance labels has been a major impediment to the applica...
Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulnes...
This paper describes a method of domain adaptive training for semantic segmentation using multiple s...
Most of the existing open-source search engines, utilize keyword or tf-idf based techniques to find ...
A lack of reliable relevance labels for training ranking functions is a significant problem for many...
This paper studies multi-task training of retrieval-augmented generation models for knowledge-intens...
Dense retrieval approaches can overcome the lexical gap and lead to significantly improved search re...
Pseudo Relevance Feedback (PRF) is known to improve the effectiveness of bag-of-words retrievers. At...
Deep learning for Information Retrieval (IR) requires a large amount of high-quality query-document ...
Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulnes...
One challenge with neural ranking is the need for a large amount of manually-labeled relevance judgm...
Many information retrieval tasks require large labeled datasets for fine-tuning. However, such datas...
Recent developments of machine learning models, and in particular deep neural networks, have yielded...
Recent developments in neural information retrieval models have been promising, but a problem remain...
Pseudo-relevance feedback (PRF) is a classical technique to improve search engine retrieval effectiv...
The limited availability of ground truth relevance labels has been a major impediment to the applica...
Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulnes...
This paper describes a method of domain adaptive training for semantic segmentation using multiple s...
Most of the existing open-source search engines, utilize keyword or tf-idf based techniques to find ...
A lack of reliable relevance labels for training ranking functions is a significant problem for many...
This paper studies multi-task training of retrieval-augmented generation models for knowledge-intens...