Augmenting language models with a retrieval mechanism has been shown to significantly improve their performance while keeping the number of parameters low. Retrieval-augmented models commonly rely on a semantic retrieval mechanism based on the similarity between dense representations of the query chunk and potential neighbors. In this paper, we study the state-of-the-art Retro model and observe that its performance gain is better explained by surface-level similarities, such as token overlap. Inspired by this, we replace the semantic retrieval in Retro with a surface-level method based on BM25, obtaining a significant reduction in perplexity. As full BM25 retrieval can be computationally costly for large datasets, we also apply it in a re-r...
Large Language Models (LLMs) make natural interfaces to factual knowledge, but their usefulness is l...
Dense retrieval, which describes the use of contextualised language models such as BERT to identify ...
Text retrieval is a long-standing research topic on information seeking, where a system is required ...
In this work, we propose a simple method that applies a large language model (LLM) to large-scale re...
Retrieval augmentation enables large language models to take advantage of external knowledge, for ex...
The recent decade has witnessed an explosive growth of online information with the birth of Web. Sea...
Recent work has shown that small distilled language models are strong competitors to models that are...
We propose a novel method of query expansion for Language Modeling (LM) in Information Retrieval (IR...
Document retrieval systems recover documents from a database and order them according to their perce...
Despite the fact that modern deep neural networks have the ability to memorize (almost) the entire t...
Retrieval-augmented language models (RALMs) represent a substantial advancement in the capabilities ...
Web search services process thousands of queries per second, and filter their answers from collectio...
The aims of this paper are twofold. Our first aim is to compare results of the earlier Terabyte trac...
Dual-encoder-based dense retrieval models have become the standard in IR. They employ large Transfor...
We systematically investigate a new approach to estimating the parameters of language models for inf...
Large Language Models (LLMs) make natural interfaces to factual knowledge, but their usefulness is l...
Dense retrieval, which describes the use of contextualised language models such as BERT to identify ...
Text retrieval is a long-standing research topic on information seeking, where a system is required ...
In this work, we propose a simple method that applies a large language model (LLM) to large-scale re...
Retrieval augmentation enables large language models to take advantage of external knowledge, for ex...
The recent decade has witnessed an explosive growth of online information with the birth of Web. Sea...
Recent work has shown that small distilled language models are strong competitors to models that are...
We propose a novel method of query expansion for Language Modeling (LM) in Information Retrieval (IR...
Document retrieval systems recover documents from a database and order them according to their perce...
Despite the fact that modern deep neural networks have the ability to memorize (almost) the entire t...
Retrieval-augmented language models (RALMs) represent a substantial advancement in the capabilities ...
Web search services process thousands of queries per second, and filter their answers from collectio...
The aims of this paper are twofold. Our first aim is to compare results of the earlier Terabyte trac...
Dual-encoder-based dense retrieval models have become the standard in IR. They employ large Transfor...
We systematically investigate a new approach to estimating the parameters of language models for inf...
Large Language Models (LLMs) make natural interfaces to factual knowledge, but their usefulness is l...
Dense retrieval, which describes the use of contextualised language models such as BERT to identify ...
Text retrieval is a long-standing research topic on information seeking, where a system is required ...