Current large language models (LLMs) can exhibit near-human levels of performance on many natural language-based tasks, including open-domain question answering. Unfortunately, at present they also convincingly hallucinate incorrect answers, so responses to questions must be verified against external sources before they can be accepted at face value. In this thesis, I report two simple experiments that automatically validate generated answers against a corpus. The experiments are based on questions and passages from the MS MARCO (V1) test collection and on a retrieval pipeline consisting of sparse retrieval, dense retrieval, and neural rerankers. The first experiment validates the generated answer in its entirety. After presenting a q...
When answering a question, people often draw upon their rich world knowledge in addition to the part...
Recent Language Models (LMs) have shown impressive capabilities in generating texts with the knowled...
Question answering (QA) aims at retrieving precise information from a large co...
Large language models (LLMs) have been shown to possess impressive capabilities, while also raising ...
Although remarkable progress has been achieved in preventing large language model (LLM) hallucinatio...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
Generative Large Language Models (LLMs) such as GPT-3 are capable of generating highly fluent respon...
We propose a benchmark to measure whether a language model is truthful in generating answers to ques...
Large Language Models (LLMs), such as ChatGPT/GPT-4, have garnered widespread attention owing to the...
Large language models (LLMs) have demonstrated impressive language understanding and generation capa...
Recently developed large language models have achieved remarkable success in generating fluent and c...
Semantic consistency of a language model is broadly defined as the model's ability to produce semant...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread ...