As language models are adapted by a more sophisticated and diverse set of users, the importance of guaranteeing that they provide factually correct information supported by verifiable sources is critical across fields of study & professions. This is especially the case for high-stakes fields, such as medicine and law, where the risk of propagating false information is high and can lead to undesirable societal consequences. Previous work studying factuality and attribution has not focused on analyzing these characteristics of language model outputs in domain-specific scenarios. In this work, we present an evaluation study analyzing various axes of factuality and attribution provided in responses from a few systems, by bringing domain experts...
Naturally occurring information-seeking questions often contain questionable assumptions -- assumpti...
Document-level models for information extraction tasks like slot-filling are flexible: they can be a...
Current large language models (LLMs) can exhibit near-human levels of performance on many natural la...
We propose a benchmark to measure whether a language model is truthful in generating answers to ques...
We study whether language models can evaluate the validity of their own claims and predict which que...
With a rise in false, inaccurate, and misleading information in propaganda, news, and social media, ...
The irreplaceable key to the triumph of Question & Answer (Q&A) platforms is their users providing h...
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible t...
Semantic consistency of a language model is broadly defined as the model's ability to produce semant...
Knowledge-intensive tasks (e.g., open-domain question answering (QA)) require a substantial amount o...
The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread ...
Large language models (LLMs) are trained on web-scale corpora that inevitably include contradictory ...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
Large language models (LLMs) have demonstrated powerful text generation capabilities, bringing unpre...
Alongside huge volumes of research on deep learning models in NLP in the recent years, there has bee...
Naturally occurring information-seeking questions often contain questionable assumptions -- assumpti...
Document-level models for information extraction tasks like slot-filling are flexible: they can be a...
Current large language models (LLMs) can exhibit near-human levels of performance on many natural la...
We propose a benchmark to measure whether a language model is truthful in generating answers to ques...
We study whether language models can evaluate the validity of their own claims and predict which que...
With a rise in false, inaccurate, and misleading information in propaganda, news, and social media, ...
The irreplaceable key to the triumph of Question & Answer (Q&A) platforms is their users providing h...
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible t...
Semantic consistency of a language model is broadly defined as the model's ability to produce semant...
Knowledge-intensive tasks (e.g., open-domain question answering (QA)) require a substantial amount o...
The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread ...
Large language models (LLMs) are trained on web-scale corpora that inevitably include contradictory ...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
Large language models (LLMs) have demonstrated powerful text generation capabilities, bringing unpre...
Alongside huge volumes of research on deep learning models in NLP in the recent years, there has bee...
Naturally occurring information-seeking questions often contain questionable assumptions -- assumpti...
Document-level models for information extraction tasks like slot-filling are flexible: they can be a...
Current large language models (LLMs) can exhibit near-human levels of performance on many natural la...