We introduce FAITHSCORE (Faithfulness to Atomic Image Facts Score), a reference-free and fine-grained evaluation metric that measures the faithfulness of the generated free-form answers from large vision-language models (LVLMs). The FAITHSCORE evaluation first identifies sub-sentences containing descriptive statements that need to be verified, then extracts a comprehensive list of atomic facts from these sub-sentences, and finally conducts consistency verification between fine-grained atomic facts and the input image. Meta-evaluation demonstrates that our metric highly correlates with human judgments of faithfulness. We collect two benchmark datasets (i.e. LLaVA-1k and MSCOCO-Cap) for evaluating LVLMs instruction-following hallucinations. W...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
Large Vision-Language Models (LVLMs) have recently achieved remarkable success. However, LVLMs are s...
Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalitie...
Large Language Models (LLMs), such as ChatGPT/GPT-4, have garnered widespread attention owing to the...
Although remarkable progress has been achieved in preventing large language model (LLM) hallucinatio...
This research paper focuses on the challenges posed by hallucinations in large language models (LLMs...
Nowadays, the research on Large Vision-Language Models (LVLMs) has been significantly promoted thank...
Recently developed large language models have achieved remarkable success in generating fluent and c...
The goal of information-seeking dialogue is to respond to seeker queries with natural language utter...
Large Language Models (LLMs) have demonstrated remarkable proficiency in generating fluent text. How...
The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread ...
Although demonstrating superb performance on various NLP tasks, large language models (LLMs) still s...
Advancement in large pretrained language models has significantly improved their performance for con...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
Large Vision-Language Models (LVLMs) have recently achieved remarkable success. However, LVLMs are s...
Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalitie...
Large Language Models (LLMs), such as ChatGPT/GPT-4, have garnered widespread attention owing to the...
Although remarkable progress has been achieved in preventing large language model (LLM) hallucinatio...
This research paper focuses on the challenges posed by hallucinations in large language models (LLMs...
Nowadays, the research on Large Vision-Language Models (LVLMs) has been significantly promoted thank...
Recently developed large language models have achieved remarkable success in generating fluent and c...
The goal of information-seeking dialogue is to respond to seeker queries with natural language utter...
Large Language Models (LLMs) have demonstrated remarkable proficiency in generating fluent text. How...
The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread ...
Although demonstrating superb performance on various NLP tasks, large language models (LLMs) still s...
Advancement in large pretrained language models has significantly improved their performance for con...
The interactive nature of Large Language Models (LLMs) theoretically allows models to refine and imp...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...