Multimodal machine translation (MMT) systems have been shown to outperform their text-only neural machine translation (NMT) counterparts when visual context is available. However, recent studies have also shown that the performance of MMT models is only marginally impacted when the associated image is replaced with an unrelated image or noise, which suggests that the visual context might not be exploited by the model at all. We hypothesize that this might be caused by the nature of the commonly used evaluation benchmark, also known as Multi30K, where the translations of image captions were prepared without actually showing the images to human translators. In this paper, we present a qualitative study that examines the role of datasets in st...
Multimodality in Machine Translation Jindřich Libovický Traditionally, most natural language process...
Multimodal machine translation (MMT) aims to improve translation quality by incorporating informatio...
Recent advancements in multimodal techniques open exciting possibilities for models excelling in div...
Multimodal machine translation (MMT) systems have been shown to outperform their text-only neural ma...
Current work on multimodal machine translation (MMT) has suggested that the visual modality is eithe...
Typically it is seen that multimodal neural machine translation (MNMT) systems trained on a combinat...
Over the past few years, there has been a lot of progress being made in machine translation through ...
Multimodal machine translation involves drawing information from more than one modality, based on th...
Recently, there has been a surge in research in multimodal machine translation (MMT), where addition...
We introduce a novel multimodal machine translation model that integrates image features modulated b...
Multimodal machine translation involves drawing information from more than one modality, based on th...
In this paper, we study how humans perceive the use of images as an additional knowledge source to m...
International audienceIn recent years, joint text-image embeddings have significantly improved thank...
Recent work on multimodal machine translation has attempted to address the problem of producing targ...
This paper introduces BD2BB, a novel language and vision benchmark that requires multimodal models c...
Multimodality in Machine Translation Jindřich Libovický Traditionally, most natural language process...
Multimodal machine translation (MMT) aims to improve translation quality by incorporating informatio...
Recent advancements in multimodal techniques open exciting possibilities for models excelling in div...
Multimodal machine translation (MMT) systems have been shown to outperform their text-only neural ma...
Current work on multimodal machine translation (MMT) has suggested that the visual modality is eithe...
Typically it is seen that multimodal neural machine translation (MNMT) systems trained on a combinat...
Over the past few years, there has been a lot of progress being made in machine translation through ...
Multimodal machine translation involves drawing information from more than one modality, based on th...
Recently, there has been a surge in research in multimodal machine translation (MMT), where addition...
We introduce a novel multimodal machine translation model that integrates image features modulated b...
Multimodal machine translation involves drawing information from more than one modality, based on th...
In this paper, we study how humans perceive the use of images as an additional knowledge source to m...
International audienceIn recent years, joint text-image embeddings have significantly improved thank...
Recent work on multimodal machine translation has attempted to address the problem of producing targ...
This paper introduces BD2BB, a novel language and vision benchmark that requires multimodal models c...
Multimodality in Machine Translation Jindřich Libovický Traditionally, most natural language process...
Multimodal machine translation (MMT) aims to improve translation quality by incorporating informatio...
Recent advancements in multimodal techniques open exciting possibilities for models excelling in div...