Image generation from text is the task of generating new images from a textual unit such as word, phase, clause and sentence. It has attracted great attention in both the community of natural language processing and computer vision. Current approaches usually employ an end-to-end framework to tackle the problem. However, we find that the entity information, including categories and attributes of the images, are ignored by most approaches. Such information is crucial for guaranteeing semantic alignment and generating image accurately. For two pictures of the same category, the emphasis of the corresponding text description may be different, but the images generated by these two sentences should have some similarities and the generation proce...
Nurani Venkitasubramanian A., Tuytelaars T., Moens M.-F., ''Entity linking across vision and languag...
Automatic image description systems typically produce generic sentences that only make use of a smal...
Automatically generating an accurate and meaningful description of an image is very challenging. How...
This paper describes a set of methods to link entities across images and text. As a corpus, we used ...
Abstract—We present a system to automatically generate natural language descriptions from images. Th...
Unifying multiple descriptions to determine the details of an everyday event can be a challenging ta...
Automatically generating images based on a natural language description is a challenging problem wit...
In this paper, we address the problem of automatically generating human-like descriptions for unseen...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
An urgent limitation in current Image Captioning models is their tendency to produce generic caption...
A picture is worth a thousand words, the adage reads. However, pictures cannot replace words in term...
Text-to-image synthesis is a fascinating area of research that aims to generate images based on text...
We automatically construct a dictionary of visual (possible to perceive on a picture) or non-visual ...
In this article, we describe a system that classifies relations between entities extracted from an i...
In this paper, we present the task of generating image descriptions with gold standard visual detect...
Nurani Venkitasubramanian A., Tuytelaars T., Moens M.-F., ''Entity linking across vision and languag...
Automatic image description systems typically produce generic sentences that only make use of a smal...
Automatically generating an accurate and meaningful description of an image is very challenging. How...
This paper describes a set of methods to link entities across images and text. As a corpus, we used ...
Abstract—We present a system to automatically generate natural language descriptions from images. Th...
Unifying multiple descriptions to determine the details of an everyday event can be a challenging ta...
Automatically generating images based on a natural language description is a challenging problem wit...
In this paper, we address the problem of automatically generating human-like descriptions for unseen...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
An urgent limitation in current Image Captioning models is their tendency to produce generic caption...
A picture is worth a thousand words, the adage reads. However, pictures cannot replace words in term...
Text-to-image synthesis is a fascinating area of research that aims to generate images based on text...
We automatically construct a dictionary of visual (possible to perceive on a picture) or non-visual ...
In this article, we describe a system that classifies relations between entities extracted from an i...
In this paper, we present the task of generating image descriptions with gold standard visual detect...
Nurani Venkitasubramanian A., Tuytelaars T., Moens M.-F., ''Entity linking across vision and languag...
Automatic image description systems typically produce generic sentences that only make use of a smal...
Automatically generating an accurate and meaningful description of an image is very challenging. How...