Automatic generation of textual stories from visual data representation, known as visual storytelling, is a recent advancement in the problem of images-to-text. Instead of using a single image as input, visual storytelling processes a sequential array of images into coherent sentences. A story contains non-visual concepts as well as descriptions of literal object(s). While previous approaches have applied external knowledge, our approach was to regard the non-visual concept as the semantic correlation between visual modality and textual modality. This paper, therefore, presents new features representation based on a canonical correlation analysis between two modalities. Attention mechanism are adopted as the underlying architecture of the i...
This paper presents a framework for indexing and browsing databases of stories, in particular charac...
Visual storytelling aims to automatically generate a human-like short story given an image stream. M...
| openaire: EC/H2020/780069/EU//MeMADThis chapter focuses on the recent surge of interest in automat...
Story visualization aims to generate a series of images, semantically matching a given sequence of s...
© 2021 IEEEPrevious models for vision-to-language generation tasks usually pretrain a visual encoder...
Multimodal language analysis often considers relationships between features based on text and those ...
Characters are essential to the plot of any story. Establishing the characters before writing a stor...
When speakers describe an image, they tend to look at objects before mentioning them. In this paper,...
We address the problem of visual storytelling, i.e., generating a story for a given sequence of imag...
Abstract Automatic generation of natural language description for individual images (a.k.a. image ca...
Photos, drawings, figures, etc. supplement textual information in various kinds of media, for exampl...
This paper presents a novel approach for automatically generating image descriptions: visual detecto...
Language and vision provide complementary information. Integrating both modalities in a single multi...
Language and vision provide complementary information. Integrating both modalities in a single multi...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
This paper presents a framework for indexing and browsing databases of stories, in particular charac...
Visual storytelling aims to automatically generate a human-like short story given an image stream. M...
| openaire: EC/H2020/780069/EU//MeMADThis chapter focuses on the recent surge of interest in automat...
Story visualization aims to generate a series of images, semantically matching a given sequence of s...
© 2021 IEEEPrevious models for vision-to-language generation tasks usually pretrain a visual encoder...
Multimodal language analysis often considers relationships between features based on text and those ...
Characters are essential to the plot of any story. Establishing the characters before writing a stor...
When speakers describe an image, they tend to look at objects before mentioning them. In this paper,...
We address the problem of visual storytelling, i.e., generating a story for a given sequence of imag...
Abstract Automatic generation of natural language description for individual images (a.k.a. image ca...
Photos, drawings, figures, etc. supplement textual information in various kinds of media, for exampl...
This paper presents a novel approach for automatically generating image descriptions: visual detecto...
Language and vision provide complementary information. Integrating both modalities in a single multi...
Language and vision provide complementary information. Integrating both modalities in a single multi...
Texts and images provide alternative, yet orthogonal views of the same underlying cognitive concept....
This paper presents a framework for indexing and browsing databases of stories, in particular charac...
Visual storytelling aims to automatically generate a human-like short story given an image stream. M...
| openaire: EC/H2020/780069/EU//MeMADThis chapter focuses on the recent surge of interest in automat...