With the growing popularity of smartphones, people enjoy sharing their stories by posting photos on social media platforms. It would therefore be convenient if stories could be written automatically once users upload their photos. Thanks to major improvements in deep learning techniques and computational power, it is now possible to generate such a story from users’ input images. The objective of this project is to explore and design a deep learning model for the visual storytelling task. More specifically, the project aims to develop a deep learning model that generates a five-sentence story from five given photos. The first part of the project focuses on comparing the latest techniques used for visual storytelling and evaluating their pe...
Impressive image captioning results (i.e., an objective description for an image) are achieved with ...
Applying machine learning (ML), and especially deep learning, to understand visual content is becomi...
Stories are diverse and highly personalized, resulting in a large possible output space for story ge...
Currently, an increasing number of people are starting to use smartphones to take photos in their da...
With the development of mobile devices, image usage is becoming more and more popular....
In the modern era, image captioning has become one of the most widely required tools. Moreover, ther...
As deep learning emerges from machine learning to become a leading technology in today’s day and age...
Video captioning refers to the process of conveying information of video clips through automatically...
Research that applies artificial intelligence (AI) to generate the captions for an image has been ex...
Visual storytelling is the task of creating a short story based on photo streams. Unlike existing visu...
In 2018, the number of mobile phone users will reach about 4.9 billion. Assuming an aver...
With the rapid development of science and technology, high-technology tools have been widely used in...
A common problem linking computer vision and natural language processing is the ability to generate ...
Humans have the innate ability to perceive an image just by looking at it; for us, images are not jus...