Computer Vision is a scientific discipline which involves the development of an algorithmic basis for the construction of intelligent systems that aim at analysis, understanding and extraction of useful information from visual data. This visual data can be plain images, video sequences, views from multiple cameras, etc. Natural Language Processing (NLP), is the ability of machines to read and understand human languages. Visual Question Answering (VQA), is a multi-discipline Artificial Intelligence (AI) research problem, which is a combination of Natural Language Processing (NLP), Computer Vision (CV), and Knowledge Reasoning (KR). Given an image and a question related to the image in natural language, the algorithm has to output an accurate...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
Many vision and language tasks require commonsense reasoning beyond data-driven image and natural la...
Given visual input and a natural language question about it, the visual question answering (VQA) tas...
Computer Vision is a scientific discipline which involves the development of an algorithmic basis fo...
With advances of internet computing and a great success of social media websites, internet is explod...
There has been immense progress in the fields of computer vision, object detection and natural langu...
Visual Question Answering (VQA) is a stimulating process in the field of Natural Language Processing ...
Visual Question Answering (VQA) is a task for evaluating image scene understanding abilities and sho...
Together with the development of more accurate methods in Computer Vision and Natural Language Under...
Wearable cameras generate a large amount of photos which are, in many cases, useless or redundant. O...
This bachelor's thesis explores different deep learning techniques to solve the Visual Question-Answ...
We propose a novel attention based deep learning ar-chitecture for visual question answering task (V...
One of the most intriguing features of the Visual Question Answering (VQA) challenge is the unpredic...
A Visual Question Answering (VQA) task is the ability of a system to take an image and an open-ended...
In this paper, we propose to employ the convolutional neural network (CNN) for the image question an...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
Many vision and language tasks require commonsense reasoning beyond data-driven image and natural la...
Given visual input and a natural language question about it, the visual question answering (VQA) tas...
Computer Vision is a scientific discipline which involves the development of an algorithmic basis fo...
With advances of internet computing and a great success of social media websites, internet is explod...
There has been immense progress in the fields of computer vision, object detection and natural langu...
Visual Question Answering (VQA) is a stimulating process in the field of Natural Language Processing ...
Visual Question Answering (VQA) is a task for evaluating image scene understanding abilities and sho...
Together with the development of more accurate methods in Computer Vision and Natural Language Under...
Wearable cameras generate a large amount of photos which are, in many cases, useless or redundant. O...
This bachelor's thesis explores different deep learning techniques to solve the Visual Question-Answ...
We propose a novel attention based deep learning ar-chitecture for visual question answering task (V...
One of the most intriguing features of the Visual Question Answering (VQA) challenge is the unpredic...
A Visual Question Answering (VQA) task is the ability of a system to take an image and an open-ended...
In this paper, we propose to employ the convolutional neural network (CNN) for the image question an...
Using deep learning, computer vision now rivals people at object recognition and detection, opening ...
Many vision and language tasks require commonsense reasoning beyond data-driven image and natural la...
Given visual input and a natural language question about it, the visual question answering (VQA) tas...