Computer Vision is a scientific discipline which involves the development of an algorithmic basis for the construction of intelligent systems that aim at analysis, understanding and extraction of useful information from visual data. This visual data can be plain images, video sequences, views from multiple cameras, etc. Natural Language Processing (NLP), is the ability of machines to read and understand human languages. Visual Question Answering (VQA), is a multi-discipline Artificial Intelligence (AI) research problem, which is a combination of Natural Language Processing (NLP), Computer Vision (CV), and Knowledge Reasoning (KR). Given an image and a question related to the image in natural language, the algorithm has to output an accurate...
Together with the development of more accurate methods in Computer Vision and Natural Language Under...
Computer Vision has undergone major changes over the recent five years. Here, we investigate if the ...
This work aims to address the problem of image-based question-answering (QA) with new models and dat...
Computer Vision is a scientific discipline which involves the development of an algorithmic basis fo...
Visual Question Answering (VQA) is a stimulating process in the field of Natural Language Processing...
There has been immense progress in the fields of computer vision, object detection and natural langu...
With advances of internet computing and a great success of social media websites, internet is explod...
We propose a novel attention based deep learning ar-chitecture for visual question answering task (V...
In this paper, we propose to employ the convolutional neural network (CNN) for the image question an...
Wearable cameras generate a large amount of photos which are, in many cases, useless or redundant. O...
Many vision and language tasks require commonsense reasoning beyond data-driven image and natural la...
Visual Question Answering (VQA) is an extremely stimulating and challenging research area where Comp...
We propose a method for visual question answering which combines an internal representation of the c...
Visual Question Answering (VQA) is a task for evaluating image scene understanding abilities and sho...
This bachelor's thesis explores different deep learning techniques to solve the Visual Question-Answ...
Together with the development of more accurate methods in Computer Vision and Natural Language Under...
Computer Vision has undergone major changes over the recent five years. Here, we investigate if the ...
This work aims to address the problem of image-based question-answering (QA) with new models and dat...
Computer Vision is a scientific discipline which involves the development of an algorithmic basis fo...
Visual Question Answering (VQA) is a stimulating process in the field of Natural Language Processing...
There has been immense progress in the fields of computer vision, object detection and natural langu...
With advances of internet computing and a great success of social media websites, internet is explod...
We propose a novel attention based deep learning ar-chitecture for visual question answering task (V...
In this paper, we propose to employ the convolutional neural network (CNN) for the image question an...
Wearable cameras generate a large amount of photos which are, in many cases, useless or redundant. O...
Many vision and language tasks require commonsense reasoning beyond data-driven image and natural la...
Visual Question Answering (VQA) is an extremely stimulating and challenging research area where Comp...
We propose a method for visual question answering which combines an internal representation of the c...
Visual Question Answering (VQA) is a task for evaluating image scene understanding abilities and sho...
This bachelor's thesis explores different deep learning techniques to solve the Visual Question-Answ...
Together with the development of more accurate methods in Computer Vision and Natural Language Under...
Computer Vision has undergone major changes over the recent five years. Here, we investigate if the ...
This work aims to address the problem of image-based question-answering (QA) with new models and dat...