This paper describes the contribution by participants from Umeå University, Sweden, in collaboration with the University of Bern, Switzerland, for the Medical Domain Visual Question Answering challenge hosted by ImageCLEF 2019. We proposed a novel Visual Question Answering approach that leverages a bilinear model to aggregateand synthesize extracted image and question features. While we did not make use of any additional training data, our model used an attention scheme to focus on the relevant input context and was further boosted by using an ensemble of trained models. We show here that the proposed approach performs at state-of-the-art levels, and provides an improvement over several existing methods. The proposed method was ranked 3rd i...
Medical images are playing an important role in the medical domain. A mature medical visual question...
Visual Question Answering (VQA) models take an image and a natural-language question as input and in...
Deep Learning has had a transformative impact on Computer Vision, but for all of the success there i...
This paper describes the contribution by participants from Umeå University, Sweden, in collaboration...
© 2017 IEEE. Visual question answering (VQA) is challenging because it requires a simultaneous under...
Visual Question Answering (VQA) is a recently proposed multimodal task in the general area of machin...
This paper presents an overview of the inaugural edition of the ImageCLEF 2018 Medical Domain Visual...
This paper presents an overview of the Medical Visual Question Answering task (VQA-Med) at ImageCLEF...
Session - ImageCLEF: Multimedia Retrieval in Medicine, Lifelogging, and Internet.In this paper, we d...
© 2018 IEEE. Visual question answering (VQA) is challenging, because it requires a simultaneous unde...
Visual Question Answering (VQA) models aim to answer natural language questions about given images. ...
This paper presents an overview of the fourth edition of the Medical Visual Question Answering (VQA-...
This paper presents an overview of the Medical Visual Question Answering (VQA-Med) task at ImageCLEF...
The project aims to build a medical artificial intelligence (AI) assistant which has the potential t...
Visual Question Answering (VQA) is an extremely stimulating and challenging research area where Comp...
Medical images are playing an important role in the medical domain. A mature medical visual question...
Visual Question Answering (VQA) models take an image and a natural-language question as input and in...
Deep Learning has had a transformative impact on Computer Vision, but for all of the success there i...
This paper describes the contribution by participants from Umeå University, Sweden, in collaboration...
© 2017 IEEE. Visual question answering (VQA) is challenging because it requires a simultaneous under...
Visual Question Answering (VQA) is a recently proposed multimodal task in the general area of machin...
This paper presents an overview of the inaugural edition of the ImageCLEF 2018 Medical Domain Visual...
This paper presents an overview of the Medical Visual Question Answering task (VQA-Med) at ImageCLEF...
Session - ImageCLEF: Multimedia Retrieval in Medicine, Lifelogging, and Internet.In this paper, we d...
© 2018 IEEE. Visual question answering (VQA) is challenging, because it requires a simultaneous unde...
Visual Question Answering (VQA) models aim to answer natural language questions about given images. ...
This paper presents an overview of the fourth edition of the Medical Visual Question Answering (VQA-...
This paper presents an overview of the Medical Visual Question Answering (VQA-Med) task at ImageCLEF...
The project aims to build a medical artificial intelligence (AI) assistant which has the potential t...
Visual Question Answering (VQA) is an extremely stimulating and challenging research area where Comp...
Medical images are playing an important role in the medical domain. A mature medical visual question...
Visual Question Answering (VQA) models take an image and a natural-language question as input and in...
Deep Learning has had a transformative impact on Computer Vision, but for all of the success there i...