Transformer-based approaches have shown good results in image captioning tasks. However, current approaches have a limitation in generating text from global features of an entire image. Therefore, we propose novel methods for generating better image captioning as follows: (1) The Global-Local Visual Extractor (GLVE) to capture both global features and local features. (2) The Cross Encoder-Decoder Transformer (CEDT) for injecting multiple-level encoder features into the decoding process. GLVE extracts not only global visual features that can be obtained from an entire image, such as size of organ or bone structure, but also local visual features that can be generated from a local region, such as lesion area. Given an image, CEDT can create a...
The image captioning task is increasingly prevalent in artificial intelligence applications for medi...
Image captioning aims to generate a corresponding description of an image. In recent years, neural e...
Medical image captioning is a very challenging task that has been rarely addressed in the literature...
Automatic captioning of images is a task that combines the challenges of image analysis and text gen...
The steadily increasing number of medical images places a tremendous burden on doctors, who toned to...
International audienceMedical image segmentation remains particularly challenging for complex and lo...
Image Captioning is the task of providing a natural language description for an image. It has caught...
We propose a novel transformer, capable of segmenting medical images of varying modalities. Challeng...
The domain of Deep Learning that is related to generation of textual description of images is called...
Automatic captioning of images is a task that combines the challenges of image analysis and text gen...
The Transformer-based approach represents the state-of-the-art in image captioning. However, existin...
Medical image segmentation is a crucial way to assist doctors in the accurate diagnosis of diseases....
A methodology is described for the generation of relevant captions for images of an extensiv...
Video captioning via encoder–decoder structures is a successful sentence generation method. In addit...
Image captioning is a challenging task. Meanwhile, it is important for the machine to understand the...
The image captioning task is increasingly prevalent in artificial intelligence applications for medi...
Image captioning aims to generate a corresponding description of an image. In recent years, neural e...
Medical image captioning is a very challenging task that has been rarely addressed in the literature...
Automatic captioning of images is a task that combines the challenges of image analysis and text gen...
The steadily increasing number of medical images places a tremendous burden on doctors, who toned to...
International audienceMedical image segmentation remains particularly challenging for complex and lo...
Image Captioning is the task of providing a natural language description for an image. It has caught...
We propose a novel transformer, capable of segmenting medical images of varying modalities. Challeng...
The domain of Deep Learning that is related to generation of textual description of images is called...
Automatic captioning of images is a task that combines the challenges of image analysis and text gen...
The Transformer-based approach represents the state-of-the-art in image captioning. However, existin...
Medical image segmentation is a crucial way to assist doctors in the accurate diagnosis of diseases....
A methodology is described for the generation of relevant captions for images of an extensiv...
Video captioning via encoder–decoder structures is a successful sentence generation method. In addit...
Image captioning is a challenging task. Meanwhile, it is important for the machine to understand the...
The image captioning task is increasingly prevalent in artificial intelligence applications for medi...
Image captioning aims to generate a corresponding description of an image. In recent years, neural e...
Medical image captioning is a very challenging task that has been rarely addressed in the literature...