Zero-shot semantic segmentation aims to recognize the semantics of pixels from unseen categories with zero training samples. Previous practice [1] proposed to train the classifiers for unseen categories using the visual features generated from semantic word embeddings. However, the generator is merely learned on the seen categories while no constraint is applied to the unseen categories, leading to poor generalization ability. In this work, we propose a Consistent Structural Relation Learning (CSRL) approach to constrain the generating of unseen visual features by exploiting the structural relations between seen and unseen categories. We observe that different categories are usually with similar relations in either semantic word embedding s...
How to equip machines with the ability to understand an image and explain everything in it has a lon...
Zero-shot learning (ZSL) is made possible by learning a projection function between a feature space ...
Semantic segmentation is one of the most fundamental problems in computer vision and pixel-level lab...
Zero-shot learning has gained popularity due to its potential to scale recognition models without re...
Zero-shot learning (ZSL) is widely studied in recent years to solve the problem of lacking annotatio...
Zero-shot learning aims to classify unseen image categories by learning a visual-semantic embedding ...
Zero-shot learning aims to classify visual objects without any training data via knowledge transfer ...
Semantic Image Interpretation is the task of extracting a structured semantic description from image...
Zero-shot learning (ZSL) aims to bridge the knowledge transfer via available semantic representation...
Human beings have the remarkable ability to recognize novel visual objects only based on the descrip...
Zero-shot learning (ZSL) aims at recognizing classes for which no visual sample is available at trai...
International audienceSemantic segmentation models are limited in their ability to scale to large nu...
Zero-shot object detection is an emerging research topic that aims to recognize and localize previou...
Conventional zero-shot learning (ZSL) methods recognise an unseen instance by projecting its visual ...
Automatically recognizing a large number of action categories from videos is of significant importan...
How to equip machines with the ability to understand an image and explain everything in it has a lon...
Zero-shot learning (ZSL) is made possible by learning a projection function between a feature space ...
Semantic segmentation is one of the most fundamental problems in computer vision and pixel-level lab...
Zero-shot learning has gained popularity due to its potential to scale recognition models without re...
Zero-shot learning (ZSL) is widely studied in recent years to solve the problem of lacking annotatio...
Zero-shot learning aims to classify unseen image categories by learning a visual-semantic embedding ...
Zero-shot learning aims to classify visual objects without any training data via knowledge transfer ...
Semantic Image Interpretation is the task of extracting a structured semantic description from image...
Zero-shot learning (ZSL) aims to bridge the knowledge transfer via available semantic representation...
Human beings have the remarkable ability to recognize novel visual objects only based on the descrip...
Zero-shot learning (ZSL) aims at recognizing classes for which no visual sample is available at trai...
International audienceSemantic segmentation models are limited in their ability to scale to large nu...
Zero-shot object detection is an emerging research topic that aims to recognize and localize previou...
Conventional zero-shot learning (ZSL) methods recognise an unseen instance by projecting its visual ...
Automatically recognizing a large number of action categories from videos is of significant importan...
How to equip machines with the ability to understand an image and explain everything in it has a lon...
Zero-shot learning (ZSL) is made possible by learning a projection function between a feature space ...
Semantic segmentation is one of the most fundamental problems in computer vision and pixel-level lab...