Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-bounding-box) supervision requires a large amount of weakly labeled data. We propose two methods for selecting the most relevant data with weak supervision. The first method is designed for finding visually similar images without the need of labels and is based on modeling image representations with a Gaussian Mixture Model (GMM). As a byproduct of GMM modeling, we present useful insights on characterizing the data generating distribution. The second method aims at finding images with high object diversity and requires only the bounding box labels. Both methods are developed in the context of automated driving and experimentation is conducted on ...
Modern deep learning has enabled amazing developments of computer vision in recent years (Hinton and...
A fundamental key-point for the recent success of deep learning models is the availability of large ...
Semantic segmentation is a popular visual recognition task whose goal is to estimate pixel-level obj...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Deep convolutional neural networks (DCNNs) trained on a large number of images with strong pixel-lev...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
We consider the task of learning a classifier for semantic segmentation using weak supervision in th...
Modern deep learning has enabled amazing developments of computer vision in recent years (Hinton and...
A fundamental key-point for the recent success of deep learning models is the availability of large ...
Semantic segmentation is a popular visual recognition task whose goal is to estimate pixel-level obj...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-boun...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Deep convolutional neural networks (DCNNs) trained on a large number of images with strong pixel-lev...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
Training convolutional networks for semantic segmentation requires per-pixel ground truth labels, wh...
We consider the task of learning a classifier for semantic segmentation using weak supervision in th...
Modern deep learning has enabled amazing developments of computer vision in recent years (Hinton and...
A fundamental key-point for the recent success of deep learning models is the availability of large ...
Semantic segmentation is a popular visual recognition task whose goal is to estimate pixel-level obj...