Training the Gaussian Process regression model on training centers only, which makes is applicable on large datasets.An asymmetric kernel formulation of the Gaussian Process regression model that adds to its descriptiveness.Learning individualized kernel metrics per data center.Effective use of the available training samples when learning the individualized kernel metrics.Learning for each data center not only the appropriate size but also the shape in the kernel metric. Display Omitted This work incorporates the multi-modality of the data distribution into a Gaussian Process regression model. We approach the problem from a discriminative perspective by learning, jointly over the training data, the target space variance in the neighborhood ...