Deployed machine learning (ML) models often encounter new user data that differs from their training data. Therefore, estimating how well a given model might perform on the new data is an important step toward reliable ML applications. This is very challenging, however, as the data distribution can change in flexible ways, and we may not have any labels on the new data, which is often the case in monitoring settings. In this paper, we propose a new distribution shift model, Sparse Joint Shift (SJS), which considers the joint shift of both labels and a few features. This unifies and generalizes several existing shift models including label shift and sparse covariate shift, where only marginal feature or label distribution shifts are consider...
Machine learning (ML) has become ubiquitous in various disciplines and applications, serving as a po...
In machine learning, we traditionally evaluate the performance of a single model, averaged over a co...
A common use case of machine learning in real world settings is to learn a model from historical dat...
Recent interest in the external validity of prediction models (i.e., the problem of different train ...
Factorizable joint shift (FJS) was recently proposed as a type of dataset shift for which the comple...
When deployed in the real world, machine learning models inevitably encounter changes in the data di...
Machine learning algorithms typically assume that training and test examples are drawn from the same...
The performance of machine learning models under distribution shift has been the focus of the commun...
We propose Regularized Learning under Label shifts (RLLS), a principled and a practical domain-adapt...
When evaluating the performance of clinical machine learning models, one must consider the deploymen...
As input data distributions evolve, the predictive performance of machine learning models tends to d...
Transfer Learning is an area of statistics and machine learning research that seeks answers to the f...
Monitoring machine learning models once they are deployed is challenging. It is even more challengin...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Performance estimation under covariate shift is a crucial component of safe AI model deployment, esp...
Machine learning (ML) has become ubiquitous in various disciplines and applications, serving as a po...
In machine learning, we traditionally evaluate the performance of a single model, averaged over a co...
A common use case of machine learning in real world settings is to learn a model from historical dat...
Recent interest in the external validity of prediction models (i.e., the problem of different train ...
Factorizable joint shift (FJS) was recently proposed as a type of dataset shift for which the comple...
When deployed in the real world, machine learning models inevitably encounter changes in the data di...
Machine learning algorithms typically assume that training and test examples are drawn from the same...
The performance of machine learning models under distribution shift has been the focus of the commun...
We propose Regularized Learning under Label shifts (RLLS), a principled and a practical domain-adapt...
When evaluating the performance of clinical machine learning models, one must consider the deploymen...
As input data distributions evolve, the predictive performance of machine learning models tends to d...
Transfer Learning is an area of statistics and machine learning research that seeks answers to the f...
Monitoring machine learning models once they are deployed is challenging. It is even more challengin...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Performance estimation under covariate shift is a crucial component of safe AI model deployment, esp...
Machine learning (ML) has become ubiquitous in various disciplines and applications, serving as a po...
In machine learning, we traditionally evaluate the performance of a single model, averaged over a co...
A common use case of machine learning in real world settings is to learn a model from historical dat...