This paper presents a novel evaluation framework for Out-of-Distribution (OOD) detection that aims to assess the performance of machine learning models in more realistic settings. We observed that the real-world requirements for testing OOD detection methods are not satisfied by the current testing protocols. They usually encourage methods to have a strong bias towards a low level of diversity in normal data. To address this limitation, we propose new OOD test datasets (CIFAR-10-R, CIFAR-100-R, and ImageNet-30-R) that can allow researchers to benchmark OOD detection performance under realistic distribution shifts. Additionally, we introduce a Generalizability Score (GS) to measure the generalization ability of a model during OOD detection. ...
Deep learning methods have boosted the adoption of NLP systems in real-life applications. However, t...
Out-of-distribution (OOD) detection is important for machine learning models deployed in the wild. R...
While deep learning models have seen widespread success in controlled environments, there are still ...
Towards Realistic Out-of-Distribution Detection: A Novel Evaluation Framework for Improving Generali...
Towards Realistic Out-of-Distribution Detection: A Novel Evaluation Framework for Improving Generali...
Out-of-distribution (OOD) detection is a critical task for ensuring the reliability and safety of de...
One critical challenge in deploying highly performant machine learning models in real-life applicati...
Out-of-distribution detection is a common issue in deploying vision models in practice and solving i...
Out-of-distribution (OOD) detection is critical to ensuring the reliability and safety of machine le...
In the problem of out-of-distribution (OOD) detection, the usage of auxiliary data as outlier data f...
Out-of-distribution (OOD) detection has recently gained substantial attention due to the importance ...
The deployment of machine learning solutions in real-world scenarios often involves addressing the c...
The separation between training and deployment of machine learning models implies that not all scena...
The task of out-of-distribution (OOD) detection is vital to realize safe and reliable operation for ...
Detection of Out-of-Distribution (OOD) samples in real time is a crucial safety check for deployment...
Deep learning methods have boosted the adoption of NLP systems in real-life applications. However, t...
Out-of-distribution (OOD) detection is important for machine learning models deployed in the wild. R...
While deep learning models have seen widespread success in controlled environments, there are still ...
Towards Realistic Out-of-Distribution Detection: A Novel Evaluation Framework for Improving Generali...
Towards Realistic Out-of-Distribution Detection: A Novel Evaluation Framework for Improving Generali...
Out-of-distribution (OOD) detection is a critical task for ensuring the reliability and safety of de...
One critical challenge in deploying highly performant machine learning models in real-life applicati...
Out-of-distribution detection is a common issue in deploying vision models in practice and solving i...
Out-of-distribution (OOD) detection is critical to ensuring the reliability and safety of machine le...
In the problem of out-of-distribution (OOD) detection, the usage of auxiliary data as outlier data f...
Out-of-distribution (OOD) detection has recently gained substantial attention due to the importance ...
The deployment of machine learning solutions in real-world scenarios often involves addressing the c...
The separation between training and deployment of machine learning models implies that not all scena...
The task of out-of-distribution (OOD) detection is vital to realize safe and reliable operation for ...
Detection of Out-of-Distribution (OOD) samples in real time is a crucial safety check for deployment...
Deep learning methods have boosted the adoption of NLP systems in real-life applications. However, t...
Out-of-distribution (OOD) detection is important for machine learning models deployed in the wild. R...
While deep learning models have seen widespread success in controlled environments, there are still ...