Surprise Adequacy (SA) is one of the emerging and most promising adequacy criteria for Deep Learning (DL) testing. As an adequacy criterion, it has been used to assess the strength of DL test suites. In addition, it has also been used to find inputs to a Deep Neural Network (DNN) which were not sufficiently represented in the training data, or to select samples for DNN retraining. However, computation of the SA metric for a test suite can be prohibitively expensive, as it involves a quadratic number of distance calculations. Hence, we developed and released a performance-optimized, but functionally equivalent, implementation of SA, reducing the evaluation time by up to 97%. We also propose refined variants of the SA computation algorithm,...
Despite impressive capabilities and outstanding performance, deep neural networks (DNNs) have captur...
The utilisation of Deep Learning (DL) raises new challenges regarding its dependability in critical ...
In general, Deep Neural Networks (DNNs) are evaluated by the generalization performance measured on ...
Deep Learning (DL) systems are rapidly being adopted in safety and security critical domains, urgent...
Deep Neural Networks (DNN) are typically tested for accuracy relying on a set of unlabelled real wor...
Context: Deep Neural Networks (DNN) have shown great promise in various domains, for example to supp...
DNN testing is one of the most effective methods to guarantee the quality of DNN. In DNN testing, ma...
Deep Learning (DL) has revolutionized the capabilities of vision-based systems (VBS) in critical app...
Testing deep learning-based systems is crucial but challenging due to the required time and labor fo...
Successful deployment of Deep Neural Networks (DNNs), particularly in safety-critical systems, requi...
Deep Learning (DL) is having a transformational effect in critical areas such as finance, healthcare...
peer reviewedSimilar to traditional software that is constantly under evolution, deep neural network...
Deep learning (DL) models are trained on sampled data, where the distribution of training data diffe...
Deep learning (DL) training is nondeterministic and such nondeterminism was shown to cause significa...
Reliable and robust evaluation methods are a necessary first step towards developing machine learnin...
Despite impressive capabilities and outstanding performance, deep neural networks (DNNs) have captur...
The utilisation of Deep Learning (DL) raises new challenges regarding its dependability in critical ...
In general, Deep Neural Networks (DNNs) are evaluated by the generalization performance measured on ...
Deep Learning (DL) systems are rapidly being adopted in safety and security critical domains, urgent...
Deep Neural Networks (DNN) are typically tested for accuracy relying on a set of unlabelled real wor...
Context: Deep Neural Networks (DNN) have shown great promise in various domains, for example to supp...
DNN testing is one of the most effective methods to guarantee the quality of DNN. In DNN testing, ma...
Deep Learning (DL) has revolutionized the capabilities of vision-based systems (VBS) in critical app...
Testing deep learning-based systems is crucial but challenging due to the required time and labor fo...
Successful deployment of Deep Neural Networks (DNNs), particularly in safety-critical systems, requi...
Deep Learning (DL) is having a transformational effect in critical areas such as finance, healthcare...
peer reviewedSimilar to traditional software that is constantly under evolution, deep neural network...
Deep learning (DL) models are trained on sampled data, where the distribution of training data diffe...
Deep learning (DL) training is nondeterministic and such nondeterminism was shown to cause significa...
Reliable and robust evaluation methods are a necessary first step towards developing machine learnin...
Despite impressive capabilities and outstanding performance, deep neural networks (DNNs) have captur...
The utilisation of Deep Learning (DL) raises new challenges regarding its dependability in critical ...
In general, Deep Neural Networks (DNNs) are evaluated by the generalization performance measured on ...