Deep Learning (DL) components are routinely integrated into software systems that need to perform complex tasks such as image or natural language processing. The adequacy of the test data used to test such systems can be assessed by their ability to expose artificially injected faults (mutations) that simulate real DL faults. In this paper, we describe an approach to automatically generate new test inputs that can be used to augment the existing test set so that its capability to detect DL mutations increases. Our tool DeepMetis implements a search based input generation strategy. To account for the non-determinism of the training and the mutation processes, our fitness function involves multiple instances of the DL model under test. Exper...
This is the dataset of mutations generated for Speaker Recognition subject system by DeepCrime mutat...
Trained agents to be used in the replication package of the paper "Mutation Testing of Deep Reinforc...
Mutation testing is a type of software testing proposed in the 1970s where program statements are de...
Deep Learning (DL) solutions are increasingly adopted, but how to test them remains a major open res...
Deep Learning (DL) is increasingly adopted to solve complex tasks such as image recognition or auton...
Mutation testing is a well-established technique for assessing a test suite’s quality by injecting a...
Software has been an essential part of human life, and it substantially improves production and enri...
Recently many mutation testing tools have been proposed that rely on bug-fix patterns and natural la...
Successful deployment of Deep Neural Networks (DNNs), particularly in safety-critical systems, requi...
This is the dataset of mutations generated for UnityEyes and Movie Recommender subject systems by De...
This is the dataset of mutations generated for MNIST digit classifier by DeepCrime and DeepMutation+...
Assessing the quality of Deep Learning (DL) systems is crucial, as they are increasingly adopted in ...
This is the dataset of mutations generated for MNIST digit classifier by DeepCrime mutation testing ...
Deep Learning (DL) has revolutionized the capabilities of vision-based systems (VBS) in critical app...
We introduce µBert, a mutation testing tool that uses a pre-trained language model (CodeBERT) to gen...
This is the dataset of mutations generated for Speaker Recognition subject system by DeepCrime mutat...
Trained agents to be used in the replication package of the paper "Mutation Testing of Deep Reinforc...
Mutation testing is a type of software testing proposed in the 1970s where program statements are de...
Deep Learning (DL) solutions are increasingly adopted, but how to test them remains a major open res...
Deep Learning (DL) is increasingly adopted to solve complex tasks such as image recognition or auton...
Mutation testing is a well-established technique for assessing a test suite’s quality by injecting a...
Software has been an essential part of human life, and it substantially improves production and enri...
Recently many mutation testing tools have been proposed that rely on bug-fix patterns and natural la...
Successful deployment of Deep Neural Networks (DNNs), particularly in safety-critical systems, requi...
This is the dataset of mutations generated for UnityEyes and Movie Recommender subject systems by De...
This is the dataset of mutations generated for MNIST digit classifier by DeepCrime and DeepMutation+...
Assessing the quality of Deep Learning (DL) systems is crucial, as they are increasingly adopted in ...
This is the dataset of mutations generated for MNIST digit classifier by DeepCrime mutation testing ...
Deep Learning (DL) has revolutionized the capabilities of vision-based systems (VBS) in critical app...
We introduce µBert, a mutation testing tool that uses a pre-trained language model (CodeBERT) to gen...
This is the dataset of mutations generated for Speaker Recognition subject system by DeepCrime mutat...
Trained agents to be used in the replication package of the paper "Mutation Testing of Deep Reinforc...
Mutation testing is a type of software testing proposed in the 1970s where program statements are de...