Fuzzing is the process of finding security vulnerabilities in input-processing code by repeatedly testing the code with modified inputs. In this paper, we formalize fuzzing as a reinforcement learning problem using the concept of Markov decision processes. This in turn allows us to apply state-of-the-art deep Q -learning algorithms that optimize rewards, which we define from runtime properties of the program under test. By observing the rewards caused by mutating with a specific set of actions performed on an initial program input, the fuzzing agent learns a policy that can next generate new higher-reward inputs. We have implemented this new approach, and preliminary empirical evidence shows that reinforcement fuzzing can outperform baselin...
Detection of malicious behavior is a fundamental problem in security. One of the major challenges in...
Abstract—The aim of intelligent techniques, termed game AI, used in computer video games is to provi...
Code: https://github.com/google-research/google-research/tree/master/munchausen_rlInternational audi...
Fuzzing is the process of finding security vulnerabilities in input-processing code by repeatedly te...
Generally, the present disclosure is directed to using machine learning to manage a trade-off betwee...
Recent studies have shown that reinforcement learning (RL) models are vulnerable in various noisy sc...
Programmatic Reinforcement Learning is the study of learning algorithms that can leverage partial sy...
Abstract. Fuzzing consists of repeatedly testing an application with modified, or fuzzed, inputs wit...
Doctor of PhilosophyDepartment of Computer ScienceArslan MunirWilliam H. HsuSince the inception of D...
Reward shaping is an efficient way to incorporate domain knowledge into a reinforcement learning age...
Q-learning can be used to find an optimal action-selection policy for any given finite Markov Decisi...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...
We present a new adversarial learning method for deep reinforcement learning (DRL). Based on this me...
In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversar...
Machine learning plays a pivotal role in artificial intelligence, allowing machines to mimic human l...
Detection of malicious behavior is a fundamental problem in security. One of the major challenges in...
Abstract—The aim of intelligent techniques, termed game AI, used in computer video games is to provi...
Code: https://github.com/google-research/google-research/tree/master/munchausen_rlInternational audi...
Fuzzing is the process of finding security vulnerabilities in input-processing code by repeatedly te...
Generally, the present disclosure is directed to using machine learning to manage a trade-off betwee...
Recent studies have shown that reinforcement learning (RL) models are vulnerable in various noisy sc...
Programmatic Reinforcement Learning is the study of learning algorithms that can leverage partial sy...
Abstract. Fuzzing consists of repeatedly testing an application with modified, or fuzzed, inputs wit...
Doctor of PhilosophyDepartment of Computer ScienceArslan MunirWilliam H. HsuSince the inception of D...
Reward shaping is an efficient way to incorporate domain knowledge into a reinforcement learning age...
Q-learning can be used to find an optimal action-selection policy for any given finite Markov Decisi...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...
We present a new adversarial learning method for deep reinforcement learning (DRL). Based on this me...
In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversar...
Machine learning plays a pivotal role in artificial intelligence, allowing machines to mimic human l...
Detection of malicious behavior is a fundamental problem in security. One of the major challenges in...
Abstract—The aim of intelligent techniques, termed game AI, used in computer video games is to provi...
Code: https://github.com/google-research/google-research/tree/master/munchausen_rlInternational audi...