In this paper we investigate the use of graph embedding networks, with unsupervised features learning, as neural architecture to learn over binary functions. We propose several ways of automatically extract features from the control flow graph (CFG) and we use the structure2vec graph embedding techniques to translate a CFG to a vectors of real numbers. We train and test our proposed architectures on two different binary analysis tasks: binary similarity, and, compiler provenance. We show that the unsupervised extraction of features improves the accuracy on the above tasks, when compared with embedding vectors obtained from a CFG annotated with manually engineered features (i.e., ACFG proposed in [39]). We additionally compare the resul...
Network embedding aims at learning the low dimensional representation of nodes. These representation...
A multitude of important real-world or synthetic systems possess network structures. Extending learn...
Graph structures are a powerful abstraction of many real-world data, such as human interactions and ...
In this paper we consider the binary similarity problem that consists in determining if two binary f...
Graph embedding is a transformation of nodes of a graph into a set of vectors. A good embedding shou...
International audienceWe consider the problem of recovering the compiling chain used to generate a g...
A main challenge in mining network-based data is finding effective ways to represent or encode graph...
Graphs are a rich and versatile data structure. They are widely used in representing data like socia...
Graphs are natural representations of problems and data in many fields. For example, in computationa...
Recently, there have been some breakthroughs in graph analysis by applying the graph neural networks...
Graph-structured data appears frequently in domains including chemistry, natural language semantics,...
Techniques for learning vectorial representations of graphs (graph embeddings) have recently emerged...
Binary code similarity detection, whose goal is to detect similar binary functions without having ac...
Thesis will be uploaded upon expiry of the journal embargo on Chapter 3 in July 2023.Graph data cons...
The main objective of this workshop is to bring together researchers in the machine learning and pro...
Network embedding aims at learning the low dimensional representation of nodes. These representation...
A multitude of important real-world or synthetic systems possess network structures. Extending learn...
Graph structures are a powerful abstraction of many real-world data, such as human interactions and ...
In this paper we consider the binary similarity problem that consists in determining if two binary f...
Graph embedding is a transformation of nodes of a graph into a set of vectors. A good embedding shou...
International audienceWe consider the problem of recovering the compiling chain used to generate a g...
A main challenge in mining network-based data is finding effective ways to represent or encode graph...
Graphs are a rich and versatile data structure. They are widely used in representing data like socia...
Graphs are natural representations of problems and data in many fields. For example, in computationa...
Recently, there have been some breakthroughs in graph analysis by applying the graph neural networks...
Graph-structured data appears frequently in domains including chemistry, natural language semantics,...
Techniques for learning vectorial representations of graphs (graph embeddings) have recently emerged...
Binary code similarity detection, whose goal is to detect similar binary functions without having ac...
Thesis will be uploaded upon expiry of the journal embargo on Chapter 3 in July 2023.Graph data cons...
The main objective of this workshop is to bring together researchers in the machine learning and pro...
Network embedding aims at learning the low dimensional representation of nodes. These representation...
A multitude of important real-world or synthetic systems possess network structures. Extending learn...
Graph structures are a powerful abstraction of many real-world data, such as human interactions and ...