International audienceThe last decades improvements in processing abilities have quickly led to an increasing use of data analyses implying massive data-sets. To retrieve insightful information from any data driven approach, a pivotal aspect to ensure is good data quality. Manual correction of massive data-sets requires tremendous efforts, is prone to errors, and results being really costly. If knowledge in a specific field can often allow the development of efficient models for anomaly detection and data correction, this knowledge can sometimes be unavailable and a more generic approach should be found. This paper presents a novel approach to anomaly detection and correction in mixed tabular data using Bayesian Networks. We present an algo...
Despite the fact that Dynamic Bayesian Network models have become a popular modelling platform to ma...
Motivation Mixed molecular data combines continuous and categorical features of the same samples, s...
One essential and challenging task in data science is data cleaning - the process of identifying and...
Today, there has been a massive proliferation of huge databases storing valuable information. The op...
We focus on automatic anomaly detection in SQL databases for security systems.Many logs of database ...
Anomaly detection for mixed-type data is an important problem that has not been well addressed in th...
• A submitted manuscript is the version of the article upon submission and before peer-review. There...
Learning the network structure of a large graph is computationally demanding, and dynamically monito...
Recent advances in sensor technology are facilitating the deployment of sensors into the environment...
Attributed graphs, where nodes are associated with a rich set of attributes, have been widely used i...
We propose a novel approach which combines the use of Bayesian network and probabilistic association...
Anomaly analysis is of great interest to diverse fields, including data mining and machine learning,...
Anomaly detection has become a crucial part of the protection of information and integrity. Due to t...
Anomaly detection methods can be very useful in identifying interesting or concerning events. In thi...
181 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.Anomaly detection is the task...
Despite the fact that Dynamic Bayesian Network models have become a popular modelling platform to ma...
Motivation Mixed molecular data combines continuous and categorical features of the same samples, s...
One essential and challenging task in data science is data cleaning - the process of identifying and...
Today, there has been a massive proliferation of huge databases storing valuable information. The op...
We focus on automatic anomaly detection in SQL databases for security systems.Many logs of database ...
Anomaly detection for mixed-type data is an important problem that has not been well addressed in th...
• A submitted manuscript is the version of the article upon submission and before peer-review. There...
Learning the network structure of a large graph is computationally demanding, and dynamically monito...
Recent advances in sensor technology are facilitating the deployment of sensors into the environment...
Attributed graphs, where nodes are associated with a rich set of attributes, have been widely used i...
We propose a novel approach which combines the use of Bayesian network and probabilistic association...
Anomaly analysis is of great interest to diverse fields, including data mining and machine learning,...
Anomaly detection has become a crucial part of the protection of information and integrity. Due to t...
Anomaly detection methods can be very useful in identifying interesting or concerning events. In thi...
181 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.Anomaly detection is the task...
Despite the fact that Dynamic Bayesian Network models have become a popular modelling platform to ma...
Motivation Mixed molecular data combines continuous and categorical features of the same samples, s...
One essential and challenging task in data science is data cleaning - the process of identifying and...