2021 Spring.Includes bibliographical references.Data quality tests are used to validate the data stored in databases and data warehouses, and to detect violations of syntactic and semantic constraints. Domain experts grapple with the issues related to the capturing of all the important constraints and checking that they are satisfied. The constraints are often identified in an ad hoc manner based on the knowledge of the application domain and the needs of the stakeholders. Constraints can exist over single or multiple attributes as well as records involving time series and sequences. The constraints involving multiple attributes can involve both linear and non-linear relationships among the attributes. We propose ADQuaTe as a data quality t...
The main goal of this research is to contribute to automated performance anomaly detection for large...
Daily operation of a large-scale experiment is a resource consuming task, particularly from perspect...
The capability of deep learning (DL) techniques for dealing with non-linear, dynamic and correlated ...
Anomalous data lead to scientific discoveries. Although machine learning systems can be forced to re...
Anomaly detection in industrial time series data is essential for identifying and preventing potenti...
Nowadays, big data systems (e.g., Hadoop and Spark) are being widely adopted by many domains for off...
In the last decades, many works have been done to enhance data performances in the computer field. D...
Anomaly detection is an important data mining task. Most existing methods treat anomalies as inconsi...
International audienceBig Data systems are producing huge amounts of data in real-time. Finding anom...
Real-time data processing has become an increasingly important challenge as the need for faster anal...
International audienceClassification algorithms have been widely adopted to detect anomalies for var...
Anomaly detection is the task of identifying observations in a dataset that do not conform the expec...
Anomaly detection is the problem of identifying data points or patterns that do not conform to norma...
University of Minnesota M.S. thesis. May 2019. Major: Computer Science. Advisor: Edward McFowland II...
Error detection is key for data quality management. Leveraging domain knowledge in the form of user-...
The main goal of this research is to contribute to automated performance anomaly detection for large...
Daily operation of a large-scale experiment is a resource consuming task, particularly from perspect...
The capability of deep learning (DL) techniques for dealing with non-linear, dynamic and correlated ...
Anomalous data lead to scientific discoveries. Although machine learning systems can be forced to re...
Anomaly detection in industrial time series data is essential for identifying and preventing potenti...
Nowadays, big data systems (e.g., Hadoop and Spark) are being widely adopted by many domains for off...
In the last decades, many works have been done to enhance data performances in the computer field. D...
Anomaly detection is an important data mining task. Most existing methods treat anomalies as inconsi...
International audienceBig Data systems are producing huge amounts of data in real-time. Finding anom...
Real-time data processing has become an increasingly important challenge as the need for faster anal...
International audienceClassification algorithms have been widely adopted to detect anomalies for var...
Anomaly detection is the task of identifying observations in a dataset that do not conform the expec...
Anomaly detection is the problem of identifying data points or patterns that do not conform to norma...
University of Minnesota M.S. thesis. May 2019. Major: Computer Science. Advisor: Edward McFowland II...
Error detection is key for data quality management. Leveraging domain knowledge in the form of user-...
The main goal of this research is to contribute to automated performance anomaly detection for large...
Daily operation of a large-scale experiment is a resource consuming task, particularly from perspect...
The capability of deep learning (DL) techniques for dealing with non-linear, dynamic and correlated ...