Recent technological advancements have enabled generating and collecting huge amounts of data in a daily manner. This data is used for different purposes that may impact us on an unprecedented scale. Understanding the data, including detecting its outliers, is a critical step before utilizing it. Outlier detection has been studied well in the literature but the existing approaches fail to scale to these very large settings. In this paper, we propose DBSCOUT, an efficient exact algorithm for outlier detection with a linear complexity that can run in parallel over multiple independent machines, making it a fit for the settings with billions of tuples. Besides the theoretical analysis, our experiment results confirm orders of magnitude improve...
Computing outliers and related statistical aggregation functions from large-scale big data sources i...
Outlier detection is a fundamental step in knowledge discovery in databases. With the increasing num...
Abstract Outlier detection is a popular technique that can be utilized in many modern applications l...
We propose a distributed approach addressing the problem of distance-based outlier detection in very...
In this work we introduce a distributed method for detecting distance-based outliers in very large d...
The outlier detection is an important and valuable research in KDD (Knowledge discover in database)....
Outliers are objects that show abnormal behavior with respect to their context or that have unexpect...
This paper deals with finding outliers (exceptions) in large datasets. The identification of outlier...
This paper deals with finding outliers (exceptions) in large, multidimensional datasets. The identif...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Data mining is a new, important and fast growing database application. Outlier (exception) detection...
Outlier detection is one of the major data mining methods. This paper proposes a three-step approach...
Outlier detection is one of the major data mining methods. This paper proposes a three-step approach...
Our thesis is that we can efficiently identify meaningful outliers in large, multidimensional datas...
With precipitously growing demand to detect outliers in data streams, many studies have been conduct...
Computing outliers and related statistical aggregation functions from large-scale big data sources i...
Outlier detection is a fundamental step in knowledge discovery in databases. With the increasing num...
Abstract Outlier detection is a popular technique that can be utilized in many modern applications l...
We propose a distributed approach addressing the problem of distance-based outlier detection in very...
In this work we introduce a distributed method for detecting distance-based outliers in very large d...
The outlier detection is an important and valuable research in KDD (Knowledge discover in database)....
Outliers are objects that show abnormal behavior with respect to their context or that have unexpect...
This paper deals with finding outliers (exceptions) in large datasets. The identification of outlier...
This paper deals with finding outliers (exceptions) in large, multidimensional datasets. The identif...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Data mining is a new, important and fast growing database application. Outlier (exception) detection...
Outlier detection is one of the major data mining methods. This paper proposes a three-step approach...
Outlier detection is one of the major data mining methods. This paper proposes a three-step approach...
Our thesis is that we can efficiently identify meaningful outliers in large, multidimensional datas...
With precipitously growing demand to detect outliers in data streams, many studies have been conduct...
Computing outliers and related statistical aggregation functions from large-scale big data sources i...
Outlier detection is a fundamental step in knowledge discovery in databases. With the increasing num...
Abstract Outlier detection is a popular technique that can be utilized in many modern applications l...