In this work we introduce a distributed method for detecting distance-based outliers in very large data sets. Our approach is based on the concept of outlier detection solving set [2], which is a small subset of the data set that can be also employed for predicting novel outliers. The method exploits parallel computation in order to obtain vast time savings. Indeed, beyond preserving the correctness of the result, the proposed schema exhibits excellent performances. From the theoretical point of view, for common settings, the temporal cost of our algorithm is expected to be at least three order of magnitude faster than the classical nested-loop like approach to detect outliers. Experimental results show that the algorithm is efficient and t...
Recent technological advancements have enabled generating and collecting huge amounts of data in a d...
Outlier detection is an important data mining task, whose target is to find the abnormal or atypical...
Anomaly detection is one of the major data mining tasks in modern applications. An element that show...
In this work we introduce a distributed method for detecting distance-based outliers in very large d...
We propose a distributed approach addressing the problem of distance-based outlier detection in very...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Outlier detection has attracted substantial attention in many applications and research areas; some ...
Outlier detection has attracted substantial attention in many applications and research areas; some ...
none4noThe mining task of outlier detection is essential in many expert and intelligent systems expl...
The outlier detection is an important and valuable research in KDD (Knowledge discover in database)....
This paper deals with finding outliers (exceptions) in large, multidimensional datasets. The identif...
Detecting outliers in data is an important problem with in-teresting applications in a myriad of dom...
Outlier detection is an important problem for the data mining community as outliers often embody pot...
Distance-based outlier detection is widely adopted in many fields, e.g., data mining and machine lea...
The process of discovering interesting patterns in large, possibly huge, data sets is referred to as...
Recent technological advancements have enabled generating and collecting huge amounts of data in a d...
Outlier detection is an important data mining task, whose target is to find the abnormal or atypical...
Anomaly detection is one of the major data mining tasks in modern applications. An element that show...
In this work we introduce a distributed method for detecting distance-based outliers in very large d...
We propose a distributed approach addressing the problem of distance-based outlier detection in very...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Outlier detection has attracted substantial attention in many applications and research areas; some ...
Outlier detection has attracted substantial attention in many applications and research areas; some ...
none4noThe mining task of outlier detection is essential in many expert and intelligent systems expl...
The outlier detection is an important and valuable research in KDD (Knowledge discover in database)....
This paper deals with finding outliers (exceptions) in large, multidimensional datasets. The identif...
Detecting outliers in data is an important problem with in-teresting applications in a myriad of dom...
Outlier detection is an important problem for the data mining community as outliers often embody pot...
Distance-based outlier detection is widely adopted in many fields, e.g., data mining and machine lea...
The process of discovering interesting patterns in large, possibly huge, data sets is referred to as...
Recent technological advancements have enabled generating and collecting huge amounts of data in a d...
Outlier detection is an important data mining task, whose target is to find the abnormal or atypical...
Anomaly detection is one of the major data mining tasks in modern applications. An element that show...