Outlier detection is a fundamental step in knowledge discovery in databases. With the increasing number of high-dimensional databases, existing outlier detection algorithms that work only in the context of full space are unable to effectively screen out informative outliers. This is because majority of these outliers exists only in subspaces. In this paper, we identify a new outlier detection task for high-dimensional data, i.e. finding the subspaces in which given points are outliers, and propose a novel outlier detection algorithm, called High-D Outlier Detection (HighDOD). The intuitive idea is that we measure the outlying degree of the point using the sum of distances between this point and its k nearest neighbors. Two pruning strategie...
Abstract Subspace outlier detection has emerged as a practical approach for outlier detection. Class...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Detecting outliers in high-dimensional data is crucial in many domains. Due to the curse of dimensio...
Abstract Outlier detection is a popular technique that can be utilized in many modern applications l...
Abstract Outlier detection is an important problem that has applications in many fields. High dimens...
Outliers detection is currently very active area of research in data set mining community. Outliers ...
Abstract. The outlier detection problem has important applications in the field of fraud detection, ...
Abstract. We propose an original outlier detection schema that detects outliers in varying subspaces...
Detecting outlying subspaces is a relatively new research problem in outlier-ness analysis for high-...
The outlier detection problem has important applications in the eld of fraud detection, network robu...
Outlier detection has been studied extensively in data mining. However, as the emergence of huge dat...
Many real applications are required to detect outliers in high dimensional data sets. The major diff...
The outlier detection problem has important applications in the eld of fraud detection, netw ork rob...
Our thesis is that we can efficiently identify meaningful outliers in large, multidimensional datas...
Our thesis is that we can efficiently identify meaningful outliers in large, multidimensional datas...
Abstract Subspace outlier detection has emerged as a practical approach for outlier detection. Class...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Detecting outliers in high-dimensional data is crucial in many domains. Due to the curse of dimensio...
Abstract Outlier detection is a popular technique that can be utilized in many modern applications l...
Abstract Outlier detection is an important problem that has applications in many fields. High dimens...
Outliers detection is currently very active area of research in data set mining community. Outliers ...
Abstract. The outlier detection problem has important applications in the field of fraud detection, ...
Abstract. We propose an original outlier detection schema that detects outliers in varying subspaces...
Detecting outlying subspaces is a relatively new research problem in outlier-ness analysis for high-...
The outlier detection problem has important applications in the eld of fraud detection, network robu...
Outlier detection has been studied extensively in data mining. However, as the emergence of huge dat...
Many real applications are required to detect outliers in high dimensional data sets. The major diff...
The outlier detection problem has important applications in the eld of fraud detection, netw ork rob...
Our thesis is that we can efficiently identify meaningful outliers in large, multidimensional datas...
Our thesis is that we can efficiently identify meaningful outliers in large, multidimensional datas...
Abstract Subspace outlier detection has emerged as a practical approach for outlier detection. Class...
In this paper, we propose a novel formulation for distance-based outliers that is based on the dista...
Detecting outliers in high-dimensional data is crucial in many domains. Due to the curse of dimensio...