Outlier detection is an important research direction in the field of data mining. Aiming at the problem of unstable detection results and low efficiency caused by randomly dividing features of the data set in the Isolation Forest algorithm in outlier detection, an algorithm CIIF (Cluster-based Improved Isolation Forest) that combines clustering and Isolation Forest is proposed. CIIF first uses the k-means method to cluster the data set, selects a specific cluster to construct a selection matrix based on the results of the clustering, and implements the selection mechanism of the algorithm through the selection matrix; then builds multiple isolation trees. Finally, the outliers are calculated according to the average search length of each sa...
Outlier is a data point that deviates too much from the rest of dataset. Most of real-world dataset ...
Many studies of outlier detection have been developed based on the cluster-based outlier detection ...
Data clustering is an important data exploration technique with many applications in data mining. We...
Isolation Forest or iForest is one of the outstanding outlier detectors proposed in recent years. Ye...
The first successful isolation-based anomaly detector, ie, iForest, uses trees as a means to perform...
Machine learning methods like outlier detection are becoming increasingly more popular as tools in t...
Detecting anomalies in data sets has been one of the most studied issues in modern data analysis. Th...
In this paper, the mathematical analysis of the Isolation Random Forest Method (IRF Method) for anom...
International audienceThis letter introduces a generalization of Isolation Forest (IF) based on the ...
Anomalies are data points that are few and different. As a result of these properties, we show that,...
[[abstract]]In this paper, a two-phase clustering algorithm for outliers detection is proposed. We f...
This letter introduces a generalization of Isolation Forest (IF) based on the existing Extended IF (...
Most existing model-based approaches to anomaly detection construct a profile of normal instances, t...
We present an anomaly and outlier detection method for graph data. The method relies on the consider...
We present an anomaly and outlier detection method for graph data. The method relies on the consider...
Outlier is a data point that deviates too much from the rest of dataset. Most of real-world dataset ...
Many studies of outlier detection have been developed based on the cluster-based outlier detection ...
Data clustering is an important data exploration technique with many applications in data mining. We...
Isolation Forest or iForest is one of the outstanding outlier detectors proposed in recent years. Ye...
The first successful isolation-based anomaly detector, ie, iForest, uses trees as a means to perform...
Machine learning methods like outlier detection are becoming increasingly more popular as tools in t...
Detecting anomalies in data sets has been one of the most studied issues in modern data analysis. Th...
In this paper, the mathematical analysis of the Isolation Random Forest Method (IRF Method) for anom...
International audienceThis letter introduces a generalization of Isolation Forest (IF) based on the ...
Anomalies are data points that are few and different. As a result of these properties, we show that,...
[[abstract]]In this paper, a two-phase clustering algorithm for outliers detection is proposed. We f...
This letter introduces a generalization of Isolation Forest (IF) based on the existing Extended IF (...
Most existing model-based approaches to anomaly detection construct a profile of normal instances, t...
We present an anomaly and outlier detection method for graph data. The method relies on the consider...
We present an anomaly and outlier detection method for graph data. The method relies on the consider...
Outlier is a data point that deviates too much from the rest of dataset. Most of real-world dataset ...
Many studies of outlier detection have been developed based on the cluster-based outlier detection ...
Data clustering is an important data exploration technique with many applications in data mining. We...