A multivariate dataset consists of n cases in d dimensions, and is often stored in an n by d data matrix. It is well-known that real data may contain outliers. Depending on the situation, outliers may be (a) undesirable errors, which can adversely affect the data analysis, or (b) valuable nuggets of unexpected information. In statistics and data analysis, the word outlier usually refers to a row of the data matrix, and the methods to detect such outliers only work when at least half the rows are clean. But often many rows have a few contaminated cell values, which may not be visible by looking at each variable (column) separately. We propose the first method to detect deviating data cells in amultivariate samplewhich takes the correlations ...
Outlier identification is important in many applications of multivariate analysis. Either because th...
The classical estimators of multivariate location and scatter for the normal model are the sample m...
In this paper we introduce a new method for detecting outliers in a set of proportions. It is based ...
A multivariate dataset consists of n observations in p dimensions, and is often stored in an n by p ...
Most outlier detection rules for multivariate data are based on the assumption of elliptical symmetr...
Standard statistical techniques such as least squares regression are very accurate if the underlying...
Real data may contain both cellwise outliers and casewise outliers. There is a vast literature on ro...
Outlier identification often implies inspecting each z-transformed variable and adding a Mahalanobis...
This article describes a procedure for the detection of multivariate outliers based on the analysis ...
© 2017 The Authors. WIREs Data Mining and Knowledge Discovery published by Wiley Periodicals, Inc. R...
Data Science is the new and exciting interdisciplinary response that has emerged as a consequence of...
Outlier identification is important in many applications of multivariate analysis. Either because th...
Cellwise outliers are likely to occur together with casewise outliers in datasets of relatively larg...
Multivariate data are typically represented by a rectangular matrix (table) in which the rows are th...
We propose a data-analytic method for detecting cellwise outliers. Given a robust covariance matrix,...
Outlier identification is important in many applications of multivariate analysis. Either because th...
The classical estimators of multivariate location and scatter for the normal model are the sample m...
In this paper we introduce a new method for detecting outliers in a set of proportions. It is based ...
A multivariate dataset consists of n observations in p dimensions, and is often stored in an n by p ...
Most outlier detection rules for multivariate data are based on the assumption of elliptical symmetr...
Standard statistical techniques such as least squares regression are very accurate if the underlying...
Real data may contain both cellwise outliers and casewise outliers. There is a vast literature on ro...
Outlier identification often implies inspecting each z-transformed variable and adding a Mahalanobis...
This article describes a procedure for the detection of multivariate outliers based on the analysis ...
© 2017 The Authors. WIREs Data Mining and Knowledge Discovery published by Wiley Periodicals, Inc. R...
Data Science is the new and exciting interdisciplinary response that has emerged as a consequence of...
Outlier identification is important in many applications of multivariate analysis. Either because th...
Cellwise outliers are likely to occur together with casewise outliers in datasets of relatively larg...
Multivariate data are typically represented by a rectangular matrix (table) in which the rows are th...
We propose a data-analytic method for detecting cellwise outliers. Given a robust covariance matrix,...
Outlier identification is important in many applications of multivariate analysis. Either because th...
The classical estimators of multivariate location and scatter for the normal model are the sample m...
In this paper we introduce a new method for detecting outliers in a set of proportions. It is based ...