Challenges with data in the big-data era include (i) the dimension $p$ is often larger than the sample size $n$ (ii) outliers or contaminated points are frequently hidden and more difficult to detect. Challenge (i) renders most conventional methods inapplicable. Thus, it attracts tremendous attention from statistics, computer science, and bio-medical communities. Numerous penalized regression methods have been introduced as modern methods for analyzing high-dimensional data. Disproportionate attention has been paid to the challenge (ii) though. Penalized regression methods can do their job very well and are expected to handle the challenge (ii) simultaneously. Most of them, however, can break down by a single outlier (or single adversary co...
High breakdown estimation (HBE) addresses the problem of getting reliable parameter estimates in the...
High-dimensional data can often display heterogeneity due to heteroscedastic variance or inhomogeneo...
29 pagesInternational audienceWe consider the problem of robustly predicting as well as the best lin...
In the famous least sum of trimmed squares (LTS) of residuals estimator (Rousseeuw (1984)), residual...
To perform regression analysis in high dimensions, lasso or ridge estimation are a common choice. Ho...
Sparse model estimation is a topic of high importance in modern data analysis due to the increasing ...
summary:From the practical point of view the regression analysis and its Least Squares method is cle...
Linear regression models are commonly used statistical models for predicting a response from a set o...
Classical parametric statistics commonly makes assumptions about the data (e.g. normal distribution)...
More attention has been given to regularization methods in the last two decades as a result of exiti...
Ordinary least squares (OLS) is the default method for fitting linear models, but is not applicable ...
Outliers in the data are a common problem in applied statistics. Estimators that give reliable resul...
In this paper we present the Cumulative Median Estimation (CUMed) algorithm for robust sufficient di...
A method is introduced for variable selection and prediction in linear regression problems where the...
In high-dimensional settings, a penalized least squares approach may lose its efficiency in both est...
High breakdown estimation (HBE) addresses the problem of getting reliable parameter estimates in the...
High-dimensional data can often display heterogeneity due to heteroscedastic variance or inhomogeneo...
29 pagesInternational audienceWe consider the problem of robustly predicting as well as the best lin...
In the famous least sum of trimmed squares (LTS) of residuals estimator (Rousseeuw (1984)), residual...
To perform regression analysis in high dimensions, lasso or ridge estimation are a common choice. Ho...
Sparse model estimation is a topic of high importance in modern data analysis due to the increasing ...
summary:From the practical point of view the regression analysis and its Least Squares method is cle...
Linear regression models are commonly used statistical models for predicting a response from a set o...
Classical parametric statistics commonly makes assumptions about the data (e.g. normal distribution)...
More attention has been given to regularization methods in the last two decades as a result of exiti...
Ordinary least squares (OLS) is the default method for fitting linear models, but is not applicable ...
Outliers in the data are a common problem in applied statistics. Estimators that give reliable resul...
In this paper we present the Cumulative Median Estimation (CUMed) algorithm for robust sufficient di...
A method is introduced for variable selection and prediction in linear regression problems where the...
In high-dimensional settings, a penalized least squares approach may lose its efficiency in both est...
High breakdown estimation (HBE) addresses the problem of getting reliable parameter estimates in the...
High-dimensional data can often display heterogeneity due to heteroscedastic variance or inhomogeneo...
29 pagesInternational audienceWe consider the problem of robustly predicting as well as the best lin...