In this paper we propose a new method to deal with missingness in categorical data. The new proposal is a forward imputation procedure and is presented in the context of the Nonlinear Principal Component Analysis, used to obtain indicators from a large dataset. However, this procedure can be easily adopted in other contexts, and when other multivariate techniques are used. We discuss the statistical features of our imputation technique in connection with other treatment methods which are popular among Nonlinear Principal Component Analysis users. The performance of our method is then compared to the other methods through a simulation study which involves the application to a real dataset extracted from the Euro-barometer survey. Missing dat...
Abstract Background Incomplete categorical variables with more than two categories are common in pub...
Principal component analysis (PCA) is a widely used statistical technique for determining subscales ...
The problem of incomplete data and its implications for drawing valid conclusions from statistical a...
The problem of missing data in building multidimensional composite indicators is a delicate problem ...
International audienceWe propose a new method to impute missing values in mixed datasets. It is base...
Abstract: Missing data are a common problem for researchers working with surveys and other types of ...
International audienceThis paper is a written version of the talk Julie Josse delivered at the 44 Jo...
The imputation of missing data is often a crucial step in the analysis of survey data. This study re...
Imputation of missing data has always represented a problem for researchers from every field. A bias...
Missing data recurrently affect datasets in almost every field of quantitative research. The subject...
The increasing availability of data often characterized by missing values has paved the way for the ...
A high level of data quality has always been a concern for many applications based on machine learni...
We studied four methods for handling incomplete categorical data in statistical modeling: (1) maximu...
A research report submitted to the Faculty of Science, University of the Witwatersrand, for the degr...
The concept of symbolic data has been developed with the aim of representing variables whose measure...
Abstract Background Incomplete categorical variables with more than two categories are common in pub...
Principal component analysis (PCA) is a widely used statistical technique for determining subscales ...
The problem of incomplete data and its implications for drawing valid conclusions from statistical a...
The problem of missing data in building multidimensional composite indicators is a delicate problem ...
International audienceWe propose a new method to impute missing values in mixed datasets. It is base...
Abstract: Missing data are a common problem for researchers working with surveys and other types of ...
International audienceThis paper is a written version of the talk Julie Josse delivered at the 44 Jo...
The imputation of missing data is often a crucial step in the analysis of survey data. This study re...
Imputation of missing data has always represented a problem for researchers from every field. A bias...
Missing data recurrently affect datasets in almost every field of quantitative research. The subject...
The increasing availability of data often characterized by missing values has paved the way for the ...
A high level of data quality has always been a concern for many applications based on machine learni...
We studied four methods for handling incomplete categorical data in statistical modeling: (1) maximu...
A research report submitted to the Faculty of Science, University of the Witwatersrand, for the degr...
The concept of symbolic data has been developed with the aim of representing variables whose measure...
Abstract Background Incomplete categorical variables with more than two categories are common in pub...
Principal component analysis (PCA) is a widely used statistical technique for determining subscales ...
The problem of incomplete data and its implications for drawing valid conclusions from statistical a...