188 pagesMissing data imputation forms the first critical step of many data analysis pipelines. For practical applications, imputation algorithms should produce imputations that match the true data distribution and handle data of mixed types. This dissertation develops new imputation algorithms for data with many different variable types, including continuous, binary, ordinal, and truncated and categorical values, by modeling data as samples from a Gaussian copula model. This semiparametric model learns the marginal distribution of each variable to match the empirical distribution, yet describes the interactions between variables with a joint Gaussian that enables fast inference, imputation with confidence intervals, and multiple imputation...
Multiple imputation (MI) is used to handle missing at random (MAR) data. Despite warnings from stati...
It is common in applied research to have large numbers of variables with mixed data types (continuou...
Imputation is a popular technique for handling item nonresponse often found in data application. Par...
Missing value imputation is crucial for real-world data science workflows. Imputation is harder in t...
In this paper the author demonstrates how the copulas approach can be used to find algorithms for im...
In this thesis, we propose innovative imputation models to handle missing data of mixed-type. O...
In this work we introduce a copula-based method for imputing missing data by using conditional densi...
We propose a method for imputing missing data by using conditional copula functions. Copulas are a p...
Missing data problems impose great challenges to both statisticians and data practitioners. Multiple...
Multiple imputation is a commonly used approach to deal with missing values. In this approach, an im...
Modern datasets commonly feature both substantial missingness and variables of mixed data types, whi...
Abstract. Gold-standard approaches to missing data imputation are complicated and computationally ex...
The goal ofmultiple imputation is to provide valid inferences for statistical estimates from incompl...
The goal of multiple imputation is to provide valid inferences for statistical estimates from incomp...
The method of multiple imputation (MI) is used increasingly for analyzing datasets with missing obse...
Multiple imputation (MI) is used to handle missing at random (MAR) data. Despite warnings from stati...
It is common in applied research to have large numbers of variables with mixed data types (continuou...
Imputation is a popular technique for handling item nonresponse often found in data application. Par...
Missing value imputation is crucial for real-world data science workflows. Imputation is harder in t...
In this paper the author demonstrates how the copulas approach can be used to find algorithms for im...
In this thesis, we propose innovative imputation models to handle missing data of mixed-type. O...
In this work we introduce a copula-based method for imputing missing data by using conditional densi...
We propose a method for imputing missing data by using conditional copula functions. Copulas are a p...
Missing data problems impose great challenges to both statisticians and data practitioners. Multiple...
Multiple imputation is a commonly used approach to deal with missing values. In this approach, an im...
Modern datasets commonly feature both substantial missingness and variables of mixed data types, whi...
Abstract. Gold-standard approaches to missing data imputation are complicated and computationally ex...
The goal ofmultiple imputation is to provide valid inferences for statistical estimates from incompl...
The goal of multiple imputation is to provide valid inferences for statistical estimates from incomp...
The method of multiple imputation (MI) is used increasingly for analyzing datasets with missing obse...
Multiple imputation (MI) is used to handle missing at random (MAR) data. Despite warnings from stati...
It is common in applied research to have large numbers of variables with mixed data types (continuou...
Imputation is a popular technique for handling item nonresponse often found in data application. Par...