Missing data imputation is a very important data cleaning task for machine learning and data mining with incomplete data. This paper proposes two novel methods for missing data imputation, named kEMI and kEMI+, that are based on the k-Nearest Neighbours algorithm for pre-imputation and the Expectation–Maximization algorithm for posterior-imputation. The former is a local search mechanism that aims to automatically find the best value for k and the latter makes use of the best k nearest neighbours to estimate missing scores by learning global similarities. kEMI+ makes use of a novel information fusion mechanism. It fuses top estimations through the Dempster–Shafer fusion module to obtain the final estimation. They handle both numerical and c...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data imputation is a critical part of data cleaning tasks and vital for learning from incomp...
A missing value is a common problem of most data processing in scientific research, which results in...
Missing data imputation is a very important data cleaning task for machine learning and data mining ...
The real-world data analysis and processing using data mining techniques often are facing observatio...
Abstract Machine learning has been the corner stone in analysing and extracting information from dat...
Missing data is a common drawback in many real-life pattern classification scenarios. One of the mos...
Missing data imputation is an important step in the process of machine learning and data mining when...
Statistical Imputation Techniques have been proposed mainly with the aim of predicting the missing v...
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing da...
Many real-world applications encountered a common issue in data analysis is the presence of missing ...
Imputation of missing data is important in many areas, such as reducing non-response bias in surveys...
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing da...
The incomplete dataset is an unescapable problem in data preprocessing that primarily machine learni...
Machine learning plays a pivotal role in data analysis and information extraction. However, one comm...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data imputation is a critical part of data cleaning tasks and vital for learning from incomp...
A missing value is a common problem of most data processing in scientific research, which results in...
Missing data imputation is a very important data cleaning task for machine learning and data mining ...
The real-world data analysis and processing using data mining techniques often are facing observatio...
Abstract Machine learning has been the corner stone in analysing and extracting information from dat...
Missing data is a common drawback in many real-life pattern classification scenarios. One of the mos...
Missing data imputation is an important step in the process of machine learning and data mining when...
Statistical Imputation Techniques have been proposed mainly with the aim of predicting the missing v...
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing da...
Many real-world applications encountered a common issue in data analysis is the presence of missing ...
Imputation of missing data is important in many areas, such as reducing non-response bias in surveys...
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing da...
The incomplete dataset is an unescapable problem in data preprocessing that primarily machine learni...
Machine learning plays a pivotal role in data analysis and information extraction. However, one comm...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data imputation is a critical part of data cleaning tasks and vital for learning from incomp...
A missing value is a common problem of most data processing in scientific research, which results in...