Missing data is a common drawback in many real-life pattern classification scenarios. One of the most popular solutions is missing data imputation by the K nearest neighbours (KNN) algorithm. In this article, we propose a novel KNN imputation procedure using a feature-weighted distance metric based on mutual information (MI). This method provides a missing data estimation aimed at solving the classification task, i.e., it provides an imputed dataset which is directed toward improving the classification performance. The MI-based distance metric is also used to implement an effective KNN classifier. Experimental results on both artificial and real classification datasets are provided to illustrate the efficiency and the robustness of the prop...
Abstract — In this paper, Imputation of missing data is solved by using a combined approach. KNN cla...
Missing data is an important issue in almost all fields of quantitative re-search. A nonparametric p...
Imputation of missing data is a common application in supervised classification problems, where the ...
Incomplete data is a common drawback that machine learning techniques need to deal with when solving...
Existing kNN imputation methods for dealing with missing data are designed according to Minkowski di...
Missing data imputation is a very important data cleaning task for machine learning and data mining ...
Feature selection is an important preprocessing task for many machine learning and pattern recogniti...
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing da...
Clinical registers constitute an invaluable resource in the medical data-driven decision making cont...
Imputation of missing data is important in many areas, such as reducing non-response bias in surveys...
Missing values in data are common in real world applications. Since the performance of many data min...
The real-world data analysis and processing using data mining techniques often are facing observatio...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data imputation is an important step in the process of machine learning and data mining when...
Many real-world applications encountered a common issue in data analysis is the presence of missing ...
Abstract — In this paper, Imputation of missing data is solved by using a combined approach. KNN cla...
Missing data is an important issue in almost all fields of quantitative re-search. A nonparametric p...
Imputation of missing data is a common application in supervised classification problems, where the ...
Incomplete data is a common drawback that machine learning techniques need to deal with when solving...
Existing kNN imputation methods for dealing with missing data are designed according to Minkowski di...
Missing data imputation is a very important data cleaning task for machine learning and data mining ...
Feature selection is an important preprocessing task for many machine learning and pattern recogniti...
K-nearest neighbors (KNN) has been extensively used as imputation algorithm to substitute missing da...
Clinical registers constitute an invaluable resource in the medical data-driven decision making cont...
Imputation of missing data is important in many areas, such as reducing non-response bias in surveys...
Missing values in data are common in real world applications. Since the performance of many data min...
The real-world data analysis and processing using data mining techniques often are facing observatio...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data imputation is an important step in the process of machine learning and data mining when...
Many real-world applications encountered a common issue in data analysis is the presence of missing ...
Abstract — In this paper, Imputation of missing data is solved by using a combined approach. KNN cla...
Missing data is an important issue in almost all fields of quantitative re-search. A nonparametric p...
Imputation of missing data is a common application in supervised classification problems, where the ...