Abstract. This work proposes and evaluates a Nearest-Neighbor Method to substitute missing values in datasets formed by continuous attributes. In the substitution process, each instance containing missing values is compared with complete instances, and the closest instance is used to assign the attribute missing value. We evaluate this method in simulations performed in four datasets that are usually employed as benchmarks for data mining methods- Iris Plants, Wisconsin Breast Cancer, Pima Indians Diabetes and Wine Recognition. First, we con-sider the substitution process as a prediction task. In this sense, we em-ploy two metrics (Euclidean and Manhattan) to simulate substitutions both in original and normalized datasets. The obtained resu...
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within ne...
Abstract — In this paper, Imputation of missing data is solved by using a combined approach. KNN cla...
In this paper, a framework for replacing missing values in a database is proposed since a real-world...
This paper presents a Nearest-Neighbor Method to substitute missing values in continuous datasets an...
Abstract: Problem statement: Predicting the value for missing attributes is an important data prepro...
Missing data handling is an important preparation step for most data discrimination or mining tasks....
Existing kNN imputation methods for dealing with missing data are designed according to Minkowski di...
The Affinity Propagation (AP) algorithm is an effective algorithm for clustering analysis, but it ca...
Missing data is an important issue in almost all fields of quantitative research. A nonparametric pr...
Many real-world applications encountered a common issue in data analysis is the presence of missing ...
Missing value imputation is an actual yet challenging issue confronted by machine learning and data ...
The real-world data analysis and processing using data mining techniques often are facing observatio...
Missing values in data are common in real world applications. Since the performance of many data min...
Fuzzy K-means clustering algorithm is a popular approach for exploring the structure of a set of pat...
The missing values are not uncommon in real data sets. The algorithms and methods used for the data ...
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within ne...
Abstract — In this paper, Imputation of missing data is solved by using a combined approach. KNN cla...
In this paper, a framework for replacing missing values in a database is proposed since a real-world...
This paper presents a Nearest-Neighbor Method to substitute missing values in continuous datasets an...
Abstract: Problem statement: Predicting the value for missing attributes is an important data prepro...
Missing data handling is an important preparation step for most data discrimination or mining tasks....
Existing kNN imputation methods for dealing with missing data are designed according to Minkowski di...
The Affinity Propagation (AP) algorithm is an effective algorithm for clustering analysis, but it ca...
Missing data is an important issue in almost all fields of quantitative research. A nonparametric pr...
Many real-world applications encountered a common issue in data analysis is the presence of missing ...
Missing value imputation is an actual yet challenging issue confronted by machine learning and data ...
The real-world data analysis and processing using data mining techniques often are facing observatio...
Missing values in data are common in real world applications. Since the performance of many data min...
Fuzzy K-means clustering algorithm is a popular approach for exploring the structure of a set of pat...
The missing values are not uncommon in real data sets. The algorithms and methods used for the data ...
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within ne...
Abstract — In this paper, Imputation of missing data is solved by using a combined approach. KNN cla...
In this paper, a framework for replacing missing values in a database is proposed since a real-world...