Clinical registers constitute an invaluable resource in the medical data-driven decision making context. Accurate machine learning and data mining approaches on these data can lead to faster diagnosis, definition of tailored interventions, and improved outcome prediction. A typical issue when implementing such approaches is the almost unavoidable presence of missing values in the collected data. In this work, we propose an imputation algorithm based on a mutual information-weighted k-nearest neighbours approach, able to handle the simultaneous presence of missing information in different types of variables. We developed and validated the method on a clinical register, constituted by the information collected over subsequent screening visits...
Incomplete data is a common drawback that machine learning techniques need to deal with when solving...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
Abstract Missing data is a common problem in longitudinal datasets which include mult...
Missing data is a common drawback in many real-life pattern classification scenarios. One of the mos...
Imputation of missing data is important in many areas, such as reducing non-response bias in surveys...
BDAW '16: International Conference on Big Data and Advanced Wireless Technologies, Blagoevgrad, Bulg...
: The missing data mechanism is a relevant problem in Machine Learning (ML) and biomedical informati...
A high level of data quality has always been a concern for many applications based on machine learni...
BDAW \u2716: International Conference on Big Data and Advanced Wireless Technologies, Blagoevgrad, B...
AbstractMost clinical and biomedical data contain missing values. A patient’s record may be split ac...
The incomplete dataset is an unescapable problem in data preprocessing that primarily machine learni...
Breast cancer is the most common cancer to females worldwide. Using machine learning technology to p...
Classifying patients based on stated reasons for missing outcome from different intercurrent events ...
One important characteristic of good data is completeness. Missing data is a major problem in the cl...
Incomplete data is a common drawback that machine learning techniques need to deal with when solving...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
Abstract Missing data is a common problem in longitudinal datasets which include mult...
Missing data is a common drawback in many real-life pattern classification scenarios. One of the mos...
Imputation of missing data is important in many areas, such as reducing non-response bias in surveys...
BDAW '16: International Conference on Big Data and Advanced Wireless Technologies, Blagoevgrad, Bulg...
: The missing data mechanism is a relevant problem in Machine Learning (ML) and biomedical informati...
A high level of data quality has always been a concern for many applications based on machine learni...
BDAW \u2716: International Conference on Big Data and Advanced Wireless Technologies, Blagoevgrad, B...
AbstractMost clinical and biomedical data contain missing values. A patient’s record may be split ac...
The incomplete dataset is an unescapable problem in data preprocessing that primarily machine learni...
Breast cancer is the most common cancer to females worldwide. Using machine learning technology to p...
Classifying patients based on stated reasons for missing outcome from different intercurrent events ...
One important characteristic of good data is completeness. Missing data is a major problem in the cl...
Incomplete data is a common drawback that machine learning techniques need to deal with when solving...
We propose a bi-objective algorithm based on the k-nearest neighbors (biokNN) method to perform impu...
Missing data is one of the most common issues encountered in data cleaning process especially when d...