International audienceBACKGROUND: As databases grow larger, it becomes harder to fully control their collection, and they frequently come with missing values: incomplete observations. These large databases are well suited to train machine-learning models, for instance for forecasting or to extract biomarkers in biomedical settings. Such predictive approaches can use discriminative -rather than generative- modeling, and thus open the door to new missing-values strategies. Yet existing empirical evaluations of strategies to handle missing values have focused on inferential statistics. RESULTS: Here we conduct a systematic benchmark of missing-values strategies in predictive models with a focus on large health databases: four electronic health...
: The missing data mechanism is a relevant problem in Machine Learning (ML) and biomedical informati...
This electronic version was submitted by the student author. The certified thesis is available in th...
The availability of data and advanced data analysis tools in the health care domain provide great op...
International audienceBACKGROUND: As databases grow larger, it becomes harder to fully control their...
Pristine and trustworthy data are required for efficient computer modelling for medical decision-mak...
The identification of individual patients at risk of disease has become an integral part of recent t...
Datasets in healthcare are plagued with incomplete information. Imputation is a common method to dea...
Healthcare organizations aim at deriving valuable insights employing data mining and soft computing ...
[[abstract]]While there is an ample amount of medical information available for data mining, many of...
Many real-world datasets suffer from missing data, which can introduce uncertainty into ensuing anal...
Recently, the industry of healthcare started generating a large volume of datasets. If hospitals can...
A high level of data quality has always been a concern for many applications based on machine learni...
Missing data are a universal data quality problem in many domains, leading to misleading analysis an...
Recently, the industry of healthcare started generating a large volume of datasets. If hospitals can...
The analysis of digital health data with machine learning models can be used in clinical application...
: The missing data mechanism is a relevant problem in Machine Learning (ML) and biomedical informati...
This electronic version was submitted by the student author. The certified thesis is available in th...
The availability of data and advanced data analysis tools in the health care domain provide great op...
International audienceBACKGROUND: As databases grow larger, it becomes harder to fully control their...
Pristine and trustworthy data are required for efficient computer modelling for medical decision-mak...
The identification of individual patients at risk of disease has become an integral part of recent t...
Datasets in healthcare are plagued with incomplete information. Imputation is a common method to dea...
Healthcare organizations aim at deriving valuable insights employing data mining and soft computing ...
[[abstract]]While there is an ample amount of medical information available for data mining, many of...
Many real-world datasets suffer from missing data, which can introduce uncertainty into ensuing anal...
Recently, the industry of healthcare started generating a large volume of datasets. If hospitals can...
A high level of data quality has always been a concern for many applications based on machine learni...
Missing data are a universal data quality problem in many domains, leading to misleading analysis an...
Recently, the industry of healthcare started generating a large volume of datasets. If hospitals can...
The analysis of digital health data with machine learning models can be used in clinical application...
: The missing data mechanism is a relevant problem in Machine Learning (ML) and biomedical informati...
This electronic version was submitted by the student author. The certified thesis is available in th...
The availability of data and advanced data analysis tools in the health care domain provide great op...