MOTIVATION: It has been proposed that clustering clinical markers, such as blood test results, can be used to stratify patients. However, the robustness of clusters formed with this approach to data pre-processing and clustering algorithm choices has not been evaluated, nor has clustering reproducibility. Here, we made use of the NHANES survey to compare clusters generated with various combinations of pre-processing and clustering algorithms, and tested their reproducibility in two separate samples. METHOD: Values of 44 biomarkers and 19 health/life style traits were extracted from the National Health and Nutrition Examination Survey (NHANES). The 1999-2002 survey was used for training, while data from the 2003-2006 survey was tested as a v...
<p>(A) The mean and standard deviation of biomarker values are shown for three selected clusters gen...
There are large quantities of information about patients and their medical conditions. The discovery...
Clustering is a powerful machine learning tool for detecting structures in datasets. In the medical ...
It has been proposed that clustering clinical markers, such as blood test results, can be used to st...
<p>The number of validated terms found to be enriched in clusters generated with the different pre-p...
<p>A test-validation approach was used to test the impact of methodological choices on the clusterin...
Applications of clustering algorithms in biomedical research are ubiquitous, with typical examples i...
Disease understanding is key in designing effective treatments and diagnostic tools. A key aspect of...
<p>The NHANES code and description of validated terms found in clusters generated by pre-processing ...
Clustering is a long-standing problem in computer science and is applied in virtually any scientific...
Abstract—Dealing with data means to group information into a set of categories either in order to le...
Clustering has emerged as one of the most essential and popular techniques for discovering patterns ...
Clustering is a powerful machine learning tool for detecting structures in datasets. In the medical ...
In this article an introduction on unsupervised cluster analysis is provided. Clustering is the orga...
While a genuine abundance of biomedical data available nowadays becomes a genuine blessing, it also ...
<p>(A) The mean and standard deviation of biomarker values are shown for three selected clusters gen...
There are large quantities of information about patients and their medical conditions. The discovery...
Clustering is a powerful machine learning tool for detecting structures in datasets. In the medical ...
It has been proposed that clustering clinical markers, such as blood test results, can be used to st...
<p>The number of validated terms found to be enriched in clusters generated with the different pre-p...
<p>A test-validation approach was used to test the impact of methodological choices on the clusterin...
Applications of clustering algorithms in biomedical research are ubiquitous, with typical examples i...
Disease understanding is key in designing effective treatments and diagnostic tools. A key aspect of...
<p>The NHANES code and description of validated terms found in clusters generated by pre-processing ...
Clustering is a long-standing problem in computer science and is applied in virtually any scientific...
Abstract—Dealing with data means to group information into a set of categories either in order to le...
Clustering has emerged as one of the most essential and popular techniques for discovering patterns ...
Clustering is a powerful machine learning tool for detecting structures in datasets. In the medical ...
In this article an introduction on unsupervised cluster analysis is provided. Clustering is the orga...
While a genuine abundance of biomedical data available nowadays becomes a genuine blessing, it also ...
<p>(A) The mean and standard deviation of biomarker values are shown for three selected clusters gen...
There are large quantities of information about patients and their medical conditions. The discovery...
Clustering is a powerful machine learning tool for detecting structures in datasets. In the medical ...