Despite the prominent use of complex survey data and the growing popularity of machine learning methods in epidemiologic research, few machine learning software implementations offer options for handling complex samples. A major challenge impeding the broader incorporation of machine learning into epidemiologic research is incomplete guidance for analyzing complex survey data, including the importance of sampling weights for valid prediction in target populations. Using data from 15, 820 participants in the 1988-1994 National Health and Nutrition Examination Survey cohort, we determined whether ignoring weights in gradient boosting models of all-cause mortality affected prediction, as measured by the F1 score and corresponding 95% confidenc...
Datasets are rarely a realistic approximation of the target population. Say, prevalence is misrepres...
Sampling weights are a reflection of sampling design; they allow us to draw valid conclusions about...
Multilevel complex survey data are obtained from study designs that involve multiple stages of sampl...
Despite the prominent use of complex survey data and the growing popularity of machine learning meth...
Objective: To determine how machine learning has been applied to prediction applications in populati...
The goal of this book is to put an array of tools at the fingertips of students, practitioners, and ...
There is increasing use of non-probability sampling methods in large-scale surveys due to the costs ...
A fundamental technique in survey sampling is to weight included units by the inverse of their proba...
In recent years, there has been a widespread cross-fertilization between Medical Statistics and Mach...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Machine learning methods are widely used within the medical field. However, the reliability and effi...
Background: The concept of boosting emerged from the field of machine learning. The basic idea is to...
In recent years, there has been a widespread cross-fertilization between Medical Statistics and Mach...
Background: Machine learning (ML) has spread rapidly from computer science to several disciplines. G...
Abstract. The general principles of Bayesian data analysis imply that mod-els for survey responses s...
Datasets are rarely a realistic approximation of the target population. Say, prevalence is misrepres...
Sampling weights are a reflection of sampling design; they allow us to draw valid conclusions about...
Multilevel complex survey data are obtained from study designs that involve multiple stages of sampl...
Despite the prominent use of complex survey data and the growing popularity of machine learning meth...
Objective: To determine how machine learning has been applied to prediction applications in populati...
The goal of this book is to put an array of tools at the fingertips of students, practitioners, and ...
There is increasing use of non-probability sampling methods in large-scale surveys due to the costs ...
A fundamental technique in survey sampling is to weight included units by the inverse of their proba...
In recent years, there has been a widespread cross-fertilization between Medical Statistics and Mach...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Machine learning methods are widely used within the medical field. However, the reliability and effi...
Background: The concept of boosting emerged from the field of machine learning. The basic idea is to...
In recent years, there has been a widespread cross-fertilization between Medical Statistics and Mach...
Background: Machine learning (ML) has spread rapidly from computer science to several disciplines. G...
Abstract. The general principles of Bayesian data analysis imply that mod-els for survey responses s...
Datasets are rarely a realistic approximation of the target population. Say, prevalence is misrepres...
Sampling weights are a reflection of sampling design; they allow us to draw valid conclusions about...
Multilevel complex survey data are obtained from study designs that involve multiple stages of sampl...