Controlled variable selection is an important analytical step in various scientific fields, such as brain imaging or genomics. In these high-dimensional data settings, considering too many variables leads to poor models and high costs, hence the need for statistical guarantees on false positives. Knockoffs are a popular statistical tool for conditional variable selection in high dimension. However, they control for the expected proportion of false discoveries (FDR) and not their actual proportion (FDP). We present a new method, KOPI, that controls the proportion of false discoveries for Knockoff-based inference. The proposed method also relies on a new type of aggregation to address the undesirable randomness associated with classical Knock...
We propose the Terminating-Knockoff (T-Knock) filter, a fast variable selection method for high-dime...
Background: In high-throughput studies, hundreds to millions of hypotheses are typically tested. Sta...
Abstract Background When many (up to millions) of statistical tests are conducted in discovery set a...
Controlled variable selection is an important analytical step in various scientific fields, such as ...
In many fields of science, we observe a response variable together with a large number of potential ...
Recently, the scheme of model-X knockoffs was proposed as a promising solution to address controlled...
The discovery of biomarkers that are informative for cancer risk assessment, diagnosis, prognosis an...
International audienceContinuous improvement in medical imaging techniques allows the acquisition of...
We propose new methods to obtain simultaneous false discovery proportion bounds for knockoff-based a...
We consider the variable selection problem, which seeks to identify important variables influencin...
Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which pro...
Abstract Background Procedures for controlling the false discovery rate (FDR) are widely applied as ...
High-dimensional variable selection is a challenging task, especially when groups of highly correlat...
Recent discoveries suggest that our gut microbiome plays an important role in our health and wellbei...
International audienceThe false discovery proportion (FDP) is a convenient way to account for false ...
We propose the Terminating-Knockoff (T-Knock) filter, a fast variable selection method for high-dime...
Background: In high-throughput studies, hundreds to millions of hypotheses are typically tested. Sta...
Abstract Background When many (up to millions) of statistical tests are conducted in discovery set a...
Controlled variable selection is an important analytical step in various scientific fields, such as ...
In many fields of science, we observe a response variable together with a large number of potential ...
Recently, the scheme of model-X knockoffs was proposed as a promising solution to address controlled...
The discovery of biomarkers that are informative for cancer risk assessment, diagnosis, prognosis an...
International audienceContinuous improvement in medical imaging techniques allows the acquisition of...
We propose new methods to obtain simultaneous false discovery proportion bounds for knockoff-based a...
We consider the variable selection problem, which seeks to identify important variables influencin...
Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which pro...
Abstract Background Procedures for controlling the false discovery rate (FDR) are widely applied as ...
High-dimensional variable selection is a challenging task, especially when groups of highly correlat...
Recent discoveries suggest that our gut microbiome plays an important role in our health and wellbei...
International audienceThe false discovery proportion (FDP) is a convenient way to account for false ...
We propose the Terminating-Knockoff (T-Knock) filter, a fast variable selection method for high-dime...
Background: In high-throughput studies, hundreds to millions of hypotheses are typically tested. Sta...
Abstract Background When many (up to millions) of statistical tests are conducted in discovery set a...