Abstract Background For differential abundance analysis, zero-inflated generalized linear models, typically zero-inflated NB models, have been increasingly used to model microbiome and other sequencing count data. A common assumption in estimating the false discovery rate is that the p values are uniformly distributed under the null hypothesis, which demands that the postulated model fit the count data adequately. Mis-specification of the distribution of the count data may lead to excess false discoveries. Therefore, model checking is critical to control the FDR at a nominal level in differential abundance analysis. Increasing studies show that the method of randomized quantile residual (RQR) p...
Motivation: The human microbiome plays an important role in human health and disease. The compositio...
Background: Recent advances in next-generation sequencing (NGS) technology enable researchers to col...
<p>Panel (a): Quantile plot of Meme E-values for approximately 15,000 random runs, with E-values ex...
In microbiome research, it is often of interest to investigate the impact of clinical and environmen...
Traditional tools for model diagnosis for Generalized Linear Model (GLM), such as deviance and Pears...
Motivation: An important feature of microbiome count data is the presence of a large number of zeros...
MotivationThe human microbiome is variable and dynamic in nature. Longitudinal studies could explain...
The distributional assumption for a generalized linear model is often checked by plotting the ordere...
Normalization is the first critical step in microbiome sequencing data analysis used to account for ...
Abstract Background Identification of bacterial taxa associated with diseases, exposures, and other ...
Current practice in the normalization of microbiome count data is inefficient in the statistical sen...
Typical data in a microbiome study consist of the operational taxonomic unit (OTU) counts that have ...
<div><p>Typical data in a microbiome study consist of the operational taxonomic unit (OTU) counts th...
<div><p>Current practice in the normalization of microbiome count data is inefficient in the statist...
The microbiome abundance data is known to be over-dispersed and sparse count data. Among various zer...
Motivation: The human microbiome plays an important role in human health and disease. The compositio...
Background: Recent advances in next-generation sequencing (NGS) technology enable researchers to col...
<p>Panel (a): Quantile plot of Meme E-values for approximately 15,000 random runs, with E-values ex...
In microbiome research, it is often of interest to investigate the impact of clinical and environmen...
Traditional tools for model diagnosis for Generalized Linear Model (GLM), such as deviance and Pears...
Motivation: An important feature of microbiome count data is the presence of a large number of zeros...
MotivationThe human microbiome is variable and dynamic in nature. Longitudinal studies could explain...
The distributional assumption for a generalized linear model is often checked by plotting the ordere...
Normalization is the first critical step in microbiome sequencing data analysis used to account for ...
Abstract Background Identification of bacterial taxa associated with diseases, exposures, and other ...
Current practice in the normalization of microbiome count data is inefficient in the statistical sen...
Typical data in a microbiome study consist of the operational taxonomic unit (OTU) counts that have ...
<div><p>Typical data in a microbiome study consist of the operational taxonomic unit (OTU) counts th...
<div><p>Current practice in the normalization of microbiome count data is inefficient in the statist...
The microbiome abundance data is known to be over-dispersed and sparse count data. Among various zer...
Motivation: The human microbiome plays an important role in human health and disease. The compositio...
Background: Recent advances in next-generation sequencing (NGS) technology enable researchers to col...
<p>Panel (a): Quantile plot of Meme E-values for approximately 15,000 random runs, with E-values ex...