Sequence count data are commonly modelled using the negative binomial (NB) distribution. Several empirical studies, however, have demonstrated that methods based on the NB-assumption do not always succeed in controlling the false discovery rate (FDR) at its nominal level. In this paper, we propose a dedicated statistical goodness of fit test for the NB distribution in regression models and demonstrate that the NB-assumption is violated in many publicly available RNA-Seq and 16S rRNA microbiome datasets. The zero-inflated NB distribution was not found to give a substantially better fit. We also show that the NB-based tests perform worse on the features for which the NB-assumption was violated than on the features for which no significant dev...
Chromosomal aberrations, such as micronuclei (MN), have served as biomarkers of genotoxic exposure a...
<div><p>Current practice in the normalization of microbiome count data is inefficient in the statist...
Motivation: An important feature of microbiome count data is the presence of a large number of zeros...
Sequence count data are commonly modelled using the negative binomial (NB) distribution. Several emp...
<div><p>This work is about assessing model adequacy for negative binomial (NB) regression, particula...
Paper presented at the 5th Strathmore International Mathematics Conference (SIMC 2019), 12 - I6 Augu...
Recent Next Generation Sequencing methods provide a count of RNA molecules in the form of short read...
Motivation: The human microbiome plays an important role in human health and disease. The compositio...
RNA-Sequencing (RNA-Seq) has been widely adopted for quantifying gene expression changes in comparat...
This is the publisher’s final pdf. The published article is copyrighted by De Gruyter and can be fou...
Current practice in the normalization of microbiome count data is inefficient in the statistical sen...
We discuss the identification of genes that are associated with an outcome in RNA sequencing and oth...
Abstract Background A large number of probabilistic models used in sequence analysis assign non-...
Graduation date: 2015Access restricted to the OSU Community at author's request from June 24, 2014 -...
BACKGROUND: Deep sequencing of transposon mutant libraries (or TnSeq) is a powerful method for probi...
Chromosomal aberrations, such as micronuclei (MN), have served as biomarkers of genotoxic exposure a...
<div><p>Current practice in the normalization of microbiome count data is inefficient in the statist...
Motivation: An important feature of microbiome count data is the presence of a large number of zeros...
Sequence count data are commonly modelled using the negative binomial (NB) distribution. Several emp...
<div><p>This work is about assessing model adequacy for negative binomial (NB) regression, particula...
Paper presented at the 5th Strathmore International Mathematics Conference (SIMC 2019), 12 - I6 Augu...
Recent Next Generation Sequencing methods provide a count of RNA molecules in the form of short read...
Motivation: The human microbiome plays an important role in human health and disease. The compositio...
RNA-Sequencing (RNA-Seq) has been widely adopted for quantifying gene expression changes in comparat...
This is the publisher’s final pdf. The published article is copyrighted by De Gruyter and can be fou...
Current practice in the normalization of microbiome count data is inefficient in the statistical sen...
We discuss the identification of genes that are associated with an outcome in RNA sequencing and oth...
Abstract Background A large number of probabilistic models used in sequence analysis assign non-...
Graduation date: 2015Access restricted to the OSU Community at author's request from June 24, 2014 -...
BACKGROUND: Deep sequencing of transposon mutant libraries (or TnSeq) is a powerful method for probi...
Chromosomal aberrations, such as micronuclei (MN), have served as biomarkers of genotoxic exposure a...
<div><p>Current practice in the normalization of microbiome count data is inefficient in the statist...
Motivation: An important feature of microbiome count data is the presence of a large number of zeros...