Microbiome

Denoising 16S Sequence Reads using UNOISE3 via VSEARCH and QIIME2

Previously, I shared some workflows for denoising 16S rRNA gene sequence data used by our lab. One of the examples used the popular UNOISE3 algorithm developed by Robert Edgar and implemented in USEARCH.

Microbiome AB/BA Crossover Design

The motivation for this post was also recent question on twitter asking how one might approach the analysis of an AB/BA crossover study when the goal is to estimate the difference in the relative abundance of microbial taxa under treatments A and B.

Multiple Imputation with Microbiome Data

The motivation for this post was a recent twitter exchange discussing difficulties faced when analyzing microbiome data with missing covariate values (i.e., incomplete sample metadata fields). I have run into this issue as well, and given the increasing size of microbiome studies, expect so to have many others.

Applying Topic Models to Microbiome Data in R

Inherent limitations with one-at-a-time (OaaT) feature testing (i.e., single feature differential abundance analysis) have contributed to the increasing popularity of mixture models for correlating microbial features with factors of interest (i.

Sample Size Considerations for Microbial Metagenomics Research

One of the most common questions we get from investigators at the Microbial Metagenomics Analysis Center (MMAC) is how many samples should I collect for my study? Even once we have a clearly stated and testable hypothesis, this is not always easy, since sample size calculations for microbiome studies are typically not amenable to closed form solutions (i.

Sample Size Considerations for Microbiome One-at-a-Time Differential Abundance Testing

I was recently asked how many samples we might need to formally power a differential abundance analysis of prevalent species in the human stool microbiome between “otherwise healthy” control samples and patients with a specific disease condition.

ANCOVA for Analyzing Pre-Post Microbiome Studies

I recently analyzed some data from an experiment with a pre-post study design where our estimands of interest were the baseline adjusted mean difference in Shannon diversity and species differential abundance between arms at post-treatment.

Examining Coverage of CLR Regression

I was thinking recently about the performance of linear models for differential abundance testing in microbiome studies (see here for an example) despite the sparse, extremely non-normal, and compositional nature of mixed microbial community taxonomic profiles.

Differential Feature Abundance Meta-Analysis

In a previous post, I highlighted some difficulties one faces with differential abundance (DA) testing when trying to identify those features (i.e., OTUs/ASVs, species, pathways, etc.) that “differ the most” according to some condition of interest.

Bootstrap Resampling for Ranking Differentially Abundant Taxa

Microbiome studies often seek to identify individual features (i.e., OTUs/ASVs, species, pathways, etc.) associated some condition (i.e., exposure, experimental treatment, etc.) of interest. This problem can be approached in many different ways, but most commonly, one-at-a-time (OaaT) feature screening is undertaken.