A highly adaptive microbiome-based association test for survival traits

Koh, Hyunwook; Livanos, Alexandra E.; Blaser, Martin J.; Li, Huilin

doi:10.1186/s12864-018-4599-8

Methodology article
Open access
Published: 20 March 2018

A highly adaptive microbiome-based association test for survival traits

Hyunwook Koh¹,
Alexandra E. Livanos²,
Martin J. Blaser^3,4 &
…
Huilin Li ORCID: orcid.org/0000-0002-8288-7068¹

BMC Genomics volume 19, Article number: 210 (2018) Cite this article

2791 Accesses
26 Citations
9 Altmetric
Metrics details

Abstract

Background

There has been increasing interest in discovering microbial taxa that are associated with human health or disease, gathering momentum through the advances in next-generation sequencing technologies. Investigators have also increasingly employed prospective study designs to survey survival (i.e., time-to-event) outcomes, but current item-by-item statistical methods have limitations due to the unknown true association pattern. Here, we propose a new adaptive microbiome-based association test for survival outcomes, namely, optimal microbiome-based survival analysis (OMiSA). OMiSA approximates to the most powerful association test in two domains: 1) microbiome-based survival analysis using linear and non-linear bases of OTUs (MiSALN) which weighs rare, mid-abundant, and abundant OTUs, respectively, and 2) microbiome regression-based kernel association test for survival traits (MiRKAT-S) which incorporates different distance metrics (e.g., unique fraction (UniFrac) distance and Bray-Curtis dissimilarity), respectively.

Results

We illustrate that OMiSA powerfully discovers microbial taxa whether their underlying associated lineages are rare or abundant and phylogenetically related or not. OMiSA is a semi-parametric method based on a variance-component score test and a re-sampling method; hence, it is free from any distributional assumption on the effect of microbial composition and advantageous to robustly control type I error rates. Our extensive simulations demonstrate the highly robust performance of OMiSA. We also present the use of OMiSA with real data applications.

Conclusions

OMiSA is attractive in practice as the true association pattern is unpredictable in advance and, for survival outcomes, no adaptive microbiome-based association test is currently available.

Background

The human microbiota is the totality of all microorganisms living in and on the human body [1] and its role in human health and disease has been increasingly studied [2,3,4,5]. Human microbiota studies have been accelerated by the advent of next-generation sequencing technologies which enabled an unbiased characterization of all microorganisms, often by targeting the bacterial 16S ribosomal RNA (rRNA) gene [6, 7]. Diverse microorganisms can be identified based on sequence similarity to known 16S rRNA genes and classified into operational taxonomic units (OTUs) [8]. The OTUs are characterized by their estimated abundance (e.g., read count or relative abundance) and phylogenetic tree structure (i.e., taxonomical and evolutionary relationships). Accordingly, various microbial diversity metrics on the basis of microbial abundance and phylogenetic tree information have been surveyed in microbiome-based association studies [9]. The data are also large-scale including numerous OTUs (e.g., hundreds to thousands) with the presence of a long tail of rare microorganisms. We now pursue further discovery of associated microbial taxa for more integrative assessments about the root causes of maladies.

Recently, a number of microbiome-based association tests have been introduced to survey the entire microbial community (e.g., bacterial kingdom) and microbial taxa (e.g., phyla, classes, orders, families, genera, and species). As statistical power from massive univariate analyses for individual OTUs is considerably low due to the requisite multiple testing correction, here, our focus lies on association tests for microbial groups of multiple OTUs (i.e., the entire community (e.g., bacterial kingdom) and higher-level taxa (e.g., phyla, classes, orders, families, and genera)). A majority of existing methods (e.g., LEfSe [10], STAMP [11], DESeq2 [12], and metagenomeSeq-fit Zig [13]) relate aggregated microbial abundance for each taxon with health or disease outcome [14]. However, these methods are subject to a substantial loss of power as its underlying assumption - the same effect direction for all associated OTUs - is violated (e.g., within a taxon of interest, some OTUs are symbiotic, while others are pathogenic) [14]. To explain, when OTUs in a taxon of interest are all positively (or all negatively) associated with an outcome of interest (i.e., in the case of the same effect direction), their positive (or negative) association signals are amplified in their aggregated abundance, so that we can powerfully discover the association between the aggregated abundance and the outcome of interest. However, when OTUs in a taxon of interest are in mixed effect directions (i.e., some are positively, while others are negatively associated with an outcome of interest), their positive and negative association signals are canceled out in their aggregated abundance, so that we cannot discover any (positive or negative) association between the aggregated abundance and the outcome of interest. Detailed description and simulation studies have been addressed in [14]. Moreover, those aggregate-based methods do not utilize phylogenetic tree structure which considers taxonomical and evolutionary relationships among diverse microorganisms. As an alternative, distance-based analysis is popular and, for example, microbiome regression-based kernel association test (MiRKAT) [15] is spotlighted in this context. MiRKAT incorporates diverse ecologically informative distance metrics (e.g., unique fraction (UniFrac) distance [16,17,18] and Bray-Curtis dissimilarity [19]) into its kernel machine regression framework. As different distance metrics vary in the extent of the relative contributions from microbial abundance and phylogenetic tree information, they can be most accurate in different true underlying association patterns, respectively. However, prior knowledge about the true association pattern is limited and it is thus difficult to predict which distance metric is optimal in practice. The adaptive test of MiRKAT, called Optimal MiRKAT, approximates to an optimal test adaptively among multiple MiRKAT tests using different distance metric specifications; hence, in practice, Optimal MiRKAT is attractive. OMiAT [14] is a further adaptive test which approximates to an optimal test adaptively throughout the sum of powered score tests (SPU) [20] and MiRKAT tests. By including SPU tests in the search space, OMiAT robustly discovers rare, mid-abundant, and abundant associated lineages along with the functionality of Optimal MiRKAT.

There has also been increasing interest in discovering microbial taxa that are associated with survival (i.e., time-to-event) outcomes on the basis of prospective study designs (e.g., randomized clinical trials and prospective cohort studies) [21,22,23]. Survival outcomes are better informed by examining health or disease progression at multiple times over a lengthy period of follow-up. However, all of the above methods can handle only binary or continuous outcomes at a single time point. Currently, a remarkable association testing method in microbiome-based survival analysis is microbiome regression-based kernel association test for survival traits (MiRKAT-S) [24]. As with MiRKAT, MiRKAT-S incorporates distance metrics into its kernel machine regression framework, but is designed to handle survival outcomes. Plantinga et al. [24] also demonstrated that MiRKAT-S has higher power than other distance-based approaches used in prior studies, such as Cox proportional hazards regression followed by principal coordinates analysis [21] or Ward’s agglomerative hierarchical clustering method [25].

However, MiRKAT-S has three critical issues. First, Plantinga et al. [24] reports that MiRKAT-S performs poorly when associated OTUs are rare in abundance. Microbiome data usually contain mostly rare OTUs and only few OTUs representing most of the abundance, especially for gut or oral microbiota which has greater microbial diversity. This indicates that if the test works only for few dominant associated OTUs, numerous rare or mid-abundant taxa are simply ignored. As a remedy, we introduce a new set of association tests, namely, microbiome-based survival analysis using linear and non-linear bases of OTUs (MiSALN), which weigh rare, mid-abundant, and abundant OTUs, respectively, and its adaptive testing method, Optimal MiSALN (OMiSALN), to ensure a robust performance for OTUs with low or high abundance. Second, MiRKAT-S handles distance metrics one-by-one and no adaptive testing method is available. The cherry-picking approach from multiple item-by-item MiRKAT-S tests cannot correctly control type I error rate or the requisite multiple testing correction can lead to a substantial loss of power. Therefore, we also introduce an adaptive testing method for MiRKAT-S, namely, Optimal MiRKAT-S (OMiRKAT-S). Third, as with MiRKAT, MiRKAT-S can assess only the entire community and is not currently applicable to higher-level taxa. Thus, we extend its usability as a general microbial group analytic method.

Our major proposed method is a highly adaptive test, namely, optimal microbiome-based survival analysis (OMiSA), which creates an optimal test throughout multiple MiSALN and MiRKAT-S tests. As a result, OMiSA performs well whether the underlying associated lineages are rare or abundant by MiSALN and phylogenetically related or not by MiRKAT-S.

We now present the methodological details of the approach, and then provide extensive simulations and real data applications, before discussing limitations and other feasibilities.

Methods

This section is devoted to describe the methodological details of our proposed methods. Here, we first organize our new methods separated from existing methods in Fig. 1. That is, MiRKAT-S [24] for the individual use of different distance metrics is an existing method (blue letters, Fig. 1), and our methods (red letters, Fig. 1) are its adaptive test, OMiRKAT-S, MiSALN and its adaptive test, OMiSALN, and OMiSA. Again, OMiSA is our major proposed method, and the other individual and sub-adaptive tests are necessary to reach our final destination, OMiSA.

Models and notations

Suppose that there are n subjects, p OTUs, and q covariates (e.g., age and sex) and the subscripts, i, j, and k, indicate a subject, an OTU, and a covariate, respectively. For each subject i, let T_i be a survival time, C_i be a censoring time, and Y_i be an observed time. Then, Y_i is defined as Y_i = min(T_i, C_i) and an event indicator, δ_i, is defined as δ_i = I(T_i ≤ C_i). The ordered observed event times are denoted by τ₁, ..., τ_m, where m is the number of total events (m ≤ n), and the risk set at time τ_g, is denoted by R_g, for g = 1, ..., m. In addition, for each subject i, denote a p × 1 vector, Z_i, for the microbial composition of the entire community or a higher-level taxon, marked for its elements in OTU-level relative abundance as Z_ij for j = 1, ..., p, and denote a k × 1 vector, X_i, for the covariates, marked for its elements as X_ik for k = 1, ..., q. Here, we assume that n subjects are identically and independently distributed (e.g., random subjects) and C_i is independent of T_i conditional on Z_i and X_i.

To relate microbial composition with survival outcomes adjusting for covariates, we consider a Cox proportional hazard model (Eq. 1) [26].

$$ {\uplambda}_{\mathrm{i}}\left(\mathrm{t}\right)={\uplambda}_0\left(\mathrm{t}\right){\mathrm{e}}^{\Sigma_{\mathrm{k}=1}^{\mathrm{q}}{\upalpha}_{\mathrm{k}}{\mathrm{X}}_{\mathrm{i}\mathrm{k}}+\mathrm{h}\left({\mathrm{Z}}_{\mathrm{i}}\right)}, $$

(1)

where λ_i(t) is the conditional hazard function given Z_i and X_i, λ₀(t) is the baseline hazard function, α’s are coefficients for the effect of covariates (e.g., age and sex), and h(Z_i) is a function which characterizes the relationship between microbial composition and survival outcomes. For example, if we specify h(Z_i) = $ \sum \limits_{\mathrm{j}=1}^{\mathrm{p}}{\upbeta}_{\mathrm{j}}{\mathrm{Z}}_{\mathrm{ij}} $, we can relate the linear effects of OTUs in relative abundance to the log hazard rate. The non-linear effects of OTUs in relative abundance can also be surveyed by the use of non-linear bases of OTUs (e.g., polynomials/splines) [27]. Moreover, we can specify h(Z_i) more flexibly by the use of different positive semi-definite kernel functions modeled for different distance/similarity metrics among subjects [27, 28]. In the following sections, we introduce two different machineries for the specification of h(Z_i) - one for the use of different linear/non-linear bases of OTUs and the other for the use of different kernel functions - and illustrate how their performance varies by different true underlying association patterns. The former is a newly introduced method, MiSALN. The latter is an existing method, MiRKAT-S [24]; hence, we describe its main ideas and formula and refer to its original paper for more details. Of most importance is that we introduce new adaptive testing methods which approximate to an optimal test for each of the two machineries (namely, OMiSALN and OMiRKAT-S, respectively) and throughout the two different machineries (namely, OMiSA).

MiSALN

Suppose that β = (β₁, …, β_p)^T is the vector of regression coefficients for p OTUs. We are particularly interested in testing the null hypothesis of no association between the microbial composition consisting of those p OTUs and survival outcomes, H₀: β = (β₁, …, β_p)^T = 0. Assuming the coefficients, β₁, ..., β_p, are random and independent with mean zero, a common variance, σ², and a pairwise correlation matrix, Ρ, (i.e., E(β) = 0 and Cov(β) = σ²Ρ), the same null hypothesis and its corresponding alternative can be formulated as a variance-component score test with Eq. 2 [28,29,30,31,32].

$$ {\mathrm{H}}_0:{\upsigma}^2=0\;\mathrm{vs}.{\mathrm{H}}_1:{\upsigma}^2>0 $$

(2)

Note here that any full distributional form for the coefficients, β₁, …, β_p, was not required, but we need to specify the correlation matrix, Ρ [29]. For the choice of Ρ, we can consider different choices which have greater deviations from H₀ in directions corresponding to the larger eigenvalues of P [29]. However, we consider the p × p identity matrix, I_p, for no correlation over β_j’s for MiSALN [30].

The Cox proportional hazard model for the null hypothesis can be formulated with Eq. 3.

$$ {\uplambda}_{\mathrm{i}}\left(\mathrm{t}\right)={\uplambda}_0\left(\mathrm{t}\right){\mathrm{e}}^{\Sigma_{\mathrm{k}=1}^{\mathrm{q}}}{\upalpha}_{\mathrm{k}}{\mathrm{X}}_{\mathrm{i}\mathrm{k}} $$

(3)

Based on the maximum likelihood estimates, $ \widehat{\upalpha} $’s, the estimated cumulative hazard rate for subject i at its observed time, $ {\widehat{\Lambda}}_{\mathrm{i}} $, can be derived as in Eq. 4 [29]. We here used Efron’s approximation [33] to handle observations which have tied survival times.

$$ {\widehat{\Lambda}}_{\mathrm{i}}={\Sigma}_{\uptau_g\le {\mathrm{Y}}_{\mathrm{i}}}\frac{\exp \left({\Sigma}_{\mathrm{k}=1}^{\mathrm{q}}{\widehat{\upalpha}}_{\mathrm{k}}{\mathrm{X}}_{\mathrm{i}\mathrm{k}}\right)}{\Sigma_{\mathrm{f}\in {\mathrm{R}}_{\mathrm{g}}}\exp \left({\Sigma}_{\mathrm{k}=1}^{\mathrm{q}}{\widehat{\upalpha}}_{\mathrm{k}}{\mathrm{X}}_{\mathrm{f}\mathrm{k}}\right)} $$

(4)

Based on $ \widehat{\upalpha} $’s and the resulting estimates, $ {\widehat{\Lambda}}_{\mathrm{i}} $’s, Verweij et al. [29] derives a variance-component score test statistic to test H₀: σ² = 0 against H₁: σ² > 0 as in Eq. 5.

$$ \mathrm{U}={\left(\mathrm{d}-\hat{\Lambda}\right)}^{\mathrm{T}}\mathrm{R}\left(\mathrm{d}-\hat{\Lambda}\right), $$

(5)

where d = (δ₁, …, δ_n)^T, $ \widehat{\mathrm{e}} $ = $ {\left({\widehat{\Lambda}}_1,\dots, {\widehat{\Lambda}}_{\mathrm{n}}\right)}^{\mathrm{T}} $, and R is the n × n correlation matrix for n subjects that we need to specify. Here, $ \mathrm{d}-\widehat{\Lambda} $ is the vector of the estimated martingale residuals under the null model (Eq. 3). Verweij et al. also derives the mean and variance of U and demonstrates that the null distribution of the standardized score test closely approximates to standard normal distribution [29]. However, since our proposed tests are based on a residual permutation-based scheme for p-value calculation and the mean and variance of U are evaluated under the null, the unstandardized score test, U, is sufficient in our study.

Of importance is that with different specifications for R, we can survey different correlation structures among subjects. For MiSALN, our choices for R are formulated with Eq. 6.

$$ {\mathrm{R}}_{\mathrm{MiSALN}\left(\upgamma \right)}={\mathrm{Z}}^{\upgamma}{\mathrm{Z}}^{\upgamma^{\mathrm{T}}}, $$

(6)

where Z is the n × p matrix for OTU relative abundances (i.e., compositions), Z = (Z₁, …, Z_n)^T, and γ (ϵ ℝ⁺) powers Z and needs to be pre-specified. The variance-component score test with this correlation matrix, R_MiSALN(γ), can be simply derived as in Eq. 7.

$$ {\mathrm{U}}_{\mathrm{MiSALN}\left(\upgamma \right)}={\left(\mathrm{d}-\hat{\Lambda}\right)}^{\mathrm{T}}{\mathrm{R}}_{\mathrm{MiSALN}\left(\upgamma \right)}\left(\mathrm{d}-\hat{\Lambda}\right), $$

(7)

The correlation structure, R_MiSALN(γ), describes pairwise similarities in microbial abundance among subjects and the variance-component score test, U_MiSALN(γ), represents the degree of overall association between R_MiSALN(γ) and the estimated martingale residuals [30, 31]. Here, only the microbial abundance information is contributed and no phylogenetic tree information is incorporated.

Note that, γ transforms OTUs to the γ’s power of the original relative abundances, so that different bases of OTUs can be surveyed. When γ = 1, the original scale of OTUs is used for testing the linear effect of OTUs. The resulting correlation matrix, ZZ^T, is equivalent to the linear kernel in kernel machine regression models [28] and has also been used for a gene-set association testing method, namely, Global Test [30]. When γ ≠ 1, the non-linear bases of OTUs can be surveyed. Here, we demonstrate different γ value specifications as different weighting schemes for OTUs in relative abundance as follows. As γ increases, abundant OTUs will be relatively weighted, while rare OTUs will gradually be lost, but vice versa as γ decreases. Therefore, we can expect that when abundant OTUs are associated with survival outcomes, a large value of γ can be more suitable by weighting them more, but vice versa when rare OTUs are associated. However, in practice, the true underlying association pattern is mostly unknown and we cannot presume whether rare, mid-abundant, or abundant OTUs are associated with survival outcomes. Therefore, we propose a data-driven approach, Optimal MiSALN (OMiSALN), which approximates to an optimal test adaptively through different γ value specifications and its test statistic is formulated with Eq. 8.

$$ {\mathrm{Q}}_{\mathrm{OMiSALN}}=\underset{\gamma \in \Gamma}{\min }{\mathrm{P}}_{\mathrm{MiSALN}\left(\upgamma \right)}, $$

(8)

where Г is a set of candidate γ values and P_MiSALN(γ) is the estimated p-value for MiSALN(γ), where γϵГ. We can observe that Q_OMiSALN is the minimum p-value among different MiSALN(γ) tests, where γϵГ. Again, Q_OMiSALN itself is a test statistic which requires its own p-value estimation. We use a residual-based permutation method to estimate p-values for individual MiSALN(γ) tests, where γϵГ, and OMiSALN (see Additional file 1).

For a set of candidate γ values, we used Г = {1/4, 1/3, 1/2, 1} and it was sufficient in our simulations and real data analysis.

MiRKAT-S

The key idea behind MiRKAT-S [24] is that diverse distance metrics (e.g., UniFrac distance [16,17,18] and Bray-Curtis dissimilarity [19]) can be incorporated into the kernel machine Cox proportional hazards model. Hence, we can survey the relationship between ecologically related metrics and survival outcomes on health or disease with covariate adjustments (e.g., age and sex) [24]. First, we need to specify a sample-by-sample pairwise distance matrix based on a chosen distance metric and transform it into a kernel (similarity) matrix using the kernel formula, Eq. 9.

$$ \mathrm{K}=-\frac{1}{2}\left(\mathrm{I}-\frac{11^{\hbox{'}}}{\mathrm{n}}\right){\mathrm{D}}^2\left(\mathrm{I}-\frac{11^{\hbox{'}}}{\mathrm{n}}\right), $$

(9)

where D is the n × n pairwise distance matrix and D² is its element-wise square, I is the n × n identity matrix, and 1 in 11′ is the vector of n ones. To ensure the kernel matrix, K, to be positive semi-definite, negative eigenvalues are replaced with zero [24]. Then, using the resulting kernel matrix, the variance-component score statistic can be formulated with Eq. 10 [24, 27].

$$ {\mathrm{U}}_{\mathrm{MiRKAT}-\mathrm{S}\left(\mathrm{k}\right)}={\left(\mathrm{d}-\hat{\Lambda}\right)}^{\mathrm{T}}{\mathrm{K}}_{\left(\mathrm{k}\right)}\left(\mathrm{d}-\hat{\Lambda}\right), $$

(10)

where k is an index for a particular kernel matrix based on a chosen distance metric. Plantinga et al. [24] has also proposed a modified score statistic which accounts for over-dispersion, but since we calculate p-values based on a residual permutation-based method and the dispersion parameter, $ \frac{1}{{\left(\mathrm{d}-\widehat{\Lambda}\right)}^{\mathrm{T}}\left(\mathrm{d}-\widehat{\Lambda}\right)} $, is evaluated under the null, the variance-component score test statistic of Eq. 10 is sufficient in our study.

Importantly, different distance metrics reflect different relative contributions from microbial abundance and phylogenetic tree information; as such, the performance of MiRKAT-S differs according to the choice of distance metric and the true underlying association pattern [16,17,18, 24]. The UniFrac distances are constructed based on phylogenetic tree information and the contribution of microbial abundance is modulated by different weighting schemes. The unweighted UniFrac distance incorporates only microbial presence/absence information so that it is most inclined to phylogenetic tree information [16], whereas the weighted UniFrac distance further incorporates microbial abundances [17]. In this context, the generalized UniFrac distance is regarded as a compromised version between the unweighted and weighted UniFrac distances [18]. In contrast, the Bray-Curtis dissimilarity [19] does not incorporate any phylogenetic tree information so that it is most inclined to microbial abundance information. Accordingly, when associated OTUs are phylogenetically related, the UniFrac distances can be better choices than Bray-Curtis dissimilarity, but vice versa when they are not phylogenetically related. However, we cannot predict which distance metric is optimal to our study. Therefore, here, we proposed a data-driven approach, namely, Optimal MiRKAT-S (OMiRKAT-S), which is taken adaptively through multiple distance metric specifications and its test statistic is formulated with Eq. 11.

$$ {\mathrm{Q}}_{\mathrm{OMiRKAT}-\mathrm{S}}=\underset{\mathrm{k}\in \uppsi}{\min }{\mathrm{P}}_{\mathrm{MiRKAT}-\mathrm{S}\left(\mathrm{k}\right),} $$

(11)

where Ψ is a set of candidate distance metrics and P_{MiRKAT − S(k)} is the estimated p-value for U_{MiRKAT − S(k)}, where kϵΨ. Note that, OMiRKAT-S is similar to Optimal MiRKAT [15], but the difference is that OMiRKAT-S handles survival outcomes, while Optimal MiRKAT handles binary or continuous outcomes at a time point. Here again, Q_{OMiRKAT − S} is the minimum p-value among different MiRKAT-S(k) tests, where kϵΨ, and it is a test statistic that requires its own p-value estimation. Similar to MiSALN(γ)/OMiSALN, a residual-based permutation method was used to estimate p-values for individual MiRKAT-S(k) tests, where kϵΨ, and OMiRKAT-S (see Additional file 1).

For a set of candidate distance metrics, Ψ, we used Ψ = {unweighted UniFrac (K_U), generalized UniFrac(0.5) (K_0.5), weighted UniFrac (K_W), Bray-Curtis (K_BC)}, where K_0.5 is the generalized UniFrac distance with the parameter, ϴ = 0.5, as suggested [9].

OMiSA

Our major proposed method, OMiSA, approximates to an optimal test adaptively throughout all the different variance-component score tests of MiSALN(γ), where γϵГ, and MiRKAT-S(k), where kϵΨ, and its test statistic can be simply formulated with Eq. 12.

$$ {\mathrm{Q}}_{\mathrm{OMiSA}}=\min \left({\mathrm{Q}}_{\mathrm{OMiSA}\mathrm{LN}},{\mathrm{Q}}_{\mathrm{OMiRKAT}-\mathrm{S}}\right) $$

(12)

Q_OMiSA is the minimum p-value among different MiSALN(γ) tests, where γϵГ, and MiRKAT-S(k) tests, where kϵΨ. It is a test statistic, like Q_OMiSALN (Eq. 8) and Q_{OMiRKAT − S} (Eq. 11). We do not report this genuine minimum p-value as the final p-value to be reported for OMiSA, but estimate its p-value using a residual permutation-based method (see Additional file 1). We emphasize that throughout the machineries of MiSALN and MiRKAT-S, OMiSA powerfully discovers microbial taxa whenever their underlying associated OTUs are rare or abundant by MiSALN and phylogenetically related or not by MiRKAT-S. Our extensive simulations in later sections also demonstrate the robust performance of OMiSA.

Assessment of higher-level taxa

We extend all the individual and adaptive tests as general group analytic methods which can assess any higher-level taxon (e.g., phyla, classes, orders, families, and genera), not only the entire community (e.g., bacterial kingdom), as long as they include multiple OTUs with phylogenetic tree structure. The only matter that requires attention is that when we assess higher-level taxa, their OTU relative abundances (i.e., compositions) are computed based on total reads in the entire community (i.e., OTU relative abundances are not sub-compositions which have unit sum to each higher-level taxon). Specifically, such normalization needs to be applied to the OTU relative abundances for MiSALN and to the UniFrac distances for MiRKAT-S.

A graphical representation

As for visual representations of discoveries, we used an existing software tool, GraPhlAn [34], which is addressed later in our real data analysis. As GraPhlAn is flexibly customizable with beautiful circular representations of hierarchical taxonomic tree [34], here, we do not introduce any new graphical representation and suggest to use GraPhlAn after obtaining outcomes from OMiSA.

Results

Simulations

This section is devoted to simulations which evaluate individual MiSALN and MiRKAT-S tests and their adaptive tests, OMiSALN, OMiRKAT-S, and OMiSA in terms of type I error and statistical power. While the association testing methods can be applied to higher-level taxa, as a demonstration, here, we survey the entire community.

Simulation design

We simulated microbiome data according to prior approaches [35] which are based on a Dirichlet-multinomial distribution reflecting real microbial composition. We first estimated proportion means and a dispersion parameter to be inserted into the Dirichlet-multinomial distribution using actual intestinal microbiome data of non-obese diabetic (NOD) mice in [23]. The complete microbiome data include NOD mice in different treatment groups and sequencing time points; however, as a demonstration, we selected 35 fecal samples from NOD mice at 6 weeks of age in the control group which had not been disturbed by antibiotic exposure. Then, 353 OTUs which have proportional mean abundance > 10⁻⁴ were included in the analysis. The total reads per sample was set to be 1000 [15, 24]. Based on these specifications, we simulated OTU counts for small (n = 50) and large (n = 100) samples, respectively, from the Dirichlet-multinomial distribution.

Two covariates for age and sex were simulated from a normal distribution, N(50, 5²) and a Bernoulli distribution, Bern(0.5), respectively. The survival time, T_i, was simulated through Eq. 13, assuming proportional hazards and a Weibull distribution, Weibull(2,2), for the baseline at age = 50 and sex = 0 [28, 36].

$$ {\mathrm{T}}_{\mathrm{i}}=\sqrt{-\frac{4\log {\mathrm{U}}_{\mathrm{i}}}{\exp \operatorname{}\Big(0.5\left({\mathrm{age}}_{\mathrm{i}}-50\right)+0.5{\mathrm{sex}}_{\mathrm{i}}+{\Sigma}_{\mathrm{j}=1}^{\mathrm{p}}{\beta}_{\mathrm{j}}\mathrm{scale}\left({\mathrm{Z}}_{\mathrm{i}\mathrm{j}}\right)}} $$

(13)

where U_i was randomly sampled from a uniform distribution, Unif(0,1), β_j is a coefficient for each OTU j = 1,…,p, and scale(Z_ij) = $ \frac{{\mathrm{Z}}_{\mathrm{ij}}-\mathrm{mean}\left({\mathrm{Z}}_{1\mathrm{j}},{\mathrm{Z}}_{2\mathrm{j}},\dots, {\mathrm{Z}}_{\mathrm{nj}}\right)}{\mathrm{SD}\left({\mathrm{Z}}_{1\mathrm{j}},{\mathrm{Z}}_{2\mathrm{j}},\dots, {\mathrm{Z}}_{\mathrm{nj}}\right)} $, for subjects i = 1, …, n and OTUs j = 1, …, p. The censoring time, C_i, was simulated based on uniform distribution with two different schemes to survey different extent of censoring: 1) C_i ~ Unif(0,10) which is of the estimated proportions of censored outcomes, 25.78%, and 25.88%, for small (n = 50) and large samples (n = 100), respectively, and 2) C_i ~ Unif(0,5) which is of the estimated proportions of censored outcomes, 40.42% and 40.48%, for small (n = 50) and large samples (n = 100), respectively (Table 1). Then, the observed time, Y_i, and the event indicator, δ_i, were calculated by the formula, Y_i = min(T_i, C_i) and δ_i = I(T_i ≤ C_i), respectively.

Table 1 Estimated type I errors and proportions of censored outcomes for different sample sizes and censoring schemes (Unit: %)

Full size table

Empirical type I error rates with the proportions of censored outcomes were estimated by setting β = (β₁, …, β_p)^′ = 0. Statistical power was estimated with four different scenarios: (i), 10 most abundant OTUs; (ii), 10 random OTUs; (iii), 10 least abundant OTUs; and (iv), OTUs in a selected cluster performing the partitioning-around-medoids (PAM) algorithm [37], which are associated with survival outcomes, respectively. The first three scenarios evaluate discovery ability when abundant, random/mid-abundant, or rare OTUs are associated. For the fourth scenario, we distributed all OTUs into 10 clusters using the PAM algorithm on the basis of their cophenetic distances in the real phylogenetic tree. The 10 clusters have contained 7.8%, 8.2%, 13.9%, 6.6%, 33.3%, 8.3%, 1.1%, 1.8%, 17.2%, and 1.8% of total reads, respectively, and in each simulation iteration, the selection of one associated cluster was randomized to overcome arbitrary selection. The fourth scenario additionally reflects phylogeny, which may provide a more realistic evaluation.

We also surveyed different effect sizes and directions for the associated OTUs. Λ is denoted as a set of indices for the associated OTUs. For the same effect direction, we set β_j ∈ Λas a vector of the elements randomly sampled from Unif(0,1), Unif(0,2), or Unif(0,3) and for the mixed effect direction, we set β_j ∈ Λas a vector of the elements randomly sampled from Unif(− 1,1), Unif(− 2,2), or Unif(− 3,3).

Simulation results

Type I error

We can observe that empirical type I error rates are well-controlled at the significance level of 5% across all the individual and adaptive tests for different censoring schemes and for both small samples (n = 50) and large samples (n = 100) (Table 1).

Power

As we observed similar comparative performances of individual and adaptive tests for different sample sizes and censoring schemes, we include here only the outcomes for large samples (n = 100) and the censoring scheme, Unif(0,10), and moved all of the other outcomes to Additional material (see Additional file 2: Figure S1, Additional file 3: Figure S2, Additional file 4: Figure S3, Additional file 5: Figure S4, Additional file 6: Figure S5, Additional file 7: Figure S6). Figs. 2 and 3 report estimated powers for the same effect direction and mixed effect directions, respectively. We observe that with the increase of effect size, power increases for all the methods under any simulation setting (Figs. 2 and 3), as expected. We also observe that MiRKAT-S/OMiRKAT-S gains slightly more power than MiSALN/OMiSALN for the same effect direction, but it is vice versa for mixed effect directions (Figs. 2 and 3).

MiSALN is powerful using either a large γ value when abundant OTUs are associated with survival outcomes (Figs. 2a and 3a), or using a small γ value when rare OTUs are associated (Figs. 2c and 3c), as expected. For MiRKAT-S, the Bray-Curtis dissimilarity gain power for the first two scenarios in which only the microbial abundances for abundant and random OTUs influence the association (Figs. 2a, b and 3a, b), while the UniFrac distances gain power for the fourth scenario in which phylogenetic tree information is reflected (Figs. 2d and 3d), as expected.

It is notable that the Bray-Curtis dissimilarity is most powerful across all the tests when abundant OTUs are associated with survival outcomes, resulting in high power for OMiRKAT-S (Figs. 2a and 3a). However, as reported in [24], the major problem of MiRKAT-S is that it is underpowered using any distance metric when rare OTUs are associated with survival outcomes; as such, OMiRKAT-S is also underpowered (Figs. 2c and 3c). In contrast, we observed that MiSALN using a small γ value gains power when rare OTUs are associated with survival outcomes, resulting in power for OMiSALN (Figs. 2c and 3c). Overall, we observe that individual item-by-item tests fit specific scenarios, respectively, and there is no single test which fits every scenario. Remarkably, OMiSA is highly robust, approaching the most powerful performance, throughout the four scenarios in which abundant, random, and rare OTUs are associated with survival outcomes and phylogenetic tree information is present (Figs. 2a-d and 3a-d).

Software comparison

For individual MiRKAT-S tests, we also tried the software package, MiRKATS [24], to determine whether there is any disparity between software facilities. For the use of MiRKATS, we applied the permutation method for small samples (n = 50) and the analytic p-value calculation for large samples (n = 100), as suggested [24]. We show that two software packages, OMiSA (our software) and MiRKATS [24], produce nearly identical power estimates for individual MiRKAT-S tests (see Additional file 8: Figure S7, Additional file 9: Figure S8, Additional file 10: Figure S9, Additional file 11: Figure S10, Additional file 12: Figure S11, Additional file 13: Figure S12, Additional file 14: Figure S13, Additional file 15: Figure S14).

Real data analysis

Early-life interactions between microbiota and their hosts have been considered as potential key factors in immunological and metabolic development [38]. Type 1 diabetes (T1D) is an autoimmune disease which is growing in incidence with decreasing age of onset [39]. Livanos et al. performed a microbiome profiling study to survey whether antibiotic-mediated gut microbiome perturbation accelerates onset of T1D in mice [23]. For the study, NOD mice were exposed to different antibiotic treatments or not, and their intestinal microbial populations were sequenced over time, as described in detail [23]. In brief, fecal, cecal, or ileal specimens from NOD mice were collected and the V4 region of bacterial 16S rRNA gene was amplified by triplicate PCR (F515/R806) using barcoded fusion primers. OTUs, as well as their phylogenetic relationships, were examined using the QIIME pipeline [8].

The data are extensive and motivate diverse study orientations, but, here, we analyze whether perturbed microbial composition by antibiotic use is associated with T1D survival. This analysis is necessary as a part of mediation analysis to understand the process by which the antibiotic use causally affects T1D development [40]. As a demonstration, we used 19 fecal samples from male NOD mice at 6 weeks of age in therapeutic-dose pulsed antibiotic (PAT) treatment. The mice were followed for 30 weeks, during this time, 10 mice developed T1D, while 9 mice did not. Livanos et al. [23] also performed analyses to determine differences in relative abundance of genera between the T1D-free and T1D-development groups at 30 weeks of follow-up. However, those analyses were limited to surveying the binary T1D status (T1D-free vs. T1D-development) at an arbitrary single time point. Thus, we re-analyzed the data by taking the entire survival process into account using our proposed methods. We applied a filtering rule, a proportion mean > 10⁻⁴, identifying 120 OTUs in the entire community. We first conducted association testing for the entire community (i.e., bacterial kingdom) using the individual and adaptive tests. Then, we tested microbial taxa at five different taxonomic levels, phylum, class, order, family, and genus, respectively. We tested microbial taxa that have ≥2 OTUs in the data using OMiSA and microbial taxa that have only one OTU in the data using univariate Cox proportional hazards models. At each taxonomic level, we omitted OTUs that do not have any taxonomy assignment. For multiple testing correction, we applied the Benjamini-Hochberg (BH) procedure [41] per taxonomic level. However, we are not restricted to the use of BH procedure for multiple testing correction and other less conservative procedures might be considered [42]. No covariate adjustments were included as with [23].

Table 2 reports adjusted p-values for the entire community using the individual and adaptive tests. Based on either of the adaptive tests, OMiSALN, OMiRKAT-S, and OMiSA, we observe that the microbial composition of the entire community is significantly associated with T1D survival (Table 2). Here, we further observe that MiRKAT-S using the Bray-Curtis dissimilarity has the smallest p-value among all the individual tests and the p-value for MiSALN is decreasing as the specified γ value is increasing (Table 2). This may indicate that relatively abundant OTUs in the entire community are associated with T1D survival in microbial abundance.

Table 2 The estimated p-values for the association between microbial composition of the entire community and T1D survival

Full size table

We created Fig. 4 using a software tool, GraPhlAn, for circular representations of hierarchical taxonomic tree [34] and Fig. 4 summarizes discovered taxa (red circles). We observe that the microbial composition of a phylum, Verrucomicrobia (adj. p: 0.005), a class, Verrucomicrobiae (adj. p: <.001), an order, Verrucomicrobiales (adj. p: <.001), two families, Verrucomicrobiaceae (adj. p: <.001) and Streptococcaceae (adj. p: 0.040), and a genus, Akkermansia (adj. p: 0.012), are significantly associated with T1D survival (Fig. 4).

Discussion

We described that our computational procedures are efficient as they are based on the score-based tests without double permutations (see Additional file 1), but Plantinga et al.’s analytical p-value calculation - based on a closed form asymptotic null distribution suggested for large samples (n > 100) - should be more efficient [24]. Testing all higher-level taxa throughout all different taxonomic ranks may impose a greater computational burden on OMiSA. Thus, for such complete association mapping, we suggest to use multi-core computers to simultaneously implement multiple OMiSA tests. Our software package is currently written in R to exploit existing R functions, but the use of lower-level languages (e.g., C or Fortran) may further enhance its computational performance. Although OMiSA approaches the smallest p-value and thus the highest power adaptively, varying p-values from individual candidate tests may provide some biological insights on the basis of their methodological aspects and simulation outcomes (Figs. 2 and 3). For MiSALN, we may, in reverse, infer that when the use of a small γ value gains a relatively smaller p-value, rare OTUs might be associated, but it is vice versa when the use of a large γ value gains a relatively smaller p-value. Similarly, for MiRKAT-S, when the use of UniFrac distances gains a relatively smaller p-value than the use of Bray-Curtis dissimilarity, associated OTUs might be phylogenetically related, while when the use of Bray-Curtis dissimilarity gains a relatively smaller p-value than the use of UniFrac distances, associated OTUs might not be phylogenetically related. However, “rare or abundant” and “phylogenetically related or not” are conceptual terms with no firm definition, so such interpretations might be challenging.

It is a well-recognized issue in the microbiome research community that the compositional nature of data renders spurious correlation among microbial markers by the unit sum constraint per sample. As a remedy, diverse data transformation procedures (e.g., log-ratio transformation [43]) have been studied, but it is still highly debatable on which procedure is the most robust one [13, 44]. Thus, here, we did not employ any specific data transformation procedure, but would let users decide. For example, centered log-ratio transformation [43] can be considered for OMiSALN as it is usable for either data format in count or composition.

While we illustrated our proposed methods to be used for microbiota profiling studies via 16S rRNA gene target sequencing, they are transferrable to metagenomic studies via whole microbial genome sequencing [45]. As long as the data are organized for a group of multiple count markers with their phylogenetic tree structure, any of our proposed methods can be readily used.

Conclusions

As current item-by-item approaches have limitations due to the unknown true association pattern, we introduced three new adaptive tests, OMiSALN, OMiRKAT-S, and OMiSA, for microbiome-based association studies with survival outcomes. We ascertained that they are all statistically valid with well-controlled type I error rates for different sample sizes and censoring proportions. Among those, our major proposed method, OMiSA, is highly attractive, as it is robustly powerful through a breadth of association patterns. We also presented that it is not restricted to test the entire community, but also applicable to any taxonomic level above species, and our residual-based permutation method always produces a closed form p-value (see Additional file 1 [14, 15, 20, 46]). Consequently, we recommend that OMiSA can extensively be used for microbiome-based survival analysis as a robust group analytic method.

Abbreviations

BH:: Benjamini-Hochberg
MiRKAT:: Microbiome regression-based kernel association test
MiRKAT-S:: Microbiome regression-based kernel association test for survival traits
MiSALN:: Microbiome-based survival analysis using linear and non-linear bases of OTUs
NOD:: Non-obese diabetic
OMiAT:: Optimal microbiome-based association test
OMiRKAT-S:: Optimal microbiome regression-based kernel association test for survival traits
OMiSA:: Optimal microbiome-based survival analysis
OMiSALN:: Optimal microbiome-based survival analysis using linear and non-linear bases of OTUs
OTU:: Operational taxonomic unit
PAM:: Partitioning-around-medoids
PAT:: Pulsed antibiotic
SPU:: The sum of powered score tests
T1D:: Type 1 diabetes
UniFrac:: Unique fraction

References

Human Microbiome Project Consortium. A framework for human microbiome research. Nature. 2012;486:215–21.
Article Google Scholar
Cho I, Yamanishi S, Cox L, Methé BA, Zavadil J, Li K, et al. Antibiotics in early life alter the murine colonic microbiome and adiposity. Nature. 2012;488:621–6.
Article CAS PubMed PubMed Central Google Scholar
Cox LM, Yamanish S, Sohn J, Alekseyenko AV, Leung JM, Cho I, et al. Altering the intestinal microbiota during a critical developmental window has lasting metabolic consequences. Cell. 2013;158:705–21.
Article Google Scholar
Bokulich NA, Chung J, Battaglia T, Henderson N, Jay M, Li H, et al. Antibiotics, birth mode, and diet shape microbiome maturation during early life. Sci Transl Med. 2016;8:343–82.
Article Google Scholar
Mahana D, Trent CM, Kurtz ZD, Bokulich NA, Battaglia T, Chung J, et al. Antibiotic perturbation of the murine gut microbiome enhances the adiposity, insulin resistance, and liver disease associated with high-fat diet. Genome Med. 2016;8:48.
Article PubMed PubMed Central Google Scholar
Woese CR, Fox GE, Zablen L, Uchida T, Bonen L, Pechman K, et al. Conservation of primary structure in 16S ribosomal RNA. Nature. 1975;254:83–5.
Article CAS PubMed Google Scholar
Hamady M, Knight R. Microbial community profiling for human microbiome projects: tools, techniques. Genome Res. 2009;11:1998–52.
Google Scholar
Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010;7:335–6.
Article CAS PubMed PubMed Central Google Scholar
Li H. Microbiome, metagenomics, and high-dimensional compositional data analysis. Annu Rev Stat Appl. 2015;2:73–94.
Article Google Scholar
Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, et al. Metagenomic biomarker discovery and explanation. Genome Biol. 2011;12:R60.
Article PubMed PubMed Central Google Scholar
Parks DH, Tyson GW, Hugenholtz P, Beiko RG. STAMP: statistical analysis of taxonomic and functional profiles. Bioinformatics. 2014;30:3123–4.
Article CAS PubMed PubMed Central Google Scholar
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:1–21.
Article Google Scholar
Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat Methods. 2013;10:1200–2.
Article CAS PubMed PubMed Central Google Scholar
Koh H, Blaser MJ, Li H. A powerful microbiome-based association test and a microbial taxa discovery framework for comprehensive association mapping. Microbiome. 2017;5:45.
Article PubMed PubMed Central Google Scholar
Zhao N, Chen J, Carroll IM, Rinqel-Kulka T, Epstein MP, Zhou H, et al. Testing in microbiome-profiling studies with MiRKAT, the microbiome regression-based kernel association test. Am J Hum Genet. 2015;96:797–807.
Article CAS PubMed PubMed Central Google Scholar
Lozupone CA, Knight R. UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol. 2005;71:8228–35.
Article CAS PubMed PubMed Central Google Scholar
Lozupone CA, Hamady M, Kelley ST, Knight R. Quantitative and qualitative β diversity measures lead to different insights into factors that structure microbial communities. Appl Environ Microbiol. 2007;73:1576–85.
Article CAS PubMed PubMed Central Google Scholar
Chen J, Bittinger K, Charlson ES, Hoffmann C, Lewis J, Wu GD, et al. Associating microbiome composition with environmental covariates using generalized UniFrac distances. Bioinformatics. 2012;28:2106–13.
Article CAS PubMed PubMed Central Google Scholar
Bray JR, Curtis JT. An ordination of upland forest communities of southern Wisconsin. Ecol Monogr. 1957;27:325–49.
Article Google Scholar
Pan W, Kim J, Zhang Y, Shen X, Wei P. A powerful and adaptive association test for rare variants. Genetics. 2014;(4):1081–95.
Han MK, Zhou Y, Murray S, Tayob N, Noth I, Lama VN, et al. Association between lung microbiome and disease progression in IPF: a prospective cohort study. Lancet Respir Med. 2014;2:548–56.
Article PubMed PubMed Central Google Scholar
Jenq RR, Taur Y, Devlin SM, Ponce DM, Goldberg JD, Ahr KF, et al. Intestinal Blautia is associated with reduced death from graft-versus-host disease. Biol Blood Marrow Transplants. 2015;21:1373–83.
Article Google Scholar
Livanos AE, Greiner TU, Vangay P, Pathmasiri W, Stewart D, McRitchie S, et al. Antibiotic-mediated gut microbiome perturbation accelerates development of type 1 diabetes in mice. Nat Microbiol. 2016;1:6140.
Article Google Scholar
Plantinga A, Zhan X, Zhao N, Chen J, Jenq RR, Wu MC, et al. MiRKAT-S: a community-level test of association between the microbiota and survival times. Microbiome. 2017;5:17.
Article PubMed PubMed Central Google Scholar
Ward Jr. JH. Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963;58:236–44.
Article Google Scholar
Cox D. Regression models and life tables (with discussion). J R Stat Soc Series B. 1972;34:187–220.
Google Scholar
Lin X, Cai T, Wu MC, Zhou Q, Liu G, Christiani DC, et al. Kernel machine SNP-set analysis for censored survival outcomes in genome-wide association studies. Genet Epidemiol. 2011;35:620–31.
Article PubMed PubMed Central Google Scholar
Chen H, Lumley T, Brody J, Heard-Costa NL, Fox CS, Cupples LA, et al. Sequence kernel association test for survival traits. Genet Epidemiol. 2014;38:191–7.
Article PubMed PubMed Central Google Scholar
Verweij PJM, Van Houwelingen HC, Stijnen T. A goodness-of-fit test for Cox's proportional hazards model based on martingale residuals. Biometrics. 1998;54:1517–26.
Article Google Scholar
Goeman JJ, Oosting J, Cleton-Jansen AM, Anninga JK, Van Houwelingen HC. Testing association of a pathway with survival using gene expression data. Bioinformatics. 2005;21:1950–7.
Article CAS PubMed Google Scholar
Goeman JJ, Van De Geer SA, Van Houwelingen HC. Testing against a high dimensional alternative. J R Stat Soc Series B. 2006;68:477–93.
Article Google Scholar
Li H, Chen J. Efficient unified rare variant association test by modeling the population genetic distribution in case-control studies. Genet Epidemiol. 2016;40:579–90.
Article PubMed PubMed Central Google Scholar
Efron B. The efficiency of Cox’s likelihood function for censored data. J Am Stat Assoc. 1977;72:557–65.
Article Google Scholar
Morgan XC, Tickle TL, Sokol H, Gevers D, Devaney KL, Ward DV, et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome Biol. 2012;13:R79.
Article CAS PubMed PubMed Central Google Scholar
Chen J, Li H. Kernel methods for regression analysis of microbiome composition data. Topics in applied statistics: 2012 symposium of the international Chinese statistical association. New York: Springer; 1998. p. 191–201.
Google Scholar
Bender R, Augustin T, Blettner M. Generating survival times to simulate cox proportional hazards models. Stat Med. 2005;24:1713–23.
Article PubMed Google Scholar
Reynolds AP, Richard G, De La Iglesia B, Rayward-Smith VJ. Clustering rules: a comparison of partitioning and hierarchical clustering algorithms. J Math Model Algorithms. 2006;5:474–504.
Article Google Scholar
Olszak T, An D, Zeissiq S, Vera MP, Richter J, Franke A, et al. Microbial exposure during early life has persistent effects on natural killer T cell function. Science. 2012;336:489–93.
Article CAS PubMed PubMed Central Google Scholar
Diamond Project Group. Incidence and trends of childhood type 1 diabetes worldwide 1990-1999. Diabetic Med. 2006;23:857–66.
Article Google Scholar
Baron RM, Kenny DA. The moderator-mediator variable distinction in social psychological research: conceptual, strategic and statistical considerations. J Pers Soc Psychol. 1986;51:1173–82.
Article CAS PubMed Google Scholar
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B. 1995;57:289–300.
Google Scholar
Sankaran K, Holmes S. structSSI: simultaneous and selective inference for grouped or hierarchically structured data. J Stat Softw. 2014;59(13)
Aitchison J. The statistical analysis of compositional data. J R Stat Soc B. 1982;44:139–77.
Google Scholar
O’Hara RB, Kotze DJ. Do not log-transform count data. Methods Ecol Evol. 2010;2:118–22.
Article Google Scholar
Tringe SG, Rubin EM. Metagenomics: DNA sequencing of environmental samples. Nat Rev Genet. 2005;6:805–14.
Article CAS PubMed Google Scholar
Churchill GA, Doerge RW. Empirical threshold values for quantitative trait mapping. Genetics. 1994;138:963–71.
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors are grateful to the anonymous reviewers for their insightful observations and comments.

Funding

The work was supported in part by the National Institute of Health (R01 DK090989, R01 DK110014, U01 AI22285), Juvenile Diabetes Research Foundation, Howard Hughes Medical Institute, Diane Belfer Program in Human Microbial Ecology, and C&D Fund. The funding body had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Availability of data and materials

The real microbiome data [23] we used in our real data analysis have been deposited at European Bioinformatics Institute (EBI) database (https://www.ebi.ac.uk) with accession code ERP016357 and Qiita database (https://qiita.ucsd.edu) with access code 10508. The synthetic data we used are described in the above Simulations section.

Our methods are implemented using the R package, OMiSA, which is freely available on the web at https://sites.google.com/site/huilinli09/software or https://github.com/hk1785/OMiSA together with its detailed manual which includes arguments/options, input formats, and outputs with examples. The software and tutorial for GraPhlAn are available at https://bitbucket.org/nsegata/graphlan/wiki/Home and referenced to its original paper [34].

Author information

Authors and Affiliations

Department of Population Health, New York University School of Medicine, 650 First Avenue, Room 547, New York, NY, 10016, USA
Hyunwook Koh & Huilin Li
Department of Medicine, Columbia University Medical Center, New York, NY, 10032, USA
Alexandra E. Livanos
Departments of Medicine and Microbiology, New York University School of Medicine, New York, NY, 10016, USA
Martin J. Blaser
Medical Service, New York Harbor Department of Veterans Affairs Medical Center, New York, NY, 10010, USA
Martin J. Blaser

Authors

Hyunwook Koh
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra E. Livanos
View author publications
You can also search for this author in PubMed Google Scholar
Martin J. Blaser
View author publications
You can also search for this author in PubMed Google Scholar
Huilin Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

HK and HL developed the methodological ideas. HK implemented the methods, performed the simulations and real data analysis, and developed the software package. AEL and MJB provided biological insights and interpretations in the real data analysis and contributed to the acquisition of utilized real microbiome data. HK and HL wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Huilin Li.

Ethics declarations

Ethics approval and consent to participate

Not applicable. All utilized data sets are publicly and freely available which do not require any ethics approval and consent to participate.

Consent for publication

Not applicable. All utilized data sets are publicly and freely available which do not require any consent for publication.

Competing interests

The authors declare that they have no competing interests.

Additional files

Additional file 1:

Computational procedures. (PDF 390 kb)

Additional file 2:

Figure S1. Power estimates for the individual and adaptive tests. The censoring scheme, C_i ~ Unif(0,5), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a large sample size (n = 100) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively, for MiRKAT-S [24]. (A: 10 most abundant OTUs are associated. B: 10 random OTUs are associated. C: 10 least abundant OTUs are associated. D: OTUs in a chosen cluster are associated.). (PDF 10 kb)

Additional file 3:

Figure S2. Power estimates for the individual and adaptive tests. The censoring scheme, C_i ~ Unif(0,5), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(− 1,1) (blue), Unif(− 2,2) (yellow), or Unif(− 3,3) (red), for a large sample size (n = 100) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively, for MiRKAT-S [24]. (A: 10 most abundant OTUs are associated. B: 10 random OTUs are associated. C: 10 least abundant OTUs are associated. D: OTUs in a chosen cluster are associated.). (PDF 10 kb)

Additional file 4:

Figure S3. Power estimates for the individual and adaptive tests. The censoring scheme, C_i ~ Unif(0,10), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively, for MiRKAT-S [24]. (A: 10 most abundant OTUs are associated. B: 10 random OTUs are associated. C: 10 least abundant OTUs are associated. D: OTUs in a chosen cluster are associated.). (PDF 9 kb)

Additional file 5:

Figure S4. Power estimates for the individual and adaptive tests. The censoring scheme, C_i ~ Unif(0,10), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively, for MiRKAT-S [24]. (A: 10 most abundant OTUs are associated. B: 10 random OTUs are associated. C: 10 least abundant OTUs are associated. D: OTUs in a chosen cluster are associated.). (PDF 9 kb)

Additional file 6:

Figure S5. Power estimates for the individual and adaptive tests. The censoring scheme, C_i ~ Unif(0,5), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively, for MiRKAT-S [24]. (A: 10 most abundant OTUs are associated. B: 10 random OTUs are associated. C: 10 least abundant OTUs are associated. D: OTUs in a chosen cluster are associated.). (PDF 9 kb)

Additional file 7:

Figure S6. Power estimates for the individual and adaptive tests. The censoring scheme, C_i ~ Unif(0,5), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(− 1,1) (blue), Unif(− 2,2) (yellow), or Unif(− 3,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively, for MiRKAT-S [24]. (A: 10 most abundant OTUs are associated. B: 10 random OTUs are associated. C: 10 least abundant OTUs are associated. D: OTUs in a chosen cluster are associated.). (PDF 9 kb)

Additional file 8:

Figure S7. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via analytic p-value calculation). The censoring scheme, C_i ~ Unif(0,10), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a large sample size (n = 100) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 9:

Figure S8. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via analytic p-value calculation). The censoring scheme, C_i ~ Unif(0,10), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(− 1,1) (blue), Unif(− 2,2) (yellow), or Unif(− 3,3) (red), for a large sample size (n = 100) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 10:

Figure S9. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via analytic p-value calculation). The censoring scheme, C_i ~ Unif(0,5), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a large sample size (n = 100) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 11:

Figure S10. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via analytic p-value calculation). The censoring scheme, C_i ~ Unif(0,5), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(− 1,1) (blue), Unif(− 2,2) (yellow), or Unif(− 3,3) (red), for a large sample size (n = 100) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 12:

Figure S11. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via permutation). The censoring scheme, C_i ~ Unif(0,10), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 13:

Figure S12. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via permutation). The censoring scheme, C_i ~ Unif(0,10), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(− 1,1) (blue), Unif(− 2,2) (yellow), or Unif(− 3,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 14:

Figure S13. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via permutation). The censoring scheme, C_i ~ Unif(0,5), and the same effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(0,1) (blue), Unif(0,2) (yellow), or Unif(0,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Additional file 15:

Figure S14. Power estimates for individual MiRKAT-S tests through different software facilities, OMiSA and MiRKATS (via permutation). The censoring scheme, C_i ~ Unif(0,5), and the mixed effect directions, where β_j ∈ Λ is a vector of the elements sampled from Unif(− 1,1) (blue), Unif(− 2,2) (yellow), or Unif(− 3,3) (red), for a small sample size (n = 50) were surveyed. K_U, K_0.5, K_W, and K_BC, indicates the use of unweighted UniFrac, generalized UniFrac with ϴ = 0.5, weighted UniFrac, and the Bray-Curtis dissimilarity kernels, respectively. (PDF 6 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Koh, H., Livanos, A.E., Blaser, M.J. et al. A highly adaptive microbiome-based association test for survival traits. BMC Genomics 19, 210 (2018). https://0-doi-org.brum.beds.ac.uk/10.1186/s12864-018-4599-8

Download citation

Received: 09 August 2017
Accepted: 13 March 2018
Published: 20 March 2018
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12864-018-4599-8

A highly adaptive microbiome-based association test for survival traits

Abstract

Background

Results

Conclusions

Background

Methods

Models and notations

MiSALN

MiRKAT-S

OMiSA

Assessment of higher-level taxa

A graphical representation

Results

Simulations

Simulation design

Simulation results

Type I error

Power

Software comparison

Real data analysis

Discussion

Conclusions

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us