Metagenome-Prioritized Phage Cocktails for Crohn's Disease and IBD
CompletedResearch Question
Which bacterial pathobionts are enriched, ubiquitous, and non-protective in IBD, UC, and Crohn's disease patients — considered both across indications and within distinct patient subgroups defined by demographics, severity, native-microbiome structure, and treatment history — and of those pathobionts, which are tractable phage-therapy targets given the available (or characterizable) phages, their host range, the evolutionary escape routes their target strains have available, and the ecological consequences of removing them?
Three coupled deliverables:
- Patient stratification: a reproducible ecotype framework trained on public cohorts that each UC Davis patient can be assigned to, with ecotype-specific pathobiont signatures.
- Pathobiont target atlas: a scored list of candidate targets per ecotype (and per UC-Davis patient), ranked against an explicit biological / phage-availability / ecological-durability rubric.
- Per-patient cocktail drafts: for each of the ~21 unique UC Davis patients, a proposed phage cocktail with candidate phages, strain-coverage evidence, Tier-B/C flags, and confidence notes.
Overview
Five analytical pillars:
- Patient stratification — train reproducible ecotypes on curatedMetagenomicData + HMP2 (DMM / topic modeling / MOFA), project UC Davis onto them.
- Pathobiont identification — compositional-aware within-ecotype differential abundance, scored against the Tier-A criteria rubric (prevalence, mechanism, ecotype-coherence, engraftment evidence, protective-analog exclusion, BGC-encoded inflammatory mediator).
- Functional drivers — pathway, metabolite, and BGC / CB-ORF enrichment mapped to pathobiont contributors within ecotypes.
- Phage targetability — coverage matrix against PhageFoundry + external phage databases, CRISPR-Cas spacer analysis, phage-resistance fitness cost inference.
- UC Davis deep dive — per-patient ecotype assignment, pathobiont dossier, phage-cocktail draft, longitudinal within-patient stability.
A four-tier criteria rubric (Tier A — biological target suitability, Tier B — phage availability, Tier C — ecological durability, Tier D — clinical translation) gates candidates as they move from hypothesis to proposal. Tier D (PK, manufacturability, regulatory) is flagged as experimental follow-up, not analyzed in this project.
Methodological norm — verify where we can: pre-computed reference tables (ref_consensus_severity_indicators, ref_cd_vs_hc_differential, ref_kumbhari_*, ref_viromics_*) are treated as starting points. Where they are load-bearing for a claim, we re-run the underlying computation against the raw fact tables (compositional-aware DA, strain deconvolution, etc.) to distinguish "observed here" from "inherited from reference table." The C. scindens paradox is the canonical example of why.
Key Findings
1. Four reproducible IBD ecotypes with clear disease stratification

Training on 8,489 MetaPhlAn3 samples (fact_taxon_abundance, CMD_HEALTHY + CMD_IBD cohorts) with two independent methods — LDA on pseudo-counts and GMM on CLR + PCA-20 — across K ∈ {2..8}. Per-method fit measures (LDA held-out perplexity, GMM BIC) monotonically decrease with K, as expected for flexible latent-factor models. The discriminating signal is cross-method ARI between LDA and GMM, which has a local maximum at K = 4 (ARI = 0.131) and a second peak at K = 7 (0.140). A parsimony rule — smallest K within 0.02 ARI of the maximum — selects K = 4. Per-sample method agreement at K = 4 is 48.9 %.

The four consensus ecotypes are biologically coherent:
| Ecotype | n | Defining species | Diagnosis pattern |
|---|---|---|---|
| E0 — Diverse commensal | 3,604 | F. prausnitzii 6.8 %, R. bromii 4.5 %, B. uniformis 4.6 %, P. vulgatus 4.4 % | 66.8 % of HC |
| E1 — Bacteroides2 transitional | 2,601 | P. vulgatus 9.8 %, B. uniformis 7.2 %, Phocaeicola dorei 3.5 % | 48 % CD, 58 % UC, 100 % T1D, 97 % T2D, 67 % nonIBD |
| E2 — Prevotella copri enterotype | 920 | P. copri 28 %, F. prausnitzii 6 % | 16.9 % HC, ~0 % disease (non-Western healthy) |
| E3 — Severe Bacteroides-expanded | 1,364 | P. vulgatus 14.2 %, B. fragilis 3.6 % | 50 % CD, 40 % UC, 67 % IBD acute, 38 % CDI, donor 2708 |

This is H1a directionally supported: ≥ 3 reproducible ecotypes. E0 / E1 / E2 / E3 map recognizably onto the original Bacteroides / Prevotella / Ruminococcus enterotype framework (Arumugam 2011, Costea 2018), with E1 / E3 reflecting the Bacteroides2 (Bact2) low-cell-count dysbiosis signature documented in CD by Vandeputte et al. 2017.
(Notebooks: NB01_ecotype_training.ipynb, NB01b_ecotype_refit.ipynb)
2. UC Davis CD patients span three ecotypes, none in E2

All 26 Kuehl_WGS samples (23 unique patients) projected onto the K = 4 reference via the synonymy layer. 262 unique Kaiju-classified species normalized to 97 canonical species in the training feature space. UC Davis distributes:
- E0 — diverse commensal: 7 samples (27 %)
- E1 — Bacteroides2 transitional: 11 samples (42 %)
- E2 — Prevotella copri enterotype: 0 samples
- E3 — severe Bacteroides-expanded: 8 samples (31 %)
χ²(3) vs uniform = 10.0, p = 0.019. The distribution is non-random. UC Davis looks Western (no E2 Prevotella-dominant patients), with active disease dominating (73 % E1 or E3). H1b directionally supported — patients distribute across multiple ecotypes rather than concentrating in one, validating the stratified-targeting premise of the project.

Longitudinal patients: 1112 → E3 at both timepoints; p1/p1reseq → E3 both; p2 → E1 both; 1460 (calprotectin 7,280 μg/g) → E1; patient 6967 flips E1 ↔ E3 between two samples. The 6967 finding is the first direct observation of intra-patient ecosystem instability — relevant to Pillar 5 H5d (dosing-schedule implications).
(Notebook: NB02_ecotype_projection.ipynb)
Methodological aside: Kaiju ↔ MetaPhlAn3 classifier mismatch asymmetry
Projecting Kuehl (Kaiju) onto a MetaPhlAn3-trained embedding exposes an asymmetric robustness between the two ecotype methods. LDA on pseudo-counts is robust: 54 % of Kuehl feature rows outside the training feature space is handled by treating absence as not-detected. GMM on CLR + PCA is fragile: the same sparsity forces all 26 Kuehl samples into a single Gaussian (E3) at confidence > 0.97 — an artifact, not biology. Documented as a project-level finding and committed to docs/discoveries.md. LDA is the primary Kuehl projection call; GMM is advisory.
3. Clinical covariates alone are insufficient for within-IBD ecotype assignment

Two classifiers trained on the pooled CMD cohort (LightGBM) to predict K = 4 consensus ecotype from clinical covariates:
- Minimal — {
is_ibd,sex,age}, n = 8,489: macro OvR AUC = 0.799. - Extended — adds {
hbi_max,sccai_max,calp_max}, n = 1,675 subset: macro AUC = 0.810.
Both exceed the H1c threshold of 0.70. On paper, H1c passes. But applied to UC Davis patients:
- Minimal classifier vs NB02 metagenomic call: 41 % agreement (9/22).
- Extended classifier vs NB02: 36 % agreement (8/22).
- 12 / 22 patients disagree under both classifiers.

The minimal classifier predicts E1 for 19/22 UC Davis patients. In the training cohort, IBD samples split ~58 % E1 / ~40 % E3 / ~2 % E0 / ~0 % E2, so the classifier's dominant learned rule is "is_ibd = 1 → E1." When applied to UC Davis (all-CD, is_ibd constant), this rule collapses to the marginal mode. The extended classifier's training subset is 702 E1 / 959 E3 / 3 E0 / 11 E2 — effectively an E1-vs-E3 binary problem — and severity markers do not separate the two reliably.
H1c revised interpretation: clinical covariates distinguish HC vs IBD trivially (dominated by is_ibd) but do not separate IBD ecotypes (E1 transitional vs E3 severe). For UC Davis-type cohorts, metagenomics remains required for ecotype assignment. The "AUC 0.80 on paper / 41 % patient agreement in practice" gap is itself a methodologically important finding — OvR-AUC on a pooled cohort with a strong cohort-axis feature overstates per-patient classifier usefulness.
(Notebook: NB03_clinical_ecotype_classifier.ipynb)
4. Compositional correction partially, but not fully, resolves the C. scindens paradox

Starting observation: the preliminary project's pooled Mann-Whitney differential-abundance analysis called Clostridium scindens CD-enriched at log₂FC +2.67 — contradicting its established role as a bile-acid-producing protective species (~79 % prevalence in healthy individuals). Three explanations were possible: (1) compositional artifact, (2) strain heterogeneity, (3) ecotype mixing in the pooled analysis.
NB00 tests explanation (1) by running the same curated battery of 8 protective + 6 reported-pathobiont species under both raw Mann-Whitney on relative abundance and CLR-based Mann-Whitney (Gloor 2017, Lin & Peddada 2020). Findings:
- Compositional correction recovers depletion signal for 4+ protective species that raw Mann-Whitney misses: F. prausnitzii, A. muciniphila, R. hominis, L. eligens, A. rectalis flip from "n.s." (raw) to "CD↓" (CLR).
- Roseburia intestinalis shows a sign flip: raw CD↑ → CLR CD↓. The C. scindens artifact pattern reproduced on a second species.
- Top reported pathobionts agree across methods: M. gnavus (R. gnavus), E. bolteae, E. lenta, H. hathewayi all CD-enriched in both.
- C. scindens remains CD-enriched under both methods (raw log₂FC +5.66; CLR Δ +1.25). Compositional correction alone is insufficient; explanations (2) and (3) remain live.
Implication: the norm-N1 decision to re-analyze is justified. Pooled Mann-Whitney on relative abundance is systematically under-sensitive for protective-species depletion, an artifact of compositional bias plus pooled-cohort heterogeneity. C. scindens is not resolvable at this level — the resolution required a design that eliminates study confounding (see §5, NB04c's within-IBD-substudy CD-vs-nonIBD meta). Under that confound-free design C. scindens is CD↑, which is not a paradox but rather the expected behavior of a species that happens to be more prevalent in the IBD-cohort source studies than the healthy-cohort source studies in cMD.
(Notebook: NB00_data_audit.ipynb)
5. Within-ecotype × within-substudy meta-analysis defines ecotype-specific Tier-A (rigor-controlled)
Retraction and rigor repair — what NB04 claimed vs what this section presents
The original NB04 analysis (within-ecotype CD-vs-HC CLR Mann-Whitney, committed 2026-04-24 early, now superseded) made three headline claims, of which two are retracted here:
- ~~H2c — the C. scindens paradox is resolved by within-ecotype stratification.~~ Retracted. Under the confound-free within-IBD-substudy CD-vs-nonIBD contrast, C. scindens is genuinely CD↑ (+1.18 CLR-Δ, FDR 1e-8, 4/4 sign-concordance across sub-studies); under leave-one-species-out refit, C. scindens is CD↑ within both E1 and E3 once it is not part of the clustering input. The NB04 within-ecotype n.s. call was a feature-leakage artifact (clustering samples on taxon abundance then testing the same taxon within cluster is selection-on-outcome confounding). The paradox was not a paradox.
- H2b — target sets differ between ecotypes (Jaccard = 0.14). ~~Interpretation retained; statistic replaced.~~ Jaccard 0.14 was near the random-overlap baseline (~0.10 for top-30 of ~300 filtered species). Under a permutation null (200 random-label permutations, null mean 0.785 ± 0.054), the observed Jaccard of 0.104 has empirical p = 0.000 — H2b survives strongly and the divergence is real, but the claim in NB04 rested on an effect size without a reference distribution, which is the wrong argument structure.
- ~~NB04 Tier-A list (33 species: 18 E1, 15 E3).~~ Retracted. The within-ecotype DA that produced this list was substantially driven by feature leakage (held-out-species sensitivity Jaccard: E1 = 0.230, E3 = 0.064, vs > 0.5 leakage-bounded threshold). When tested against an independent confound-free CD-vs-control contrast, 14 of 18 E1 candidates had negative within-substudy effects — they were ecotype-markers, not CD drivers.
The rigor-repair pipeline that produced the replacement analysis in this section is NB04b (bootstrap CIs + leakage-bound sensitivity + LOO refit + Jaccard permutation null + ecotype stability) → NB04c (confound-free within-IBD-substudy CD-vs-nonIBD meta + LinDA in pure Python) → NB04d (rigor-controlled stopping rule) → NB04e (within-ecotype × within-substudy CD-vs-nonIBD meta). See
FAILURE_ANALYSIS.mdfor the full arc. Two generalizable pitfalls from this repair are indocs/pitfalls.md: cMD substudy-nesting unidentifiability and feature leakage in cluster-stratified DA.
5a. Confound-free design and why it works
curatedMetagenomicData pools samples across ≈ 51 source studies. In the ecotype-assigned slice (8,489 samples), 45 sub-studies have ≥ 10 HC and 5 sub-studies have ≥ 10 CD, but zero sub-studies contain both HC and CD. CMD's healthy-cohort samples come from HC-only studies (LifeLinesDeep_2016, AsnicarF_2021, YachidaS_2019, …); CMD-IBD CD samples come from IBD-cohort studies (HallAB_2017, VilaAV_2018, LiJ_2014, IjazUZ_2017, NielsenHB_2014). A pooled CD-vs-HC LME with substudy random effect is therefore structurally unidentifiable (the random effect perfectly predicts diagnosis) — we verified empirically that statsmodels.mixedlm silently fails to converge on every battery species under this design.
The confound-free contrast that is identifiable is CD-vs-nonIBD within IBD sub-studies — four cMD studies have ≥ 10 CD and ≥ 10 nonIBD (HallAB_2017: 89/73, LiJ_2014: 76/10, IjazUZ_2017: 56/38, NielsenHB_2014: 21/248). Within each sub-study, CLR-Δ is computed on the same samples processed the same way by the same group, with no study-level confound. Inverse-variance weighted meta-analysis across sub-studies combines into a cohort-level effect; stratifying the contrast by ecotype before meta-analysis produces the ecotype-specific effect. Because ecotype and CD-effect are computed on disjoint axes (samples partitioned by ecotype; CLR-Δ computed within each partition across sub-studies), there is no feature-leakage self-reference.
5b. NB04c cohort-level within-substudy CD-vs-nonIBD meta recovers the canonical CD signature
Across the four IBD sub-studies (242 CD / 369 nonIBD pooled), the 14-species curated battery produces the expected canonical CD signature — pathobionts up, protective commensals down — with strong sign concordance:
| Species | Pooled CLR-Δ | FDR | Sign concordance |
|---|---|---|---|
| Mediterraneibacter gnavus | +5.13 | ~0 | 4/4 |
| Eggerthella lenta | +2.30 | 4e-9 | 4/4 |
| Escherichia coli | +1.43 | 2e-4 | 3/4 |
| Clostridium scindens | +1.18 | 1e-8 | 4/4 |
| Enterocloster bolteae | +1.09 | 3e-6 | 4/4 |
| Hungatella hathewayi | +0.92 | 5e-4 | 3/4 |
| Bilophila wadsworthia | +0.07 | n.s. | 3/4 |
| Lachnospira eligens | −1.01 | 4e-3 | 3/4 |
| Roseburia intestinalis | −1.14 | 2e-3 | 4/4 |
| Akkermansia muciniphila | −1.30 | 3e-3 | 3/4 |
| Faecalibacterium prausnitzii | −1.67 | 9e-8 | 4/4 |
| Roseburia hominis | −1.77 | 4e-7 | 4/4 |
| Coprococcus eutactus | −3.09 | 4e-15 | 4/4 |
This flatly contradicts NB04's within-ecotype calls for several species. NB04 called F. prausnitzii / R. hominis / L. eligens CD↑ within both E1 and E3 (the "Simpson's paradox" of the original section 5) — the confound-free analysis shows they are CD↓, consistent with their classical protective-commensal role. NB04's C. scindens "n.s." within both ecotypes is similarly contradicted. The within-ecotype DA in NB04 was producing direction reversals as a compound artifact of feature leakage plus the pooled-cohort substudy × diagnosis confound — both compositional-bias-aware DA methods we tried on the within-ecotype subsets (CLR-MW and LinDA) share the bias, so n_evidence from within-ecotype methods alone does not resolve it.

5c. NB04e ecotype-specific Tier-A under within-ecotype × within-substudy meta
Stratifying the within-substudy CD-vs-nonIBD contrast by ecotype before meta-analysis tests whether the canonical CD signature differs by ecotype and produces ecotype-specific Tier-A lists that are structurally free of leakage. Three (substudy × ecotype) cells meet the ≥ 10 CD AND ≥ 10 nonIBD eligibility bar:
| Ecotype | Substudy | n_CD | n_nonIBD | Status |
|---|---|---|---|---|
| E1 | HallAB_2017 | 67 | 41 | eligible |
| E1 | NielsenHB_2014 | 15 | 239 | eligible |
| E3 | HallAB_2017 | 22 | 31 | eligible |
E0 and E2 are not viable — these are the healthy-cohort ecotypes; no IBD-study nonIBD samples live in them. E1 is meta-viable across two sub-studies; E3 is single-study-only (HallAB_2017).
E1 Tier-A (meta-analysis, 51 candidates, all 100 % sign-concordant across sub-studies, FDR < 0.10, pooled CLR-Δ > 0.5):
| Rank | Species | CLR-Δ | FDR |
|---|---|---|---|
| 1 | Mediterraneibacter gnavus | +4.85 | 2e-12 |
| 2 | Streptococcus salivarius | +3.26 | 2e-9 |
| 3 | Streptococcus thermophilus | +2.69 | 4e-6 |
| 4 | Erysipelatoclostridium innocuum | +2.65 | 4e-7 |
| 5 | Streptococcus parasanguinis | +2.44 | 2e-6 |
| 6 | Enterocloster asparagiformis | +2.41 | 2e-9 |
| 7 | Intestinibacter bartlettii | +2.36 | 3e-5 |
| 8 | Hungatella symbiosa | +2.23 | 1e-5 |
| 9 | Gordonibacter pamelaeae | +2.18 | 2e-5 |
| 10 | Erysipelatoclostridium ramosum | +2.16 | 4e-6 |
| … 41 additional candidates |
Full list: data/nb04e_within_ecotype_meta.tsv.
Biological coherence of the E1 Tier-A. The rigor-controlled E1 list organizes into three biologically coherent groups that the retracted NB04 list did not:
- Oral-derived streptococci as ectopic colonizers (ranks 2, 3, 5) — S. salivarius, S. thermophilus, S. parasanguinis are canonical oral-cavity species; their enrichment in CD stool is consistent with the "oral-gut axis" ectopic-colonization literature for IBD (Xiang 2024, PMID 39188957; Guo 2024, PMID 38545880; Tanwar 2023, PMID 37645044) and with the specific finding that S. salivarius is a salivary biomarker for orofacial granulomatosis co-occurring with CD (Goel 2019, PMID 30796823). Caveat: S. thermophilus appears in anti-inflammatory multi-strain probiotic formulations (Biagioli 2020, PMID 32629887) — its CD↑ signal at the species level does not automatically imply pathobiont status; strain-level evidence (A3 literature, A5 engraftment) is the NB05 disambiguation step.
- Vancomycin-resistant pathobionts (rank 4) — Erysipelatoclostridium innocuum is specifically documented as a vancomycin-resistant pathobiome in IBD with clinically consequential phenotypes (creeping-fat formation and intestinal strictures in CD; reduced UC remission rates); FMT is under evaluation as an intervention (Le 2025, PMID 40074633). Of the E1 Tier-A, E. innocuum is the candidate with the strongest stand-alone clinical-association evidence independent of this project's data.
- Clostridiales-expansion pathobionts (ranks 6–10) — Enterocloster asparagiformis, Intestinibacter bartlettii, Hungatella symbiosa, Erysipelatoclostridium ramosum, and the related clostridial reclassifications (E. bolteae, E. clostridioformis, E. citroniae in the lower-ranked list) are species in the Lachnospiraceae / Erysipelotrichaceae expansion characteristic of the Bacteroides-2 dysbiosis subtype (Vandeputte 2017) that defines E1. Their appearance in E1 Tier-A reflects the underlying ecology rather than being an ecotype-marker artifact — the within-substudy meta-analysis controls for the selection-on-outcome pattern that produced the original NB04 E1 list.
- Polyphenol-metabolism taxa as ambiguous CD-associated (rank 9) — Gordonibacter pamelaeae produces urolithins from dietary ellagitannins (Selma 2014, PMID 24744017) and increases during microbiome recovery from dysbiosis-inducing insult (Tierney 2023, PMID 36840551), which makes its CD↑ signal in E1 difficult to interpret as pathobiont activity. Flag for Tier-A-A4 protective-analog exclusion in NB05.
E3 Tier-A (provisional, 40 candidates, single-study HallAB_2017): top candidates H. symbiosa (+4.64), M. gnavus (+4.46), B. coccoides (+4.22), R. faecis (+4.14), C. spiroforme (+4.11), S. salivarius (+4.04), E. innocuum (+3.68). F. plautii replicates at +2.26 (FDR 0.02). Blautia wexlerae does not replicate (+0.25, FDR 0.80) — the NB04d "rock-solid E3 triad" that included B. wexlerae relied on NB04c's cohort-level within-substudy evidence rather than the E3-restricted within-substudy evidence; under the stricter E3 × HallAB_2017 test, B. wexlerae is removed from the rock-solid set.
The E3 list should be treated as provisional until a second cMD-IBD sub-study that populates E3 with ≥ 10 CD and ≥ 10 nonIBD samples becomes available (candidate: HMP2 once PENDING_HMP2_RAW is resolved).
5d. Classical pathobiont reality check (revised)
Under the confound-free design, the classical engraftment pathobionts from donor 2708 → P1 → P2 are unambiguously CD-enriched — the opposite of what NB04's within-ecotype analysis suggested:
| Species | Within-substudy CD-vs-nonIBD | NB04 within-ecotype (retracted) |
|---|---|---|
| M. gnavus | +5.13 (FDR 0, 4/4) | E1 CD↓ −2.7, E3 CD↑ +1.6 |
| E. lenta | +2.30 (FDR 4e-9, 4/4) | E1 CD↓ −2.4, E3 CD↓ −0.8 |
| E. coli | +1.43 (FDR 2e-4, 3/4) | E1 CD↓ −1.8, E3 CD↓ −0.9 |
| E. bolteae | +1.09 (FDR 3e-6, 4/4) | E1 CD↓ −1.6, E3 n.s. +0.6 |
| H. hathewayi | +0.92 (FDR 5e-4, 3/4) | E1 CD↓ −1.4, E3 n.s. −0.1 |
| K. oxytoca | below prevalence filter | — |
5 of 6 engraftment pathobionts pass the confound-free CD↑ test. The NB04 within-ecotype "these pathobionts are ecotype-markers, not CD drivers" narrative is retracted — they are CD drivers under any analysis that controls for study confounding; the within-ecotype DA was systematically reversing their sign because the HC samples in each ecotype came from entirely different source studies than the CD samples in the same ecotype.
5e. Stopping rule and NB05 input
NB04d formalized a rigor-controlled stopping rule for NB05:
| Criterion | Threshold | E1 | E3 |
|---|---|---|---|
| 1. Feature-leakage bound (held-out-species Jaccard) | > 0.5 | 0.230 ✗ | 0.064 ✗ |
| 2. Ecotype-specific Tier-A (NB04e) | ≥ 3 | 51 ✓ | 40 ✓ (single-study) |
| 3. Engraftment pathobionts under confound-free contrast | ≥ 3 of 6 | 5/5 tested ✓ (cohort-level) | |
| 4. Ecotype framework internal stability (bootstrap ARI) | > 0.30 | 0.16 ✗ |
Criterion 1 fails in both ecotypes and documents that NB04's original within-ecotype Tier-A cannot be trusted directly — but NB04e's rigor-controlled within-ecotype × within-substudy meta is structurally immune to the leakage (clustering axis and DA axis are disjoint). Criterion 4 fails and documents that the ecotype framework is internally marginally stable (ARI 0.13–0.17 across 5 × 80 % subsamples); the framework is usable for downstream stratification (NB02 projection is deterministic once fit) but is not externally replicated and must be flagged.
Per-ecotype NB05 verdict:
- E1 — PROCEED with the 51-candidate meta-viable Tier-A. Confound-free, multi-study support, canonical-pathobiont enrichment at the top.
- E3 — PROCEED WITH CAVEAT using the 40-candidate single-study Tier-A. Replication is the first follow-up (HMP2 ingestion is the unblock).
- Cross-ecotype — the 5 engraftment-confirmed pathobionts are cross-ecotype candidates (NB04c §3).
(Notebooks: NB04_within_ecotype_DA.ipynb superseded; NB04b_analytical_rigor_repair.ipynb, NB04c_rigor_repair_completion.ipynb, NB04d_stopping_rule.ipynb, NB04e_option_A_viability.ipynb are the rigor-controlled pipeline.)
5f. Pillar 2 strengthening — LOSO stability, pathway-feature refit, and HMP2 external replication
Three additional analyses (NB04f, NB04g, NB04h) tested the ecotype framework and Tier-A claims against three distinct failure modes: (i) cross-study generalization, (ii) feature-leakage residual, (iii) external-cohort replication. The results are honest: the ecotype framework has real cross-study variance, but the operational Tier-A replicates strongly on an external cohort.
NB04f — Leave-one-substudy-out (LOSO) ecotype stability. For each of the top 8 cMD sub-studies by sample count (LifeLinesDeep_2016, AsnicarF_2021, NielsenHB_2014, VilaAV_2018, LiJ_2014, HallAB_2017, YachidaS_2019, HansenLBS_2018), held out the substudy, refit K=4 LDA on the remaining samples, projected held-out back, Hungarian-aligned, and computed ARI against the NB01b consensus_ecotype on the held-out samples. Mean LOSO ARI = 0.113 (range 0.000–0.282); mean per-sample agreement 55.0 % (range 16.9 %–85.5 %). This fails the > 0.30 "stable" threshold and is more honest than the bootstrap ARI 0.13–0.17 NB04b reported. Interpretation: some sub-studies (LifeLinesDeep ARI 0.21 / agreement 85.5 %, HansenLBS 65 %) align well with the consensus framework; others (AsnicarF 38 %, VilaAV 17 %) do not. The NB01b consensus was LDA+GMM at 48.9 % cross-method agreement, so part of the LOSO gap is the LDA↔consensus disagreement baseline. The within-cMD ecotype framework is cross-study variable; the "four reproducible ecotypes" framing must be qualified accordingly.
NB04g — Pathway-feature ecotype refit (Option B structural test). Refit K=4 LDA on fact_pathway_abundance (3,145 CMD_IBD samples, 2,000 top-variance HUMAnN3 pathways after prevalence + informative filtering) and compared to the taxon-based consensus_ecotype on the same samples. ARI = 0.113; per-sample agreement 50.6 %. Per-ecotype agreement: E1 = 65.3 % (n=1,839), E2 = 47.4 % (n=19), E3 = 30.7 % (n=1,244), E0 = 0 % (n=43; E0 is the healthy-dominant ecotype, rare in the CMD_IBD pathway cohort). E1 taxon-ecotype is substantially recoverable from a disjoint (pathway) feature basis — that's the meaningful result. E3 is less recoverable (30.7 %), which combined with NB04e's single-study E3 evidence supports the provisional E3 Tier-A framing. The ecotype structure is mixed ecological + taxonomic, not purely one or the other. Scope limitation: fact_pathway_abundance is CMD_IBD only; no HC pathway coverage, so the refit is within-disease only.
NB04h — HMP_2019_ibdmdb (HMP2) external replication. Pulled HMP2 MetaPhlAn3 profiles directly from curatedMetagenomicData v3.18 (1,627 samples, 130 subjects, 582 species; 255/335 training-feature overlap after synonymy). HMP2 is explicitly NOT in our CMD_IBD training set (HMP_2019_ibdmdb was absent from the CMD_IBD substudy list), so it is a genuinely held-out cohort with the same MetaPhlAn3 classifier namespace as the training data.
- Ecotype distribution replicates directionally: HMP2 subject-level (n=130) concentrates in E1 (106 subjects, 82 %), E2 (11), E3 (10), E0 (3). Subject-level χ² for ecotype × {CD, UC, nonIBD} = 15.61, p = 0.016 — ecotype stratifies disease in HMP2 at statistical significance. HMP2 skews more toward E1 than cMD does (cMD is more balanced E1+E3); this likely reflects HMP2's recruitment of newly-diagnosed / milder IBD rather than flare-dominated severe disease.
- Projection confidence is high: median max LDA posterior = 0.861; 80.4 % of samples have max posterior > 0.70. No Kaiju↔MetaPhlAn3 fragility (unlike the UC Davis GMM projection) because HMP2 uses the same MetaPhlAn3 pipeline as the training data.
- E1 Tier-A replicates strongly: per-species CD-vs-nonIBD CLR-Δ within HMP2-projected E1 samples (593 CD / 337 nonIBD), cross-referenced against the 51-candidate NB04e E1 Tier-A list. 45 / 51 (88.2 %) are sign-concordant (both CD↑). Top replicators include M. gnavus (HMP2 effect +1.08, FDR 3e-13), E. asparagiformis (+0.89, FDR 1e-21), H. symbiosa (+1.18, FDR 6e-22), E. innocuum (+0.28, FDR 4e-16), E. bolteae (+1.27, FDR 2e-18), E. clostridioformis (+1.04, FDR 2e-21). Only 2 of the top 20 fail: S. thermophilus (sign-discordant — HMP2 E1 effect slightly negative; potentially reflects differential dairy exposure in HMP2 vs HallAB/NielsenHB cohorts) and Bacteroides stercoris (sign-discordant, n.s.).

Synthesis. The three tests collectively upgrade Pillar 2 from "rigor-controlled on a single cohort with marginally-stable ecotype framework" to "rigor-controlled on cMD + externally replicated on HMP2 with honest documentation of cross-study ecotype variance." The operational Tier-A for NB05 is validated; the ecotype-framework-reproducibility caveat is honestly stated but bounded (the framework is cross-study variable but externally usable because projected ecotypes stratify disease and Tier-A replicates at 88 %).
(Notebooks: NB04f_loso_ecotype_stability.ipynb, NB04g_pathway_ecotype_refit.ipynb, NB04h_hmp2_external_replication.ipynb. HMP2 pull via pull_hmp2_metaphlan3.R against curatedMetagenomicData v3.18.)
5g. NB05 Tier-A scoring — prioritized target list for Pillar 4/5
Four criteria (A3–A6 from RESEARCH_PLAN.md §Criteria) applied to the 71 unique rigor-controlled candidates (51 E1 + 40 E3 provisional + 5 cross-ecotype engraftment, dedup'd). Scoring:
- A3 Literature + cohort CD-association (0–5): five independent signals per candidate — NB04c confound-free meta, HMP2 external replication concordance,
ref_cd_vs_hc_differential(Kumbhari reference; log₂FC > 0.5 + FDR < 0.10),ref_species_ibd_associations(UHGG-indexed dxIBD mixed-effects),ref_phage_biology(curated top-tier targets). Distribution: 13 candidates score 0; 22 score 1; 25 score 2; 5 score 3; 5 score 4; 1 scores 5. - A4 Protective-analog exclusion (0 / 1): fails if within-IBD-substudy CD-vs-nonIBD effect (NB04c §3) is negative — protective-analog risk — or if candidate is on the curated-protective-species list (F. prausnitzii, A. muciniphila, R. intestinalis, R. hominis, L. eligens, A. rectalis, C. scindens, C. eutactus, B. adolescentis, B. longum). Three candidates fail: Anaerostipes hadrus (−0.32 confound-free effect), Clostridium scindens (curated protective list), Roseburia faecis (−2.74 effect).
- A5 Engraftment / strain adaptation (0 / 0.5 / 1): 1.0 for the 5 donor-2708-engraftment pathobionts (M. gnavus, E. lenta, E. coli, E. bolteae, H. hathewayi); 0.5 for Kumbhari strain-competition disease-dominance or IBD-adapted-strain gene signal. 3 candidates hit the 0.5 tier (A. hadrus, B. cellulosilyticus, F. plautii).
- A6 BGC inflammatory mediator (0 / 0.5 / 1): BGC count per candidate from
ref_bgc_catalog(synonymy-inverted matching so pre-GTDB names like "Ruminococcus gnavus" match canonical "Mediterraneibacter gnavus"). 1.0 if ≥ 1 BGC contains CD-enriched CB-ORFs (effect > 0.5 + FDR < 0.05 perref_cborf_enrichment). 14 candidates score 1.0, including M. gnavus (39 BGCs, 26 CD-enriched CB-ORFs — the largest count), E. coli (93 BGCs with MIBiG matches to Colibactin / Yersiniabactin / Enterobactin), S. salivarius (98 BGCs with Salivaricin 9 / A / Cochonodin I), Streptococcus parasanguinis (51 BGCs, 11 CD-enriched), Hungatella hathewayi (6 BGCs, 4 CD-enriched).
Total Tier-A score = A3/5 + A4 + A5 + A6 (range 0–4); actionable threshold = 2.5.
6 actionable candidates (top 6 of 71 scored):
| Rank | Species | Ecotype membership | Total | Key evidence |
|---|---|---|---|---|
| 1 | Hungatella hathewayi | E1 | engraftment | 4.0 | all 5 A3 signals pass; 6 BGCs with 4 CD-enriched CB-ORFs |
| 2 | Mediterraneibacter gnavus | E1 | E3_prov | engraftment | 3.8 | 4/5 A3; 39 BGCs with 26 CD-enriched CB-ORFs (inflammatory glucorhamnan mechanism) |
| 3 | Escherichia coli | E1 | engraftment | 3.6 | engraftment + MIBiG match to Colibactin + Yersiniabactin + Enterobactin |
| 4 | Eggerthella lenta | E1 | engraftment | 3.3 | engraftment + 4/5 A3 signals + Kumbhari IBD-adapted-strain gene signal |
| 5 | Flavonifractor plautii | E1 | E3_prov | 3.3 | Kumbhari strain-competition + BGC with CD-enriched CB-ORFs |
| 6 | Enterocloster bolteae | E1 | engraftment | 2.8 | engraftment; no BGC hit in catalog (potential blind spot) |
Tier-B candidates (score 2.2–2.4, sub-threshold): Enterocloster asparagiformis, Streptococcus salivarius, E. citroniae, E. clostridioformis, Blautia coccoides, Veillonella atypica, S. parasanguinis, Actinomyces oris, V. parvula. These have BGC + A4-pass + A3 = 1–2 signals but lack direct engraftment or strain-adaptation evidence; Pillar 4 phage-targetability scoring may promote any of these based on B-tier phage-availability evidence.

(Notebooks: NB05_tier_a_scoring.ipynb + run_nb05.py. The scored TSV data/nb05_tier_a_scored.tsv is the authoritative hand-off to NB06 co-occurrence networks and Pillar 4 phage-target scoring. Note: this notebook was executed via run_nb05.py rather than nbconvert due to an environment-specific numpy.bool serialization issue in the nbconvert notebook-save path; outputs are authoritative and pre-populated in the committed .ipynb.)
5h. NB06 co-occurrence networks per ecotype (H2d test)
Four per-subnet correlation networks built via CLR transform + rank-based Pearson (= Spearman rho), per-edge BH-FDR, thresholded at |rho| > 0.3 AND FDR < 0.05, with Louvain community detection (networkx.community.louvain_communities, edge-weighted by |rho|). Networkx 3.5's built-in Louvain was sufficient; FastSpar / SpiecEasi installation was held back as unnecessary for the H2d question given clear module structure at CLR-Spearman.
| Subnet | n samples | n nodes | n edges | n modules |
|---|---|---|---|---|
| E1_all | 2,601 | 318 | 28,730 | 6 |
| E1_CD | 581 | 255 | 15,354 | 4 |
| E3_all | 1,364 | 296 | 30,453 | 3 |
| E3_CD | 605 | 252 | 19,909 | 7 |
H2d verdict — nominally PARTIAL, biologically SUPPORTED for the pathobiont module:
The raw mean-actionable-per-module is 1.38 on E1_all + E3_all (below the ≥ 2 bar stated in the plan), but the signal is not uniformly distributed across modules. In every subnet, a single module contains 4-5 of the 6 actionable Tier-A candidates:
| Subnet | Pathobiont module | Size | Actionable members |
|---|---|---|---|
| E1_all | module 1 | 84 | E. lenta, E. bolteae, F. plautii, H. hathewayi, M. gnavus |
| E1_CD | module 0 | 75 | (same set) |
| E3_all | module 1 | 76 | E. lenta, E. bolteae, E. coli, H. hathewayi, M. gnavus |
| E3_CD | module 1 | 57 | E. lenta, E. coli, H. hathewayi, M. gnavus |
The remaining modules per subnet are commensal / Prevotella / diverse-healthy communities that naturally contain 0 Tier-A hits by construction. The mean-per-module statistic is diluted by these biologically-irrelevant-to-the-question modules.
Biological interpretation: Tier-A pathobionts form a single ecologically-linked co-occurrence module within CD ecotypes. Multi-target phage cocktails are therefore appropriate for the pathobiont-module members — they co-favour similar conditions (likely bile-acid dysregulation + low-oxygen inflammation) and the ecological coupling suggests a cocktail hitting 3+ of {M. gnavus, E. lenta, E. bolteae, H. hathewayi, E. coli} will have compounding effects.
Ecotype-specific module membership:
- F. plautii is in the main pathobiont module in E1 but in the generalist module in E3. Relevant for Pillar 5 per-patient cocktails: for E1 patients, F. plautii + main-pathobiont co-targeting is ecologically coherent; for E3 patients, F. plautii is less linked and may need a separate phage.
- E. coli is in the pathobiont module in E3 only, not E1. Consistent with AIEC being more characteristic of severe-Bacteroides-expanded E3 than transitional E1.
Module-anchor commensals (top-degree non-Tier-A hubs in the pathobiont modules, useful for Pillar 3 functional-driver anchoring):
- E1_all module 1: Firmicutes bacterium CAG 110, Collinsella massiliensis, Phascolarctobacterium sp CAG 266
- E3_all module 1: Butyricicoccus pullicaecorum, Anaerostipes caccae, Lactococcus lactis
Literature grounding — butyrate producers anchor the pathobiont module despite their anti-inflammatory biology. Butyricicoccus pullicaecorum is extensively studied as a butyrate-producing Clostridial-cluster-IV IBD-probiotic candidate (Geirnaert 2015a, Steppe 2014, Jeraldo 2016), with published safety data and anti-inflammatory short-chain-fatty-acid profile. Anaerostipes caccae is another canonical butyrate producer. Both being top-degree hubs in the E3 pathobiont module — not in a separate healthy-commensal module — is a biologically interesting finding: the ecological niche the pathobionts occupy is shared with butyrate-producing commensals that are CD-depleted in most pooled analyses but co-vary with pathobionts under within-ecotype co-occurrence. This is consistent with a metabolic-partner / cross-feeding interpretation (pathobiont-produced substrates support the butyrate-producing commensal; the commensal's butyrate doesn't suppress the pathobiont in this context) and suggests Pillar 3 should look specifically at cross-feeding metabolite exchange in this module. It also cautions against "preserve butyrate-producers" as a naive phage-targeting goal — these species may actually track with the pathobionts, not against them, in the CD ecological context.

(Notebook: NB06_cooccurrence_networks.ipynb; executed via run_nb06.py with pre-populated outputs in the committed .ipynb — same workaround as NB05 for the nbconvert numpy.bool issue.)
Pillar 2 close-out
With NB06 complete, Pillar 2 is fully closed: rigor-controlled Tier-A (NB04b-e) → externally replicated on HMP2 (NB04h) → scored + prioritized (NB05, 6 actionable of 71) → co-occurrence structure mapped (NB06, single-pathobiont-module finding). The set of scored + module-assigned + hub-ranked Tier-A is the complete input package for Pillar 4 (phage-availability × target) and Pillar 5 (UC Davis per-patient cocktail drafts).
6. Taxonomy synonymy layer is the project's reusable foundation
data/species_synonymy.tsv — 2,417 alias → 1,848 canonical species, grounded in ref_taxonomy_crosswalk NCBI taxid matching with GTDB r214+ genus renames supplemented. This was motivated by a failure mode discovered in NB00: fact_taxon_abundance contains three divergent taxon-name formats between cohorts (CMD_IBD short names, CMD_HEALTHY full MetaPhlAn3 lineage, KUEHL_WGS Kaiju), and a naive pivot splits the same species into multiple zero-overlap rows, producing log₂FC ≈ 28 artifacts.
The synonymy layer was built once in NB01b and is joined against by NB00, NB01, NB02, NB03, and every downstream notebook. Documented as a project-level pitfall (docs/pitfalls.md) and a candidate BERIL convention — any project integrating multi-cohort microbiome data needs this layer, and the pattern (NCBI taxid + GTDB-version-aware rename table) generalizes.
7. NB07a — Pathway DA + H3a v1.7 three-clause falsifiability (Pillar 3 opener)
First Pillar 3 notebook, executed under RESEARCH_PLAN.md v1.7 norms (post-adversarial-review). Per norm N12, primary contrast is within-IBD-substudy CD-vs-nonIBD meta on fact_pathway_abundance (HUMAnN3 MetaCyc, CMD_IBD only). Per norm N15, substudy meta-viability re-verified for the pathway modality: 3 robust (HallAB_2017, IjazUZ_2017, NielsenHB_2014) + 1 boundary (LiJ_2014, nonIBD = 10) — not the "4 meta-viable" framing v1.6 had inherited from NB04e's taxonomic-modality counts. VilaAV_2018 excluded (CD = 216, nonIBD = 0). 575 unstratified MetaCyc pathways → 409 after 10%-prevalence filter.
H3a v1.7 verdict: PARTIALLY SUPPORTED — 2 of 3 clauses pass; clause (b) is structurally degenerate, not a fundamental refutation.
| Clause | Verdict | Detail |
|---|---|---|
| (a) Pathway count under permutation null | PASS | 52 CD-up + 22 CD-down pathways pass FDR < 0.10 with |
| (b) Category coherence under random-allocation null | FAIL (degenerate) | Only 44 / 409 background pathways match the 7 a-priori MetaCyc categories with the v1.7 regex patterns. Only 3 of 52 CD-up passing pathways land in those categories. Test had ~zero power (null also at 100% top-3 concentration). Interpretation below. |
| (c) Pathway-pathobiont attribution under permutation null | PASS | Max |

Top pathway-pathobiont attribution recapitulates known AIEC biology. The top 25 pairs are all Escherichia coli pathways with biological coherence:
| Rank | ρ_meta | Pathway | AIEC mechanistic context |
|---|---|---|---|
| 1 | 0.797 | GLYOXYLATE-BYPASS (glyoxylate cycle) | Fat utilization in fasting/inflamed gut; AIEC adaptation to bile-acid environment |
| 5 | 0.725 | PWY-6803 phosphatidylcholine acyl editing | TMA precursor pathway — links to NB05 H. hathewayi A6 |
| 6 | 0.707 | PWY-7385 1,3-propanediol biosynthesis | eut/pdu pathway — classical AIEC virulence factor (Dogan 2014) |
| 12 | 0.685 | Allantoin degradation to glyoxylate | Purine recycling under inflammation |
| 13 | 0.683 | 2-methylcitrate cycle I | Propionate detox |
| 19 | 0.640 | Heme biosynthesis from glycine | Iron metabolism — ties to NB05 E. coli Yersiniabactin MIBiG match (Dalmasso 2021) |
| 25 | 0.617 | L-arginine degradation II (AST pathway) | AA-decarboxylation theme (one of the 7 a-priori categories) |
Tier-A pair-count distribution: E. coli 105 pairs > 0.4 (76% of total signal); H. hathewayi 16; M. gnavus 8; F. plautii 7; E. lenta 1; E. bolteae 0. E. coli's domination reflects three factors: highly specialized AIEC functional repertoire well-matched to MetaCyc, higher relative-abundance variance, and E. coli genome content being especially well-represented in HUMAnN3 (vs more obligate-anaerobe Tier-A members like E. lenta / E. bolteae). E. bolteae's zero pathway-attribution despite being NB05-actionable signals that its CD-up signal may be at the BGC-level (testable via NB08, executed) or Kumbhari strain-frequency level (NB10a fact_strain_competition) rather than HUMAnN3-pathway level.
Honest interpretation of clause (b) failure. The test was structurally underpowered: with only 3 of 52 CD-up pathways landing in the 7 a-priori categories, the top-3-concentration statistic is trivially 100% under both observed and random-allocation null. The cMD pathway DA at the unstratified MetaCyc level does not preferentially load on the classical IBD-themed categories (bile-acid 7α-dehydroxylation, mucin degradation, sulfidogenesis, TMA/TMAO, eut/pdu, polyamine, AA-decarb). Instead, the CD signal captures broader bacterial-fitness-in-inflamed-gut themes (heme/iron, glyoxylate, fat metabolism, allantoin/purines) that are not in the prior-literature category set as constructed. Two non-mutually-exclusive interpretations:
- Category-set choice. The 7 a-priori categories were drawn from prior pathobiont-mechanism literature; broadening to include "alternative-electron-acceptor metabolism," "iron acquisition," "fat-utilization-in-inflammation" categories may recover the (b) signal.
- Stratified-pathway resolution. The unstratified-pathway level may be dominated by housekeeping pathways shared across many genera. Per-species stratification (NB07b) — does M. gnavus gain bile-acid-deconjugation pathways CD-vs-nonIBD? — is the appropriate next test of H3a (b).
NB07b is therefore well-positioned: stratified-pathway (PWY-XXX|g__species) gives ~ 42K pathway-species combinations to query directly. Combined with the (c) attribution signal already strong here, H3a (b) is best resolved as a follow-up question, not as a current refutation.
Output artifacts:
- data/nb07a_pathway_meta.tsv — 409 pathways × meta statistics
- data/nb07a_pathway_pathobiont_pairs.tsv — 2,454 pathway × Tier-A-core species pairs with per-substudy + meta ρ
- data/nb07a_h3a_verdict.json — formal H3a v1.7 verdict with all permutation-null statistics
(Notebook: NB07a_pathway_DA_H3a_falsifiability.ipynb; executed via run_nb07a.py per the established nbconvert numpy.bool workaround.)
8. NB07b — Stratified-pathway DA per Tier-A-core species (H3a (b) species-resolved re-test)
NB07a clause-(b) failed because only 3 of 52 CD-up unstratified MetaCyc pathways landed in the 7 a-priori IBD categories — test was structurally underpowered. NB07b tests the alternative: at the species-resolved (HUMAnN3 stratified-pathway form PWY-XXX|g__species), do per-species CD-up pathways concentrate in those categories?
H3a (b) verdict at species-resolved level: NOT SUPPORTED for all 6 Tier-A core species — but the verdict is structural (the 7 a-priori categories are sparse in per-species pathway repertoires), not biological.
| Species | CD-up | CD-down | H3a (b) test | Reason |
|---|---|---|---|---|
| Hungatella hathewayi | 16 | 17 | underpowered | Only 1 of 16 CD-up pathways in 7-cat set |
| Mediterraneibacter gnavus | 0 | 0 | untestable | Background only 4 pathways in 7-cat; no CD-up |
| Escherichia coli | 2 | 18 | underpowered | 0 of 2 CD-up in 7-cat |
| Eggerthella lenta | 0 | 0 | untestable | Background 3 pathways in 7-cat |
| Flavonifractor plautii | 0 | 2 | underpowered | 0 of 0 CD-up in 7-cat |
| Enterocloster bolteae | 1 | 3 | untestable | Background 3 pathways in 7-cat |
Combined NB07a + NB07b H3a (b) conclusion: the 7 a-priori IBD-mechanism categories (bile-acid, mucin, sulfide, TMAO, eut/pdu, polyamine, AA-decarb) are too narrow to capture HUMAnN3 MetaCyc CD signal at either unstratified or species-resolved level. CD-associated pathways are dominated by biosynthesis / niche-shift signals not the prior-literature themes. H3a (b) refutation is real for the v1.7-stated category set but does NOT mean "no compositional themes exist" — it means a different category schema (broader, programmatically derived from MetaCyc taxonomy) is needed.

But NB07b reveals biology NB07a couldn't see at the per-species level.
Hungatella hathewayi (33 passing pathways) — coherent within-carrier biosynthesis-vs-degradation shift CD-vs-nonIBD:
| Direction | Top pathway | Effect | FDR |
|---|---|---|---|
| CD-up | Pentose phosphate pathway | +1.18 | 1.6e-5 |
| CD-up | Glycolysis IV (plant cytosol) | +0.94 | 1.0e-3 |
| CD-up | Chorismate biosynthesis I | +0.86 | 1.6e-6 |
| CD-up | Purine nucleobases degradation | +0.89 | 2.6e-4 |
| CD-down | Pyrimidine deoxyribonucleosides salvage | −1.51 | 2.4e-7 |
| CD-down | 1,3-Propanediol biosynthesis | −1.27 | 4.3e-6 |
| CD-down | Lactose / galactose degradation | −1.18 | 7.0e-5 |
| CD-down | O-antigen biosynthesis (GDP-mannose) | −1.07 | 3.4e-4 |
Interpretation: under CD, H. hathewayi shifts to a biosynthetic / catabolic-stress state (pentose phosphate, anabolic AA biosynthesis, purine salvage) and away from sugar-utilization (lactose, galactose, propanediol, inositol). The propanediol-degradation CD-down within H. hathewayi is consistent with niche partitioning — E. coli may dominate this niche under CD (consistent with NB07a's E. coli propanediol-biosynthesis CD-up at cohort level).
Escherichia coli (20 passing) shows a within-carrier CD-DOWN per-pathway abundance — opposite of the cohort-level CD-up direction:
| Direction | Top pathway | Effect | FDR |
|---|---|---|---|
| CD-down | CMP-legionaminate biosynthesis | −0.95 | 3.0e-5 |
| CD-down | L-1,2-propanediol degradation | −0.81 | 2.9e-4 |
| CD-down | Allantoin degradation to glyoxylate | −0.81 | 1.8e-6 |
| CD-down | Phospholipid remodeling (PE) | −0.84 | 7.3e-6 |
| CD-down | Octane oxidation | −0.72 | 2.4e-5 |
| CD-down | L-histidine degradation I | −0.62 | 2.8e-3 |
This is not contradictory to NB07a's cohort-level E. coli CD-up:
- At cohort level: E. coli relative abundance is CD-up, so total pathway flux scales up
- Within carriers: per-cell pathway repertoire is CD-down — E. coli in CD samples show less metabolic versatility per cell
Two non-mutually-exclusive interpretations:
1. CD's E. coli are an AIEC-specialized subset that has shed peripheral metabolic capabilities (consistent with NB05 §5g Yersiniabactin / Colibactin / Dubinsky 2022 IBD-specific lineage finding)
2. CD's E. coli face metabolic competition from co-abundant Klebsiella / other Enterobacteriaceae, so per-cell read-mapping share is lower across pathways
Both align with the AIEC narrative. Deeper strain-level analysis on cMD raw reads is dropped per plan v1.9 (no-raw-reads scope); the no-raw-reads alternative is genome-content survey on kbase_genomes + Kumbhari fact_strain_competition (Future Direction #9).
Other 4 Tier-A core species (M. gnavus, E. lenta, F. plautii, E. bolteae) show small within-carrier shifts (≤ 4 passing pathways each). Their CD signal is dominated by carriage prevalence, not within-carrier metabolic shift. Implication: pathway-level mechanism is not the right resolution for these species; the no-raw-reads alternatives are BGC-level (NB08, executed) and Kumbhari strain-frequency (NB10a) for the species in fact_strain_competition.
Output artifacts:
- data/nb07b_stratified_pathway_da.tsv — 621 species-pathway × meta rows
- data/nb07b_h3a_b_species_verdict.json — per-species formal verdict
- figures/NB07b_stratified_H3a_b.png — 4-panel diagnostic
(Notebook: NB07b_stratified_pathway_DA.ipynb; executed via run_nb07b.py.)
9. NB07 v1.8 H3a (b) retest — MetaCyc class hierarchy + Fisher per-theme enrichment
The v1.7 H3a (b) FAIL verdict (NB07a §6-8 + NB07b §4-5) was driven by regex-on-pathway-names limitations: only 44/409 background pathways matched the 7 a-priori category patterns; pathways like PWY-5920 (superpathway of heme biosynthesis from glycine) were silently categorized as "0_other" because "heme" / "iron" were not in the regex set. v1.8 replaces the regex approach with structured MetaCyc class assignments from /global_share/KBaseUtilities/ModelSEEDDatabase/Biochemistry/Aliases/Provenance/MetaCyc_Pathways.tbl and expands to 12 IBD-relevant themes including iron/heme acquisition, fat metabolism / glyoxylate, anaerobic respiration, purine/pyrimidine recycling, and aromatic AA / chorismate / indole.
Per plan norm N17 (added in v1.8): prefer ontology / class hierarchy over name-pattern regex for pathway / gene / metabolite categorization wherever feasible. Regex remains a sensitivity check.
v1.8 cohort-level H3a (b) verdict: SUPPORTED. Per-theme Fisher's exact enrichment (CD-up × in-theme) on the NB07a passing pathways across 12 IBD themes; BH-FDR across themes; theme supported if FDR < 0.10 AND odds ratio > 1.5:
| Theme | Background | CD-up | Expected | OR | p | FDR | Supported |
|---|---|---|---|---|---|---|---|
| 08_iron_heme_acquisition | 32 / 409 | 15 / 52 (29 %) | 4.07 | 8.11 | 5.8e-7 | 7.4e-6 | ✓ |
| 09_anaerobic_respiration | 50 | 8 | 6.36 | 1.36 | 0.29 | 1.0 | — |
| 02_mucin_glycan_host | 83 | 11 | 10.55 | 1.06 | 0.50 | 1.0 | — |
| 12_aromatic_AA_chorismate_indole | 22 | 3 | 2.80 | 1.09 | 0.55 | 1.0 | — |
| 10_fat_metabolism_glyoxylate | 111 | 13 | 14.11 | 0.88 | 0.70 | 1.0 | — |
| 07_AA_decarboxylation | 62 | 7 | 7.88 | 0.85 | 0.71 | 1.0 | — |
| 11_purine_pyrimidine_recycling | 78 | 7 | 9.92 | 0.63 | 0.91 | 1.0 | — |
| 06_polyamine_urea | 90 | 6 | 11.44 | 0.42 | 0.99 | 1.0 | — |
Iron/heme acquisition is the dominant CD-up biochemical theme (8.1× over background; 29 % of CD-up pathways vs 8 % expected). The 15 CD-up pathways in this theme include heme biosynthesis (PWY-5920, ρ = 0.64 with E. coli in NB07a §c), heme degradation, and siderophore-related pathways.
This completely reverses the v1.7 H3a (b) "FAIL" verdict — driven entirely by category-schema choice, not by any change in the underlying data. The v1.7 regex on pathway descriptive names matched "iron" / "heme" only via "PWY-5920: superpathway of heme biosynthesis from glycine" → "0_other"; the v1.8 ModelSEED class hierarchy correctly assigns PWY-5920 to HEME-SYN, Heme-b-Biosynthesis, Cofactor-Biosynthesis, Tetrapyrrole-Biosynthesis.
v1.8 species-level H3a (b): H. hathewayi has two themes supported (16 CD-up pathways):
- 11_purine_pyrimidine_recycling: OR=4.86, FDR=0.048 (7/16 CD-up pathways)
- 04_TMA_choline: OR=9.33, FDR=0.048 (4/16 CD-up pathways)
Other Tier-A core species have insufficient species-level CD-up pathway counts for per-theme power (M. gnavus, E. lenta, F. plautii have 0 CD-up; E. coli, E. bolteae have 1-2).

Four-way convergence on iron biology as the dominant CD pathobiont specialization in this dataset:
- NB05 §5g — E. coli MIBiG matches: Yersiniabactin + Enterobactin (both iron siderophores) + Colibactin (same pks pathogenicity island)
- NB07a §c — top pathway-pathobiont attribution: heme biosynthesis ↔ E. coli (ρ = 0.640)
- NB07 v1.8 H3a (b) — iron/heme is the dominant CD-up theme (OR = 8.11, FDR = 7e-6)
- AIEC literature — Dalmasso 2021 (yersiniabactin), Prudent 2021 (LF82 IBC formation via yersiniabactin), Dogan 2014 (AIEC iron-pathway enrichment) all flag iron acquisition as central AIEC fitness mechanism
This is a robust, multi-line-of-evidence-supported claim about CD pathobiont biology that emerges only after the v1.8 schema fix. The v1.7 "FAIL — degenerate" verdict was masking a 8.1-fold iron-theme enrichment.
v1.8 also reveals H. hathewayi-specific themes: purine/pyrimidine recycling and TMA/choline metabolism. H. hathewayi's NB07b CD-up pathways included pentose phosphate (precursor for nucleotide biosynthesis), purine nucleobases degradation, and PWY-6803 phosphatidylcholine acyl editing — exactly the building-block pathways of the two enriched themes. H. hathewayi is a known TMA producer (CutC/D activity) and the choline-metabolism enrichment is mechanistically coherent.
Methodological lesson (added to docs/discoveries.md): v1.7 → v1.8 is a major scientific reversal driven entirely by category-schema choice. v1.7 "no compositional themes" (FAIL) → v1.8 "iron/heme is the dominant theme" (SUPPORTED, OR 8.1, FDR 7e-6). Same data, same DA. The lesson: regex-on-pathway-names is a poor substitute for curator-validated ontology / class hierarchy when one is available; ModelSEEDDatabase ships a usable MetaCyc class hierarchy with 90 % coverage of HUMAnN3 outputs and should be the default for any pathway category-enrichment test in BERIL projects.
Output artifacts:
- data/nb07_h3a_v18_pathway_classes.tsv — pathway × MetaCyc-classes × IBD-themes (audit trail)
- data/nb07_h3a_v18_cohort_enrichment.tsv — per-theme Fisher enrichment cohort-level
- data/nb07_h3a_v18_species_enrichment.tsv — per-species per-theme Fisher enrichment
- data/nb07_h3a_v18_verdict.json — formal v1.8 H3a (b) verdict
- figures/NB07_H3a_v18_class_enrichment.png — visualization
(Script: run_nb07_h3a_v18.py. Builds on NB07a + NB07b pathway-DA outputs; adds class-based theme enrichment per plan v1.8.)
10. NB07c — Module-anchor commensal × pathobiont metabolic coupling (H3a-new)
NB06 surfaced the H2d co-occurrence finding that 4–5 of 6 actionable Tier-A pathobionts co-cluster in a single CD-specific module per ecotype. The follow-up question — and the H3a-new test specified in plan v1.7 X4 fix — is whether the non-pathobiont anchors of these CD-specific modules show metabolic coupling with the pathobionts they sit beside. Two CD-specific modules from data/nb06_module_hubs.tsv provide the substrate:
- E1_CD module 0 (75 nodes): anchor commensals Clostridiales bacterium 1_7_47FAA, Anaerostipes caccae (the only genuine butyrate-producer in module-anchor commensals), Bacteroides nordii; with the 5 actionable Tier-A pathobionts of the E1_CD subnet (H. hathewayi, F. plautii, E. bolteae, E. lenta, M. gnavus).
- E3_CD module 1 (57 nodes): anchor commensals Actinomyces sp. oral-taxon-181, Actinomyces sp. HMSC035G02 (both oral cavity ectopic colonizers), Lactonifactor longoviformis (lactate utilizer); with 4 module pathobionts (E. lenta, H. hathewayi, E. coli, M. gnavus).
Per (anchor, pathobiont) pair: within-IBD-substudy Spearman ρ across CMD_IBD samples; Fisher z-meta across the 3 robust substudies (ZellerG_2014, NielsenHB_2014, IjazUZ_2017); sign concordance. Plus an iron-context layer: triple correlation (anchor × pathobiont × iron-pathway) over the 15 v1.8 iron/heme pathways, to test whether the v1.8 iron-theme is a community-wide signature or pathobiont-specific.
E1_CD coupling — A. caccae shows clean strong-positive coupling with all 5 module pathobionts (sign concordance 1.0 across all 3 substudies):
| Pair | ρ_meta | Interpretation |
|---|---|---|
| A. caccae × E. bolteae | +0.39 | Strongest pair |
| A. caccae × H. hathewayi | +0.33 | |
| A. caccae × M. gnavus | +0.31 | |
| A. caccae × F. plautii | +0.29 | |
| A. caccae × E. lenta | +0.08 | Weakest |
| B. nordii × M. gnavus | −0.21 | Niche competition (negative) |
| B. nordii × F. plautii | −0.20 | Niche competition (negative) |
The A. caccae × pathobiont pattern is consistent with butyrate-producer cross-feeding embedded in the CD pathobiont module — pathobiont-released substrates (M. gnavus glucorhamnan / mucin sugars; F. plautii bile-acid metabolites; H. hathewayi lactate) feed A. caccae's butyrogenic fermentation. The §2 iron-pathway layer rules against an iron-cross-feeding mechanism: ρ(A. caccae × iron-pwy) = +0.13 (mean over 15 iron pathways), much weaker than ρ(E. coli × iron-pwy) = +0.45. So the A. caccae coupling is not iron-mediated — most likely substrate / sugar / lactate mediated. Cross-feeding vs shared-environment disambiguation is deferred to NB09c (metabolite corroboration).
The B. nordii negative coupling with M. gnavus / F. plautii is consistent with niche competition for similar polysaccharide substrates — B. nordii is a generalist Bacteroides that competes with the same mucin / glycan-degrading pathobionts, and CD selects one over the other.
E3_CD coupling — only Lactonifactor × E. lenta is strong (ρ_meta = +0.27). The two oral Actinomyces anchors couple weakly (~ρ=0.17–0.19 with M. gnavus / E. lenta). Oral Actinomyces in the gut are co-trafficked ectopic colonizers under CD inflammation rather than metabolic partners; their NB06 module membership reflects shared inflammation-driven colonization, not metabolic coupling.
Iron-pathway co-variation concentrates on E. coli — narrowing the v1.8 iron-theme interpretation:
| Pathobiont | Mean ρ × 15 iron-pathways |
|---|---|
| E. coli | +0.45 |
| H. hathewayi | +0.20 |
| F. plautii | +0.17 |
| M. gnavus | +0.16 |
| E. bolteae | +0.09 |
| E. lenta | +0.03 |
The 15 iron pathways include ENTBACSYN-PWY (Enterobactin biosynthesis, E. coli-canonical), HEMESYN2-PWY (heme biosynthesis II), and 8 menaquinol-biosynthesis pathways. E. coli's ρ = +0.45 across 2,674 CMD_IBD samples means iron-pathway abundance scales proportionally with E. coli abundance — i.e., the iron-pathway signal is dominantly carried by E. coli. Other Tier-A core species show weak iron-pathway coupling, consistent with them having other CD specialization mechanisms (TMA / choline for H. hathewayi per v1.8 §9; bile-acid 7α-dehydroxylation for F. plautii; glucorhamnan / mucin for M. gnavus).
This narrows the v1.8 iron-theme interpretation: rather than "all CD pathobionts have iron specialization," the more accurate framing is "CD's E. coli (AIEC subset) drives the iron-acquisition theme; other Tier-A pathobionts have non-iron specializations." This is mechanistically coherent with NB05 §5g (only E. coli of the actionable Tier-A had iron-siderophore MIBiG matches: Yersiniabactin + Enterobactin) and the NB07b within-carrier E. coli CD-DOWN per-pathway pattern (AIEC strain-level specialization at the cost of generalist metabolic capabilities).

H3a-new verdict: PARTIALLY SUPPORTED. A. caccae × pathobiont coupling is clean in E1_CD (4 strong-positive pairs, all sign-concordant); E3_CD anchors lack metabolic-coupling structure (oral-gut co-trafficking dominates). Cross-feeding vs shared-environment disambiguation deferred to NB09c (metabolite-level test).
Pillar 4 cocktail-design implication — metabolic-coupling cost. The A. caccae × pathobiont coupling has a direct clinical implication for cocktail design: a phage cocktail that depletes M. gnavus / F. plautii / H. hathewayi / E. bolteae may incidentally reduce A. caccae abundance through loss of substrate. A. caccae is the only genuine butyrate-producer in the module-anchor commensals, and butyrate is anti-inflammatory; its incidental depletion could partially offset the cocktail's therapeutic benefit. NB05 actionable Tier-A targets need a "metabolic-coupling cost" annotation before cocktail finalization — for each candidate target, what beneficial commensals depend on it as a substrate source, and what is the predicted net inflammatory balance? This is the species-pair-level extension of the H2d concern surfaced in NB06 (single-pathobiont-module → cocktail-design implication).
Output artifacts:
- data/nb07c_anchor_pathobiont_species_rho.tsv — 27 (anchor × pathobiont) pairs with ρ_meta, per-substudy values, sign concordance
- data/nb07c_anchor_pathobiont_iron_triple.tsv — 405 (anchor × pathobiont × iron-pathway) triples
- data/nb07c_h3a_new_verdict.json — formal verdict
- figures/NB07c_anchor_pathobiont_coupling.png — 2-panel heatmap
(Script: run_nb07c.py. Builds on NB06 module hubs + NB07_v1.8 iron-pathway list.)
11. NB08a — BGC × pathobiont enrichment (H3c) — genomic mechanism layer
NB05 §5g qualitatively flagged E. coli MIBiG matches Yersiniabactin + Enterobactin + Colibactin. NB07 v1.8 §9 found iron/heme acquisition is the dominant CD-up MetaCyc pathway-class theme (OR=8.1, FDR 7e-6). NB07c §10 found iron-pathway co-variation concentrates on E. coli at the sample-correlation level. NB08a is the genomic-content level test: do Tier-A pathobiont genomes (per ref_bgc_catalog, Elmassry 2025; 10,060 BGCs across 6,221 species-annotated entries) carry an over-represented iron-siderophore / genotoxin biosynthetic gene-cluster signature, and is this signature uniformly distributed across actionable Tier-A or concentrated on E. coli?
Test 1 — BGC-theme enrichment (Fisher's exact, Tier-A core BGCs in theme vs background BGCs in theme; BH-FDR across themes; 4 IBD-relevant themes covering iron-siderophore MIBiG matches, genotoxin/microcin MIBiG matches, RiPP-bacteriocin classes, and NRPS-PKS-hybrid classes):
| Theme | Tier-A core | Background | Fisher OR | FDR | Supported |
|---|---|---|---|---|---|
| iron_siderophore | 54 / 286 | 51 / 9,774 | 44.4 | 6.5e-56 | ✓ |
| genotoxin_microcin | 25 / 286 | 4 / 9,774 | 234.0 | 3.3e-35 | ✓ |
| NRPS_PKS_hybrid | 32 / 286 | 739 / 9,774 | 1.54 | 0.042 | ✓ |
| bacteriocin_RiPP | 154 / 286 | 5,217 / 9,774 | 1.02 | 0.90 | — (background-rate) |
The iron_siderophore Fisher OR of 44.4 is one of the largest enrichments in the project. It complements v1.8 §9's pathway-class iron OR of 8.1 (different evidence stream — pathway-cohort co-occurrence vs genomic gene-cluster content) and converges on the same biology: iron acquisition is a CD pathobiont-defining genomic capability, not just a pathway-level cohort signal.
Test 2 — Per-Tier-A-core species iron + genotoxin MIBiG breakdown:
| Tier-A core | n_BGCs | iron MIBiG | genotoxin MIBiG |
|---|---|---|---|
| E. coli | 146 | 54 | 25 |
| E. lenta | 41 | 0 | 0 |
| M. gnavus | 58 | 0 | 0 |
| E. bolteae | 18 | 0 | 0 |
| H. hathewayi | 13 | 0 | 0 |
| F. plautii | 10 | 0 | 0 |
E. coli alone carries the iron+genotoxin BGC signature within actionable Tier-A core. Its 54 iron BGCs comprise 19 Yersiniabactin, 16 Enterobactin, plus 19 BGCs of class=siderophore (some redundant per-strain assemblies); the 25 genotoxin BGCs comprise 8 Colibactin, 15 Microcin B17, and 2 Microcin J25. The other 5 Tier-A core species sit in MIBiG dark matter — they carry substantial BGC content (E. lenta 41, M. gnavus 58, E. bolteae 18) but no MIBiG-annotated iron or genotoxin clusters. This is the genomic-level confirmation of the v1.8 + NB07c narrowing: iron biology in CD pathobionts is essentially an E. coli (AIEC) phenomenon. Other Tier-A pathobionts have non-iron CD-association mechanisms (consistent with v1.8 §9: H. hathewayi purine + TMA/choline themes).
Test 3 — CB-ORF CD-vs-HC enrichment per Tier-A core (read-level; from ref_cborf_enrichment 5,157 CB-ORFs):
| Species | CB-ORFs matched | CD-up at FDR<0.10 | CD-down | Mean effect |
|---|---|---|---|---|
| E. bolteae | 11 | 9 (82 %) | 0 | +2.81 |
| F. plautii | 5 | 2 (40 %) | 0 | +0.77 |
| E. coli | 51 | 14 (27 %) | 0 | +1.56 |
| M. gnavus | 19 | 5 (26 %) | 0 | +1.43 |
| H. hathewayi | 7 | 1 (14 %) | 0 | +0.52 |
| E. lenta | 13 | 0 | 7 (54 %) | -0.28 |
| Background catalog | 5,052 | 2.5 % | — | -0.18 |
5 of 6 Tier-A core species have CD-up CB-ORF rates above the 2.5 % catalog background (range 14–82 %); E. lenta is the exception, with CB-ORFs CD-DOWN at 54 %. This complements the BGC-MIBiG analysis: even where MIBiG annotations are absent (the 5 dark-matter Tier-A core), per-CB-ORF CD-vs-HC RPKM is independently elevated in CD samples. The E. lenta CD-DOWN pattern is consistent with NB07b's species-resolved finding that E. lenta per-pathway abundance is mostly carriage-prevalence-driven (not within-carrier abundance-shifted) and aligns with the canonical Eggerthella CD-association mechanism being drug-metabolism (cardiac glycoside inactivation, Koppel et al. 2018) rather than BGC-encoded inflammatory mediators.
Test 4 — ebf/ecf cohort meta CD-vs-HC (per ref_ebf_ecf_prevalence, 1,349 samples × 4 cohorts; Mann-Whitney CD-vs-HC per cohort + Stouffer's z-meta):
| Compound | n_cohorts | meta z | meta p |
|---|---|---|---|
| RPKM (ebf) | 4 | 11.71 | 1.1e-31 |
| RPKM (ecf) | 4 | 11.97 | 5.1e-33 |
All 4 cohorts (HMP2-IBDMDB, MetaHIT, LLDEEP-NLIBD, PRISM) show CD > HC for both ebf and ecf, with cliff-deltas 0.17–0.73. The Elmassry 2025 immunoactive fatty acid amide BGC family CD-up finding replicates cleanly in our cohort-meta design at p < 1e-31 — the largest single effect in the project so far.

H3c verdict — PARTIALLY SUPPORTED. The hypothesis ("BGC-encoded inflammatory mediators localize to a minority of Tier-A pathobionts and show CD-enrichment beyond what species-level abundance captures"):
- Localization to minority: confirmed — iron+genotoxin BGCs are uniquely E. coli within actionable Tier-A core (1 of 6).
- Beyond species-abundance: partially confirmed — CB-ORF CD-vs-HC is independently CD-up enriched (read-level, not species-derived) for 5 of 6 Tier-A core; ebf/ecf is independently CD-up at p<1e-31 across 4 cohorts. However, the strict species × BGC interaction-term test specified in the original H3c is untested (would require species-stratified per-sample BGC abundance, not in the current pre-computed mart slice).
Five-line iron-acquisition convergence narrative: NB05 §5g (per-actionable MIBiG lookup) → NB07a §c (pathway × pathobiont attribution, ρ=0.640 heme↔E.coli) → NB07 v1.8 §9 (cohort pathway-class enrichment, OR=8.1) → NB07c §2 (sample-level co-variation, ρ=0.45 E.coli×iron-pwy) → NB08a §2 (genomic BGC content, OR=44.4 driven by E.coli's 54 iron BGCs) — five independent evidence streams converging on AIEC iron-acquisition as a central CD-pathobiont specialization mechanism.
Pillar 4 cocktail-design implication (sharpened): phage cocktail design should distinguish:
- E. coli component: target AIEC subset specifically — Yersiniabactin/Enterobactin/Colibactin-positive strains. Per plan v1.9, AIEC strain-resolution from cMD raw reads is dropped; the no-raw-reads alternative is kbase_genomes pks + iron-BGC genome-content query (Future Direction #9);
- M. gnavus, E. lenta, E. bolteae, H. hathewayi, F. plautii: BGC-mechanism dark — design relies on Tier-A scoring + NB06 module membership + NB07b within-carrier metabolic signature, not on BGC presence;
- ebf/ecf RPKM as a sample-level CD biomarker (no per-species attribution available) — usable for treatment-response monitoring, not for cocktail target selection.
Output artifacts:
- data/nb08a_tier_a_bgc_repertoire.tsv — per-species BGC repertoire summary
- data/nb08a_bgc_theme_enrichment.tsv — 4 themes × Fisher OR + FDR (3 supported)
- data/nb08a_tier_a_iron_genotoxin_per_species.tsv — per-species iron + genotoxin MIBiG breakdown
- data/nb08a_cborf_enrichment_per_tier_a.tsv — per-species CB-ORF CD-up rate vs background
- data/nb08a_ebf_ecf_cd_vs_hc.tsv — ebf/ecf cohort meta z-stats
- data/nb08a_h3c_verdict.json — formal H3c verdict
- figures/NB08a_bgc_pathobiont_enrichment.png — 3-panel summary
(Script: run_nb08a.py. Builds on NB05 actionable Tier-A; ref_bgc_catalog + ref_cborf_enrichment + ref_ebf_ecf_prevalence per RESEARCH_PLAN.md NB08a spec.)
12. NB09a — HMP2 metabolomics CD-vs-nonIBD (H3d-DA)
NB09a is the first metabolomics analysis in the project. It tests whether the metabolite axes that distinguish CD from nonIBD in HMP2 are coherent with the iron / TMA / fat / bile-acid / fatty-acid-amide themes already established by the NB07–NB08a pathway and BGC analyses. Per plan v1.9 (raw-read deprecation), no raw-FASTQ reprocessing — uses precomputed HMP2 metabolomics from the mart + cMD R-package HMP2 sample metadata for diagnosis labeling. 468 of 546 metabolomics samples (86 %) match cMD HMP2 metagenomics samples directly by sample-ID code (CSM*); the matched set covers 106 subjects (50 CD + 30 UC + 26 nonIBD) with diagnosis. Subject-level analysis uses one sample per subject (first-occurrence visit) to avoid within-subject correlation; UC excluded from primary contrast.
Per-metabolite Mann-Whitney + cliff_delta + BH-FDR on 592 named (HMDB-annotated) HMP2 metabolites (out of 81,867 total, mostly unnamed peaks). 579 testable (≥5 samples in each group). Passing threshold: FDR < 0.10 + |cliff_delta| > 0.20 → 52 metabolites pass (50 CD-up, 2 CD-down).
Per-theme Fisher's exact across 11 IBD-relevant chemical-class themes (BH-FDR across themes; supported = FDR < 0.10 AND OR > 1.5):
| Theme | n_in_theme | n_CD-up_in_theme | OR | FDR | Supported |
|---|---|---|---|---|---|
| polyamines | 9 | 5 | 14.6 | 0.008 | ✓ |
| long_chain_PUFA | 15 | 6 | 7.9 | 0.009 | ✓ |
| acyl_carnitines | 22 | 2 | 1.06 | 1.0 | — |
| bile_acids | 21 | 3 | 0.56 | 1.0 | — (3 of 3 are tauro/taurine forms — see below) |
| lipid_classes | 163 | 19 | 1.64 | 0.76 | — (19 CD-up CEs/SMs/ceramides; theme dilute) |
| short_chain_fatty_acids | 14 | 0 | — | 1.0 | — (LC-MS undersamples SCFAs) |
| tma_choline | 29 | 2 | 0.78 | 1.0 | — |
| urobilin_porphyrin | 5 | 1 (CD-DOWN) | 143 | 0.13 | — (n=1 too small) |
| tryptophan_indole / fatty_acid_amides / aromatic_AA / oxidative | small | 0 | — | 1.0 | — |
Polyamines CD-up (5 of 9): putrescine (cliff=+0.45, FDR=0.08), N1-acetylspermine (+0.56, 0.06), N-acetylputrescine (+0.46, 0.06), anserine (+0.43, 0.08), diacetylspermine (+0.43, 0.08). Established IBD biomarker class (reviewed Pegg 2014; reported as CD biomarker by Wang 2018, Franzosa 2019). Mechanistically interesting: the v1.8 §9 pathway-level 06_polyamine_urea theme was CD-DOWN at pathway-level (OR=0.42) but CD-UP at metabolite-level here (OR=14.6) — this is mechanistically coherent: polyamine metabolite-pool accumulation can result from increased catabolism of dietary protein / mucin + reduced microbial polyamine clearance without requiring elevated biosynthesis flux. The metabolite-pool readout is the clinically-actionable observation; the pathway-level signal reflects production capacity.
Long-chain PUFAs CD-up (6 of 15): adrenate (C22:4, +0.55, FDR=0.027), arachidonate (C20:4, +0.48, 0.05), docosapentaenoate (C22:5, +0.45, 0.06), docosahexaenoate (C22:6, +0.41, 0.08), eicosapentaenoate (C20:5, +0.40, 0.09). Covers both n-6 (adrenate, arachidonate — eicosanoid precursors) and n-3 (DHA, DPA, EPA) classes. The CD-up signal can reflect (a) impaired host fatty-acid uptake / β-oxidation in inflamed mucosa, (b) increased dietary fat mobilization, (c) reduced microbial PUFA biohydrogenation by Lactobacillus / Roseburia spp. leading to free-PUFA pool accumulation. Mechanistically connected to v1.8 §9 10_fat_metabolism_glyoxylate theme (theme not Fisher-significant at pathway level — OR=0.88 — but the metabolite-pool elevation IS theme-significant here).
Bile acids — only 3 of 21 CD-up but the 3 are tauro-conjugated (free taurine, tauro-α-muricholate, tauro-β-muricholate). Free taurine (the conjugating amino acid) is CD-up at cliff=+0.47. Tauro-α/β-muricholate is CD-up at +0.40. Consistent with reduced microbial bile-acid 7α-dehydroxylation in CD — primary tauro-conjugated BAs accumulate when F. plautii / C. scindens / Eggerthellaceae dehydroxylation activity is impaired (canonical Franzosa 2019 finding). Corroborates NB05 actionable F. plautii's mechanistic role in 7α-dehydroxylation.
Acyl-carnitines C16 + C18:1 CD-up (2 of 22): C16 carnitine (+0.41, FDR=0.08), C18:1 carnitine (+0.39, 0.09). Long-chain fatty-acid β-oxidation intermediates. Mechanistically connected to v1.8 §9 10_fat_metabolism theme + H. hathewayi TMA/choline (carnitine sits in the same metabolic neighborhood as choline → TMA → fatty-acid β-oxidation). Theme-level enrichment OR=1.06 not significant because the 22-carnitine background is dominated by short/medium-chain forms that don't shift in CD.
Lipid classes (CE / SM / TAG / DAG / ceramide) — 19 CD-up but theme-level OR=1.64 (FDR=0.76, not significant). Pattern is informative: dominated by cholesteryl-esters of long-chain PUFAs (12 CEs: C20:4, C20:5, C18:2, C18:3, C16:0, C18:1, C16:1; 4 SMs: C14:0, C16:0, C16:1, C24:1; 1 ceramide: C24:1; 2 TAGs). The CE-PUFA elevation directly mirrors the free-PUFA elevation (CEs are the storage form of esterified PUFAs). Theme-level Fisher doesn't pass because the 163-metabolite background is too dilute (most lipid classes don't shift, just the long-chain-PUFA-conjugated ones).
SCFAs: 0 of 14 CD-DA. None of acetate, propionate, butyrate, valerate, hexanoate reach FDR<0.10. Either (a) HMP2 LC-MS untargeted methods undersample SCFAs (volatile, polar — typically need GC-MS), (b) the cohort-level subject-averaged contrast masks within-subject variation, or (c) gross SCFA pool differences between CD and nonIBD are smaller than within-subject variability. This is an important null for the NB07c cross-feeding hypothesis — butyrate isn't differentially abundant CD-vs-nonIBD at subject-level in HMP2, so the A. caccae × pathobiont cross-feeding hypothesis cannot be confirmed via cohort-level butyrate DA. The NB09c sample-level paired test (does butyrate co-vary with A. caccae × pathobiont co-occurrence within paired metabolomics+metagenomics samples) remains the right test and is the natural follow-up.
Urobilin CD-DOWN (cliff=-0.38, FDR=0.09). Urobilin is the gut-bacterial catabolic product of bilirubin (produced by Clostridium / Bacteroides species expressing bilirubin reductase). CD-DOWN urobilin = reduced gut-bacterial bilirubin reduction = consistent with dysbiosis / loss of urobilinoid-producing commensals (Hall 2024; Vital 2018).

Convergence summary table — NB09a metabolomics × NB07-pillar pathway findings:
| Mechanism axis | Pathway level | Metabolite level (NB09a) | Verdict |
|---|---|---|---|
| Iron / heme | v1.8 §9 OR=8.1 (E. coli) | not measured (LC-MS undersamples siderophores) | corroborated by NB08a §11 BGC; metabolomics neutral |
| Bile-acid 7α-dehydroxylation (F. plautii) | NB07b F. plautii F420-BA pathways CD-up | tauro-α/β-muricholate + taurine CD-up | CONFIRMED at metabolite level |
| TMA / choline | v1.8 §9 H. hathewayi OR=9.3 | C16/C18:1 carnitines CD-up; TMAO not theme-sig | partially corroborated |
| Fat metabolism / glyoxylate | v1.8 §9 OR=0.88 (NS) | long-chain PUFAs OR=7.9 (theme-sig) | NEW at metabolite level |
| Polyamine pool | v1.8 §9 OR=0.42 (CD-DOWN at pathway) | polyamines OR=14.6 (CD-UP at metabolite) | NEW at metabolite level (pool ≠ flux) |
| SCFA cross-feeding (NB07c) | NB07c A. caccae × pathobiont +0.39 | butyrate / acetate / propionate not DA | null at cohort level; NB09c pending |
H3d-DA verdict: SUPPORTED. 52 metabolites pass DA + 2 themes pass Fisher enrichment. Both falsifiability gates met. NB09a adds two new mechanism axes to the project narrative (polyamines + long-chain PUFAs) that complement the iron + bile-acid + TMA findings from NB07–NB08a.
Pillar 4/5 implications:
- Polyamine and PUFA elevations are sample-level CD biomarkers — usable for clinical follow-up (treatment response monitoring) but not per-species cocktail-design targets (no per-species attribution available).
- Tauro-muricholate elevation confirms the mechanistic premise for F. plautii targeting (the bile-acid 7α-dehydroxylation deficit is real at the metabolite-pool level, not just predicted from pathway DA).
- The SCFA null at cohort level sharpens the NB07c "metabolic-coupling-cost" annotation — depleting A. caccae may not produce a measurable cohort-level butyrate change, but the within-sample paired test (NB09c) remains the right test for the cross-feeding causal claim.
Output artifacts:
- data/nb09a_metab_da_cd_vs_nonibd.tsv — 579 named metabolites × Mann-Whitney + cliff + FDR + theme assignment
- data/nb09a_metab_theme_enrichment.tsv — 11 themes × CD-up/CD-down × Fisher OR + FDR
- data/nb09a_h3d_da_verdict.json — formal H3d-DA verdict (SUPPORTED)
- figures/NB09a_metabolomics_cd_vs_nonibd.png — 3-panel summary
(Script: run_nb09a.py. Subject-level analysis on cMD-fetched HMP2 sample-to-subject metadata + mart fact_metabolomics; per plan v1.9 no-raw-reads constraint.)
13. NB09c — Sample-level paired metabolomics × metagenomics: NB07c cross-feeding disambiguation + bile-acid 7α-dehydroxylation network
NB07c §10 left a key disambiguation deferred: the A. caccae × pathobiont species-level coupling at +0.39 (E. bolteae), +0.33 (H. hathewayi), +0.31 (M. gnavus), +0.29 (F. plautii) was consistent with either butyrogenic cross-feeding or shared-environment co-response to the same CD niche. The proposed disambiguation: paired sample-level metabolomic-metagenomic correlation could surface candidate intermediate metabolites (lactate, mucin-glycan products, bile-acid metabolites) shared between anchor and pathobiont. Per plan v1.9 (no-raw-reads scope), NB09c executes this paired analysis using 468 paired CSM* HMP2 samples that have both metabolomics and cMD MetaPhlAn3 metagenomics.
Cross-feeding-triangle test (strict criteria: anchor × pathobiont same sign + both |ρ|>0.20 + both FDR<0.10): only 7 triangles across 8 species × 583 named HMP2 metabolites. Top candidates: caffeine (B. nordii / F. plautii), linoleoylethanolamide (B. nordii / E. lenta; A. caccae / E. lenta), urobilin (A. caccae / E. lenta; B. nordii / E. lenta), cholate (A. caccae / F. plautii; B. nordii / F. plautii). None of the 7 candidates is a butyrogenic cross-feeding intermediate; the pattern is consistent with shared-niche health-direction co-occurrence — A. caccae, B. nordii, F. plautii, E. lenta are all (mostly) commensal species that together correlate negatively with primary BA cholate and positively with urobilin (NB09a CD-DOWN), i.e. co-occur in healthy / normobiotic samples vs CD-dysbiotic samples.
Curated cross-feeding panel — direction-of-association profile:
| Metabolite | A. caccae | B. nordii | H. hath | F. plautii | E. bolteae | E. lenta | M. gnavus | E. coli |
|---|---|---|---|---|---|---|---|---|
| butyrate | +0.10* | +0.13* | -0.08 | -0.02 | -0.04 | -0.05 | -0.02 | +0.04 |
| lactate | +0.18* | +0.10* | +0.03 | -0.23* | -0.20* | -0.07 | +0.10 | +0.24* |
| choline | +0.04 | +0.01 | -0.07 | -0.16* | -0.15* | -0.13* | +0.11* | +0.25* |
| carnitine | -0.04 | -0.02 | -0.01 | -0.18* | -0.06 | -0.04 | +0.04 | +0.19* |
| cadaverine (E. coli specialty) | — | — | — | — | — | — | — | +0.45* |
| putrescine | -0.04 | -0.09 | -0.07 | -0.02 | -0.15* | -0.11* | +0.13* | +0.19* |
| tryptophan | -0.09 | -0.10* | -0.10 | -0.13* | -0.10 | -0.22* | +0.14* | +0.25* |
Three findings emerge:
-
Butyrate × A. caccae = +0.10 is significant but very weak. Consistent with A. caccae as a butyrate producer at the population level, but the cohort-level signal is weak — partly methodological (HMP2 LC-MS undersamples SCFAs; only butyrate and propionate detected) and partly because gut butyrate pool is dominated by other butyrate producers (Faecalibacterium, Roseburia*).
-
Lactate × A. caccae = +0.18* vs lactate × F. plautii = −0.23* and × E. bolteae = −0.20* — opposite signs. If lactate were a cross-feeding intermediate (pathobiont produces, A. caccae consumes), we would expect same-sign positive (both increase together) or asymmetric same-sign (production-consumption coupling). The opposite-sign pattern is the strongest evidence against the cross-feeding hypothesis — A. caccae and the metabolic-coupling-candidate pathobionts occupy different metabolic niches at the cohort level.
-
E. coli dominates the cohort-level correlation signal: choline +0.25, carnitine +0.19, tryptophan +0.25, putrescine +0.19, cadaverine +0.45 (the strongest correlation in the entire panel). These match canonical E. coli / Enterobacteriaceae metabolism: lysine decarboxylase → cadaverine; arginine decarboxylase → putrescine; choline + carnitine substrates. The v1.8 §9 H. hathewayi TMA/choline finding from cMD pathway-level analysis does NOT strongly replicate at HMP2 sample level — H. hathewayi × choline ρ=−0.07 (NS); E. coli dominates. This narrows v1.8 §9 — the TMA/choline signal is a combined Enterobacteriaceae + Lachnospiraceae signal, not specifically H. hathewayi. Possible reasons: HMP2 has lower H. hathewayi prevalence (25 % of paired samples) than the cMD studies that drove the v1.8 finding; at the sample level, E. coli's high prevalence (50 %) and abundance variance dominate the cohort-correlation signal.
Bile-acid 7α-dehydroxylation network — independent strong finding:
| BA Class | A. caccae | B. nordii | H. hath | F. plautii | E. bolteae | E. lenta | M. gnavus | E. coli |
|---|---|---|---|---|---|---|---|---|
| Tauro-α/β-muricholate (1° tauro) | -0.03 | -0.15* | 0.00 | -0.04 | -0.04 | -0.11* | +0.20* | +0.06 |
| Cholate (1° unconj.) | -0.13* | -0.18* | -0.10 | -0.26* | -0.10 | -0.13* | +0.11* | -0.02 |
| Deoxycholate (2° from cholate) | -0.07 | +0.05 | -0.12 | +0.06 | +0.17* | +0.03 | -0.10* | -0.05 |
| Lithocholate (2° from CDCA) | -0.05 | +0.06 | -0.12 | +0.15* | +0.18* | +0.07 | -0.20* | -0.13* |
| Ketodeoxycholate (2° oxidized) | -0.12* | -0.15* | -0.01 | -0.19* | -0.05 | -0.14* | +0.14* | +0.24* |
F. plautii, E. lenta, and E. bolteae — the canonical bile-acid 7α-dehydroxylating bacteria — show the predicted substrate-product signature: negative correlation with primary tauro-conjugated bile acids (substrates) and positive correlation with secondary unconjugated bile acids (products: deoxycholate, lithocholate). This is the direct sample-level confirmation of the bile-acid 7α-dehydroxylation network operating in HMP2 samples. By contrast, M. gnavus and E. coli show the OPPOSITE pattern: positive with primary tauro-BAs, negative with secondary BAs — they are not in the 7α-dehydroxylation network.

NB07c verdict reframed: shared-environment co-occurrence
The cross-feeding hypothesis (a) is not supported by sample-level paired evidence. The shared-environment hypothesis (b) is the more parsimonious explanation for A. caccae × pathobiont species-level coupling. NB09c reframes NB07c: the cocktail-design implication originally stated as "depleting pathobionts may incidentally reduce A. caccae" (NB07c §10) is less load-bearing than originally framed — the substrate-product relationship is not detectable at sample level, so phage targeting of pathobionts is unlikely to substantially deplete A. caccae through metabolic coupling.
Bile-acid coupling cost — primary Pillar 4 cocktail-design annotation
Replace "metabolic-coupling cost" (NB07c §10) with bile-acid coupling cost as the primary per-target ecological-cost annotation for the NB05 actionable Tier-A set:
- F. plautii targeting: highest cost. Strong negative correlation with primary cholate (−0.26) AND positive correlation with secondary lithocholate (+0.15) → 7α-dehydroxylation activity. Phage depletion shifts BA pool toward primary inflammatory forms.
- E. bolteae targeting: moderate cost. Positive with deoxycholate (+0.17) and lithocholate (+0.18) → secondary 7α-dehydroxylation contributor.
- E. lenta targeting: moderate cost. Negative with primary cholate (−0.13) and ketodeoxycholate (−0.14) — partial 7α-dehydroxylation pattern, consistent with Eggerthellaceae bile-acid metabolism activity (Devlin & Fischbach 2015 PMID 26412091).
- H. hathewayi / M. gnavus / E. coli targeting: low BA-coupling cost — these species are not in the 7α-dehydroxylation network in this dataset.
Six-line cross-corroboration extended to bile-acid biology
The bile-acid 7α-dehydroxylation network finding is independently corroborated across project pillars:
- NB07b — F. plautii within-carrier CD-up pathways include F420-dependent bile-acid metabolism pathways
- NB09a — tauro-α/β-muricholate + free taurine CD-up at subject level (consistent with reduced microbial 7α-dehydroxylation in CD)
- NB09c — F. plautii / E. lenta / E. bolteae show the direct substrate-product signature in paired sample-level data (negative with primary tauro-BAs, positive with secondary unconjugated BAs)
- NB05 F. plautii and E. lenta are actionable Tier-A pathobionts
- Literature (Franzosa 2019, Devlin & Fischbach 2015) — bile-acid 7α-dehydroxylation deficit is a canonical IBD finding
This is a second six-line cross-corroboration narrative alongside the iron-acquisition narrative (Novel Contribution #14): bile-acid biology is the second mechanism axis with multi-line evidence convergence.
Outputs
data/nb09c_species_metabolite_corr.tsv— 8 species × 583 metabolites paired Spearman ρ + FDRdata/nb09c_cross_feeding_triangles.tsv— 7 strict cross-feeding-triangle candidatesdata/nb09c_cross_feeding_panel.tsv— curated 7-theme panel direction-of-association profiledata/nb09c_cross_feeding_verdict.json— formal verdictfigures/NB09c_cross_feeding_disambiguation.png— heatmap + triangle scatter
(Script: run_nb09c.py. Builds on NB07c module-anchor × pathobiont species coupling + NB09a metabolite annotations. Per plan v1.9 no-raw-reads.)
14. NB10a — Kumbhari strain-adaptation gene predictor (H3b)
The H3b hypothesis (per plan v1.7): strain-adaptation gene content discriminates disease-dominant from health-dominant strains within Kumbhari, and the discriminating genes are biologically interpretable rather than housekeeping. Per plan v1.7 H3b restriction, the test is restricted to the Kumbhari/LSS-PRISM cohort itself (the only project data with strain-level resolution). Per plan v1.9 (no raw reads), uses precomputed ref_kumbhari_s7_gene_strain_inference (219,121 gene × species rows; 59 species) cross-referenced with fact_strain_competition (15,520 strain × sample rows; 100 UHGG genome IDs in 94 LSS-PRISM participants).
Tier-A core intersection with Kumbhari: only F. plautii is in the 59-species panel. The other 5 actionable Tier-A core (H. hathewayi, M. gnavus, E. lenta, E. bolteae, E. coli) are NOT in Kumbhari — Kumbhari focused on commensal strain heterogeneity, not pathobiont mechanism. The H3b test therefore evaluates the methodology of strain-adaptation gene inference on the 59 commensals; only one actionable Tier-A species is directly tested.
Falsifiability test result: among the 23,579 FDR<0.10 strain-adaptation genes (12,159 IBD-biased + 11,420 health-biased across 59 species), the IBD-biased gene set is significantly:
| Category | IBD-biased % | Background % | Fisher OR | p |
|---|---|---|---|---|
| Adaptation (7 subcategories) | 2.13 % | 1.59 % | 1.38 | 2.4e-6 |
| Housekeeping (6 subcategories) | 2.81 % | 4.36 % | 0.62 | 6.4e-20 |
| Unclassified | 95.06 % | 94.05 % | — | — |
The IBD-biased gene set is 1.38× ENRICHED for adaptation and 0.62× DEPLETED for housekeeping — the canonical SUPPORTED pattern. The strict housekeeping-domination falsifiability bound is rejected at p=6.4e-20.
Adaptation subcategory breakdown (IBD-biased): mucin_glycan utilization (94 genes), two-component signaling (52), antibiotic resistance (35), membrane transport specialty (33), virulence/secretion (15), bile acid (15), oxidative stress response (15). The dominant IBD-adaptation theme is mucin/glycan utilization — consistent with the Kumbhari focus on commensal IBD-niche adaptation.
Cross-species shared IBD-adaptation KOs — top KEGG KOs IBD-biased in 12-21 of 59 species:
| KO | n_species | Function |
|---|---|---|
| K03088 | 21 | Sigma factor (RpoE/RpoS — environmental stress response) |
| K01990 | 20 | ABC transporter ATP-binding protein |
| K06147 | 19 | ABC-2 type transport system |
| K02529 | 18 | LacI/GalR family transcriptional regulator |
| K07240 | 17 | TIM44-like protein |
| K03424 | 17 | TatD-related deoxyribonuclease |
| K08303 | 16 | SmpB (tmRNA-binding, translation rescue) |
| K03466 | 15 | DNA topoisomerase |
The cross-species pattern is dominated by transport, regulation, stress response, and DNA repair — classic niche-adaptation signatures. Top gene symbols include corC (Mg²⁺/Co²⁺ efflux), luxQ (autoinducer-2 quorum sensing), pglA / rfbC (capsule polysaccharide / LPS), bepC (efflux pump), abgB (aminobenzoyl-glutamate utilization) — consistent across 9-10 species each.
F. plautii informative null: zero FDR<0.10 strain-adaptation genes in the Kumbhari analysis (3,245 genes total tested in F. plautii). This is biologically meaningful given F. plautii has confirmed CD-association at species level (NB04e, NB05) and at the bile-acid 7α-dehydroxylation activity level (NB09c §13: ρ × cholate -0.26; ρ × lithocholate +0.15). The interpretation: F. plautii CD-association operates through species-level abundance, not strain-level genomic adaptation. The 7α-dehydroxylation activity is encoded by core bai-operon genes that are presumably present in essentially all F. plautii strains; the CD signal in NB04e + NB07b reflects how much F. plautii (any strain) is present in the sample, not which F. plautii strain is dominant. Independent corroboration of NB07b within-carrier finding (small per-pathway shifts within carriers; CD signal dominated by carriage prevalence).

H3b verdict: SUPPORTED. The Kumbhari strain-adaptation gene-content analysis carries biologically interpretable signal that is statistically distinct from housekeeping artifact. The cross-species shared IBD-adaptation gene set (sigma factor, ABC transport, quorum sensing, capsule LPS, efflux) is consistent with a biologically real "IBD-niche" gene signature operating across multiple commensal species.
Caveats / limitations:
- One cohort only (Kumbhari/LSS-PRISM); cross-cohort sensitivity test was acknowledged-weak per plan v1.7.
- 95 % of FDR-passing genes are unclassified by the gene-symbol regex — the curated 13-category classifier captures only annotated functional roles. The functional enrichment signal is at the category-classified subset, not the full gene set.
- Per plan v1.7, the cross-cohort sensitivity test (NB10b — species-level prevalence-weighted strain-feature on cMD) was specified as sensitivity-only, not load-bearing. Per plan v1.9, the cross-cohort cMD path is dropped (raw reads not available); NB10b is therefore not executed in this project.
- F. plautii CD-association is species-abundance-mediated — implies phage targeting of F. plautii would deplete BA 7α-dehydroxylation activity proportionally to abundance reduction (no strain-level escape route via gene-content selection within F. plautii).
Implications for project narrative:
1. The strain-level analysis methodology is validated on commensal species (H3b SUPPORTED). The actionable Tier-A coverage is single-species (F. plautii only); the analysis is informative for methodology but not for direct pathobiont-mechanism extension.
2. F. plautii CD-association is species-abundance-mediated, confirming NB07b within-carrier reading independently.
3. Cross-species shared IBD-adaptation gene set (sigma factor, ABC transport, quorum sensing, capsule LPS) is a real biological signature — bullet for the Pillar 4 / 5 hand-off discussion of bacterial fitness in the IBD gut.
Outputs
data/nb10a_per_species_bias_counts.tsv— 59 species × {IBD-biased, health-biased} gene countsdata/nb10a_sig_genes_classified.tsv— 23,579 FDR-passing genes × functional classificationdata/nb10a_cross_species_ibd_kos.tsv— multi-species shared KEGG KO signaldata/nb10a_cross_species_ibd_symbols.tsv— multi-species shared gene-symbol signaldata/nb10a_f_plautii_strain_adaptation.tsv— F. plautii deep-dive (empty, no FDR-passing genes)data/nb10a_h3b_verdict.json— formal verdict (SUPPORTED)figures/NB10a_kumbhari_strain_adaptation.png— 3-panel summary
(Script: run_nb10a.py. Per plan v1.7 H3b + v1.9 no-raw-reads.)
15. NB11 — HMP2 serology × Tier-A pathobiont (H3e)
The H3e hypothesis (per plan v1.7): anti-microbial antibody titers correlate with Tier-A pathobiont abundance within IBD patients, n=67 subjects across 3 sites (CCHMC/Harvard/Emory), site as covariate, effect threshold |r|>0.40 calibrated to n=67 power 0.80 at α=0.05. Single-cohort caveat structural per plan v1.7. Per plan v1.9: uses cMD-fetched HMP2 MetaPhlAn3 abundance + mart fact_serology (2,520 measurements; 12 assays). All 67 serology subjects (E-prefix Emory, H-prefix Harvard, M-prefix MGH/CCHMC) match cMD MetaPhlAn3 metagenomics.
Six EU-deduplicated serology axes: ANCA, ASCA, CBir1, IgA-ASCA, IgG-ASCA, OmpC. Cross with 8 species (6 actionable Tier-A core + 2 NB07c module-anchor commensals) → 48 tests; partial Pearson r residualized on site dummy variables; BH-FDR.
Cohort sanity check — canonical IBD-serology patterns hold in n=32 CD + 14 UC + 21 nonIBD subjects:
| Axis | CD mean | UC mean | Canonical pattern | Match |
|---|---|---|---|---|
| ANCA | 17.8 | 42.6 | UC ↑ (pANCA biomarker for UC) | ✓ |
| ASCA | 0.28 | 0.0 | CD ↑ (yeast cell wall) | ✓ |
| CBir1 | 29.0 | 13.7 | CD ↑ (anti-flagellin) | ✓ |
| IgA ASCA | 14.2 | 1.2 | CD ↑ | ✓ |
| IgG ASCA | 15.0 | 3.6 | CD ↑ | ✓ |
| OmpC | 9.3 | 7.6 | CD slight ↑ | partial ✓ |
All 6 axes show canonical IBD direction → data validates standard immunology biology. The H3e test asks the harder question: do antibody titers track individual Tier-A pathobiont abundance within IBD patients?
Top moderate associations (none clear |r|>0.40 + FDR<0.10):
| Pair | partial r | raw ρ | FDR | Site-stratified breakdown |
|---|---|---|---|---|
| ANCA × M. gnavus | +0.31 | +0.24 | 0.40 | Harvard r=+0.456* (p=0.008); CCHMC r=−0.03 NS |
| OmpC × E. lenta | +0.29 | +0.10 | 0.40 | CCHMC r=−0.02 NS; Harvard r=+0.14 NS |
| ANCA × H. hathewayi | +0.23 | +0.16 | 0.59 | Harvard r=+0.345* (p=0.050); CCHMC r=+0.02 NS |
| IgA-ASCA × E. coli | +0.23 | +0.22 | 0.59 | Harvard r=+0.30 (p=0.09); CCHMC r=+0.06 NS |
| CBir1 × E. bolteae | +0.22 | +0.22 | 0.59 | CCHMC r=+0.381* (p=0.035); Harvard r=+0.04 NS |
| ANCA × F. plautii | +0.22 | +0.32 | 0.59 | CCHMC r=+0.443* (p=0.013); Harvard r=+0.25 NS |
Cohort-aggregate signal is dominated by ANCA × M. gnavus (+0.31). Site-stratified breakdown shows most associations are concentrated in 1 of 2 productive sites (Harvard or CCHMC) and dilute when cohort-aggregated. With Harvard 33 + CCHMC 31 + Emory 3 subjects, single-site partial r reaches |r|≈0.40–0.46 individually but the cohort pull toward null prevents the strict |r|>0.40 + FDR<0.10 plan threshold from being met. Top observed: |r|=0.31, FDR=0.40 → strict H3e: NOT SUPPORTED.
Biologically plausible directions (all top pairs are positive r): anti-microbial antibody titers ↑ with target Tier-A species abundance. ANCA × M. gnavus + ANCA × H. hathewayi + ANCA × F. plautii — pANCA antibody (canonically UC-associated) co-elevated with multiple CD-pathobiont species, consistent with high-pathobiont-burden states triggering broader anti-microbial humoral response. CBir1 × E. bolteae mechanistically coherent (CBir1 is anti-bacterial-flagellin; E. bolteae is flagellated). IgA-ASCA × E. coli is the most mucosal-immunity-relevant axis (IgA isotype is the gut-immunity isotype); the +0.23 correlation is consistent with E. coli outer-antigen recognition.

H3e verdict — PARTIAL (NOT SUPPORTED at strict plan threshold). Per plan v1.7 falsifiability bound, no (assay × species) pair clears the |r|>0.40 + FDR<0.10 effect threshold. The PARTIAL framing reflects (a) cohort sanity check passing on canonical IBD-serology patterns, (b) biologically plausible direction of all top pairs, (c) productive sites reaching |r|≈0.40–0.46 individually, and (d) the structural single-cohort caveat acknowledged in plan v1.7.
Implications for Pillar 4–5 cocktail design: serology is not yet a quantitative target-prioritization signal at the per-target level for this cohort. The canonical CD-vs-UC ecotype-stratification axes are intact (ANCA-UC, ASCA-CD, CBir1-CD), so serology can be used as a CD-vs-UC stratifier but not as a per-target abundance predictor — the |r|≈0.31 ceiling at the cohort level means a single subject's anti-M. gnavus ANCA titer cannot be used to confidently predict that subject's M. gnavus abundance. Multi-cohort meta-analysis (out of project scope) is the structural next step.
Limitations:
- n=67 across 3 unevenly-distributed sites is structurally low-power for |r|>0.40 detection at α=0.05 + 48-test multiple-testing correction. Plan v1.7 H3e was acknowledged-weak by design.
- Subject-level aggregation loses longitudinal information (each subject contributed multiple serology + metagenomics visits). Mixed-effects regression would have more power but unbalanced visit counts complicate interpretation.
- OmpC and ANCA cross-react with related antigens beyond their named targets, so per-species attributions are correlative rather than exclusive.
- Single-cohort caveat structural — no project-internal replication path; would need multi-cohort meta to firm the signals up.
Output artifacts:
- data/nb11_serology_species_correlations.tsv — 48 (assay × species) tests with partial Pearson r + raw Spearman ρ + BH-FDR
- data/nb11_serology_site_stratified.tsv — site-stratified breakdown for top 10 pairs
- data/nb11_h3e_verdict.json — formal verdict
- figures/NB11_serology_pathobiont.png — heatmap + top 10 pairs
(Script: run_nb11.py. Per plan v1.7 H3e + v1.9 no-raw-reads.)
16. NB09b — Cross-cohort metabolomics bridge (HMP2 NB09a → FRANZOSA_2019)
NB09a §12 found that polyamines (OR=14.6) + long-chain PUFAs (OR=7.9) are CD-up theme-level signatures in HMP2; tauro-α/β-muricholate + free taurine CD-up corroborates F. plautii 7α-dehydroxylation deficit; urobilin CD-DOWN. NB09b tests whether these signatures replicate in FRANZOSA_2019 (220 participants × 88 CD + 76 UC + 56 Control), an independent multi-site IBD metabolomics cohort. Per plan v1.9 no raw reads — uses precomputed fact_metabolomics only. DAVE_SAMP_METABOLOMICS dropped (mouse ileal AKR/J tissue, not human IBD).
Method: Franzosa metabolite IDs follow [m/z]_[RT] format with no compound-name annotations in the mart. Bridge HMP2 named metabolites (with HMDB ID + m/z + RT) to Franzosa peaks within method by m/z (±0.005 Da tolerance). RT scales differ between HMP2 and Franzosa runs (different gradients/columns) so RT is not used as a matching constraint. 122 of 592 named HMP2 metabolites match a Franzosa peak; 118 of 122 pass Franzosa CD-vs-Control DA filters.
Per-theme cross-cohort sign-concordance:
| Theme | n matched | n CD-up HMP2 | n CD-up Franz | sign-concord | % concord | median Franz cliff |
|---|---|---|---|---|---|---|
| urobilin_porphyrin | 3 | 0 | 0 | 3 | 100 % | −0.747 (CD-DOWN) |
| acyl_carnitines | 5 | 4 | 5 | 4 | 80 % | +0.531 |
| long_chain_PUFA | 12 | 12 | 9 | 9 | 75 % | +0.273 |
| aromatic_AA_metabolites | 3 | 3 | 2 | 2 | 67 % | +0.181 |
| bile_acids_primary | 7 | 5 | 4 | 4 | 57 % | +0.252 |
| lipid_classes | 21 | 14 | 4 | 11 | 52 % | −0.069 |
| tryptophan_indole | 2 | 1 | 0 | 1 | 50 % | −0.199 |
| bile_acids_secondary | 5 | 3 | 2 | 2 | 40 % | −0.159 |
| polyamines | 0 | — | — | — | — | (no m/z match — methodological gap) |
3 themes ≥75 % sign-concordant (urobilin, acyl-carnitines, long-chain PUFA), overall 64 % concordance (76 of 118), 9 strict cross-cohort replications (both FDR<0.10 + |cliff|>0.20 + sign-match) — these represent the strongest cross-cohort signal in the project's metabolomics analyses.

Findings:
-
Urobilin CD-DOWN replicates at 100 % (3/3 matches; median Franzosa cliff −0.747 — the largest cross-cohort effect size in the panel). Reflects loss of bilirubin-reducing commensals (Hall 2024; Vital 2018) consistently in CD across HMP2 and Franzosa. Strongest replicated metabolite-level finding in the project.
-
Acyl-carnitines CD-up replicates at 80 % (4/5 matches; median Franzosa cliff +0.531). The C16 + C18:1 long-chain acyl-carnitines that NB09a §12 flagged as CD-up are independently elevated in Franzosa CD samples. Mechanistically connected to v1.8 §9 fat-metabolism/glyoxylate theme + impaired β-oxidation in inflamed mucosa.
-
Long-chain PUFAs CD-up replicates at 75 % (9/12 matches; median Franzosa cliff +0.273). Adrenate/arachidonate/DHA/DPA replicate cleanly; eicosapentaenoate (EPA) shows opposite sign in 1 of 3 non-concordant cases (possibly cohort-specific dietary fish-oil supplementation differences). Inflammatory eicosanoid-precursor pool elevation is a robust cross-cohort signature.
-
Bile-acid signals are weaker cross-cohort than within-cohort. Primary BAs replicate at 57 % (4/7), secondary BAs at 40 % (2/5). The NB09c §13 paired sample-level substrate-product signature for F. plautii / E. lenta / E. bolteae (ρ within-HMP2-paired) is the more robust evidence stream than cohort-aggregate cross-cohort DA. BA pool sizes are more susceptible to dietary and antibiotic confounding than within-cohort species × metabolite correlation. NB09c §13 finding does not require Franzosa replication; this NB09b result documents that BA-pool DA is more cohort-dependent than within-cohort correlation.
-
Polyamine bridge fails — m/z-only matching does not recover any of the 9 polyamines that drove the NB09a §12 OR=14.6 theme enrichment. Polyamines are low-mass HILIC-pos compounds (putrescine 89.11; spermidine 146.17; spermine 203.22) and the m/z-tolerance + Franzosa-method-library overlap does not yield matches. The HMP2 polyamine OR=14.6 stands as a single-cohort finding until cross-cohort replication via external Franzosa supplementary tables (with compound-name annotations) becomes available — outside the project's no-raw-reads scope.
Verdict — STRONG cross-cohort replication on 3 of 8 testable themes (urobilin + acyl-carnitines + long-chain PUFAs). Overall 64 % sign-concordance is moderately above the 50 % chance baseline; the 9 strict replications are firmly above-chance. Polyamines unable to bridge by m/z. The bile-acid DA signal is partly cohort-specific (which strengthens the NB09c §13 paired-sample interpretation, where the F. plautii / E. lenta / E. bolteae substrate-product signature is direct rather than DA-aggregated).
Bile-acid 7α-dehydroxylation narrative — sample-level evidence is stronger than cohort-aggregate cross-cohort
NB09b clarifies the relative robustness of two evidence streams for the bile-acid network finding:
- NB09c §13 paired sample-level direct substrate-product signature (ρ(F. plautii × cholate) = −0.26; × lithocholate = +0.15) operates within paired metabolomics+metagenomics samples in HMP2. It is a species × metabolite correlation, not a CD-vs-Control aggregate DA.
- NB09b cohort-aggregate cross-cohort DA on bile acids replicates at 57 % primary / 40 % secondary — moderately above chance for primary, sub-chance for secondary.
The interpretation: BA pool sizes vary across cohorts due to dietary and antibiotic confounding, but the mechanistic substrate-product signature for active 7α-dehydroxylation is preserved at the within-paired-sample level. The NB09c six-line bile-acid narrative is anchored on the paired sample-level evidence + literature mechanism, not on cohort-aggregate DA replication. This is a methodological note for any future BERIL multi-modal analysis: cohort-aggregate cross-cohort DA and within-cohort paired-sample correlation are different evidence streams; one can be weaker without invalidating the other.
Outputs
data/nb09b_cross_cohort_concordance.tsv— 122 (HMP2-name × Franzosa-peak) matched pairs with cliff_delta + sign-match flagdata/nb09b_theme_replication.tsv— per-theme sign-concordance summarydata/nb09b_cross_cohort_verdict.json— formal verdictfigures/NB09b_cross_cohort_metabolomics.png— cross-cohort cliff δ scatter + per-theme sign-concordance
(Script: run_nb09b.py. Per plan v1.7 H3d-DA cross-cohort follow-up + v1.9 no-raw-reads.)
17. NB09d — H3d-clust metabolite-feature ecotype LOSO stability
The H3d-clust hypothesis (per plan v1.7): does metabolite-derived clustering achieve higher cross-cohort stability than the taxonomic-derived ecotype framework? If yes, refitting ecotypes on metabolite features would give a more cross-cohort-portable framework for clinical translation. Per plan v1.9 no raw reads — uses NB09b m/z-bridge feature panel + pooled HMP2 + Franzosa metabolomics.
Method: pool 326 subjects (106 HMP2 × 220 Franzosa) on the 111 unique m/z-bridge metabolite features (NB09b); standardize per metabolite; PCA → 15 components; K-means K=4. Compare to NB04f taxonomic-ecotype LOSO ARI baseline (0.113) and NB04b bootstrap baseline (0.16).
Cluster structure is COHORT-driven, not diagnosis-driven:
| Cluster | Size | Cohort | Diagnosis composition |
|---|---|---|---|
| Cluster 0 | 86 | Franzosa 100 % | 46 Control + 30 UC + 10 CD |
| Cluster 1 | 58 | HMP2 100 % | 29 CD + 23 UC + 6 nonIBD |
| Cluster 2 | 134 | Franzosa 100 % | 78 CD + 46 UC + 10 Control |
| Cluster 3 | 48 | HMP2 100 % | 21 CD + 7 UC + 20 nonIBD |
PC1 explains 79 % of total variance and separates HMP2 (PC1 ≈ +12) from Franzosa (PC1 ≈ −5) cleanly. PCs 2–15 collectively explain ~10 % of variance and carry the diagnosis information. Within each cohort, clusters DO carry diagnosis information (cluster × diagnosis χ²(6)=88.3, p=7e-17), but cluster boundaries are mostly co-aligned with cohort boundaries.
Stability metrics:
| Test | ARI | Comparator | Verdict |
|---|---|---|---|
| Cross-cohort LOSO (HMP2 held out) | 0.000 | NB04f taxonomic 0.113 | substantially LOWER |
| Cross-cohort LOSO (Franzosa held out) | 0.000 | NB04f taxonomic 0.113 | substantially LOWER |
| Mean cross-cohort LOSO | 0.000 | NB04f taxonomic 0.113 | LOWER |
| Within-pooled bootstrap (80 % × 30) | 0.937 | NB04b taxonomic 0.16 | HIGHER (but trivially — measures cohort-batch reproducibility, not biology) |
The within-pooled bootstrap ARI of 0.937 is misleading as a stability metric: it measures how reproducibly the pooled K-means recovers the cohort-batch structure under subsampling, which is high. It does NOT measure whether the clustering is biologically informative; the cohort batch effect is consistently visible in any subsample.

H3d-clust verdict: NOT SUPPORTED. Cross-cohort LOSO ARI = 0.000 ≪ 0.113 taxonomic baseline. The metabolite-feature framework on the m/z-bridge feature panel does NOT achieve higher cross-cohort stability than the taxonomic framework.
Methodological lesson (now in docs/discoveries.md): m/z-bridge metabolomics has unaddressed batch effects that dominate cluster structure. The taxonomic-feature ecotype framework was naturally cross-cohort-portable because MetaPhlAn3 relative-abundance values are unitless and compositionally constrained — that constraint creates a natural normalization that handles cohort differences. Metabolite-feature clustering inherits absolute-intensity scale differences between LC-MS runs (different instruments, ionization tuning, solvent batches) and requires explicit batch correction (ComBat / SVA / RUV / quantile normalization) prior to clustering. Within-cohort metabolite-feature ecotypes might still be useful for single-laboratory clinical translation; the failure here is specifically the cross-cohort-portability question.
Implications for project narrative
- The taxonomic ecotype framework (NB01b consensus K=4) remains the project's primary ecotype basis for Pillar 4–5 cocktail design.
- Metabolite-feature ecotype refit is not a drop-in replacement for the taxonomic framework in this project's data scope; it would require external batch correction infrastructure.
- The NB09c §13 paired sample-level finding (within-HMP2 species × metabolite correlation) is the more robust mechanism evidence stream; cohort-aggregate cross-cohort metabolite DA is dominated by batch effects (NB09b primary BAs 57 % concord; secondary BAs 40 %).
- The H3 falsifiability framework is now fully closed: H3a (a/c) + H3a (b) v1.8 + H3a-new + H3b + H3c + H3d-DA + H3d-clust + H3e all tested with formal verdicts.
Limitations:
- m/z-bridge feature space is small (111 unique features). A larger feature space (500-1000 metabolites with name annotations across cohorts) might give different results — but requires external Franzosa supplementary annotations (out of scope per plan v1.9).
- No batch correction applied — ComBat/SVA/RUV would likely improve cross-cohort LOSO ARI substantially. The current verdict is for "raw m/z-bridge clustering without batch correction."
- K=4 is the plan-imposed K matching the taxonomic framework; natural cluster count on this feature space might be K=2 (HMP2 vs Franzosa).
Output artifacts:
- data/nb09d_metabolite_ecotype_assignments.tsv — 326 pooled subjects × {cohort, diagnosis, metabolite_cluster}
- data/nb09d_h3d_clust_verdict.json — formal H3d-clust verdict (NOT SUPPORTED)
- figures/NB09d_metabolite_ecotype_stability.png — 3-panel summary
(Script: run_nb09d.py. Per plan v1.7 H3d-clust + v1.9 no-raw-reads.)
18. NB07d — MOFA+-style HMP2 multi-omics joint factor pilot (taxonomy + metabolomics)
The plan v1.7 NB07d called for 3-modality MOFA+ on HMP2 taxonomy + pathways + metabolomics. Per plan v1.9 (no raw reads) and the project's data-scope reality — HMP2 pathway abundance is not in the mart (fact_pathway_abundance contains only CMD_IBD_PATHWAYS) — NB07d falls back to 2-modality joint factor analysis on (HMP2 taxonomy + HMP2 metabolomics) using sklearn CCA on per-modality PC scores (mofapy2 unavailable).
Data scope: 106 paired HMP2 CSM* subjects (intersection from NB09c) × 130 ≥10%-prevalence species (CLR-transformed) + 582 named metabolites (log10-intensity, ≥30% non-NaN coverage). Per-modality PCA → 30 components each → CCA → 4 canonical pairs.
All 4 canonical correlations are very strong: r = 0.964, 0.928, 0.911, 0.889 — taxonomy and metabolomics share substantial cross-modality structure on this paired-subject set.
CC1 is THE CD-vs-nonIBD diagnosis-discriminative joint factor:
| Factor | canon r | Mean CD | Mean UC | Mean nonIBD | cliff CD-vs-nonIBD | MW p |
|---|---|---|---|---|---|---|
| CC1 | 0.964 | +0.235 | +0.123 | −0.593 | +0.498 | 4e-4 |
| CC2 | 0.928 | -0.017 | +0.130 | -0.117 | +0.092 | 0.51 |
| CC3 | 0.911 | -0.030 | -0.166 | +0.250 | -0.146 | 0.30 |
| CC4 | 0.889 | -0.152 | -0.082 | +0.387 | -0.274 | 0.05 |
CC1 separates CD (+0.235) from nonIBD (−0.593) by ~0.83 SD on a single joint axis; UC sits at +0.123, between CD and nonIBD (consistent with UC as a milder dysbiosis state on the same axis).
CC1 species loadings recapitulate the entire actionable Tier-A set: ALL 6 actionable Tier-A core species load POSITIVE (CD-direction): M. gnavus +0.195, E. coli +0.173, F. plautii +0.153, H. hathewayi +0.144, E. lenta +0.109, E. bolteae +0.103. Plus oral-gut Tier-B candidates V. parvula +0.194, A. intestini +0.161, V. atypica +0.151. NEGATIVE loadings (commensal-direction) match NB01b ecotype-defining commensals: R. bromii −0.173, R. bicirculans −0.170, A. putredinis −0.169, L. eligens, B. intestinihominis. NB06 H2d pathobiont module structure independently rediscovered as a single principal direction in joint species-metabolite space — two analytical approaches (CLR+Spearman+Louvain modules vs CCA joint factors) converge on the same biology.
CC1 metabolite loadings recapitulate ALL Pillar 3 metabolomics narratives in one axis:
| Direction | Top loadings | Pillar-3 narrative connection |
|---|---|---|
| NEGATIVE (CD-direction = depleted) | urobilin (3 instances, max −0.143); glycodeoxycholate −0.113; lithocholate −0.080; caproate −0.113 | NB09b §16 100 % cross-cohort urobilin CD-DOWN; NB09c §13 BA-pool depletion in CD |
| POSITIVE (CD-direction = elevated) | linoleoyl ethanolamide ×2 (+0.110, +0.106); palmitoylethanolamide +0.108 | NB08a §11 ebf/ecf BGC family CD-up p<1e-31 (Elmassry 2025) |
| POSITIVE | sphingosine-isomer1/2 (+0.109, +0.105); 7-methylguanine +0.121 | NB09a §12 lipid_classes CD-up + v1.8 §9 H. hathewayi purine recycling |
| POSITIVE | N-acetylputrescine +0.105; diacetylspermine +0.091; cadaverine +0.086; putrescine +0.084 | NB09a §12 polyamines OR=14.6 + NB09c §13 cadaverine × E. coli ρ=+0.45 |
| POSITIVE | docosapentaenoate ×2 (+0.092, +0.091); adrenate +0.081; arachidonate +0.048 | NB09a §12 long-chain PUFAs OR=7.9; NB09b §16 75 % cross-cohort concord |
| POSITIVE | ADMA/SDMA +0.099 | uremic toxin marker; arginine catabolism connected to v1.8 §9 TMA/choline |

Pilot verdict: SUCCESSFUL. CC1 is the unified Pillar 3 CD-vs-nonIBD axis in joint species-metabolite space. It is the cleanest single-factor representation of "what is CD biology" that the project has produced — a single principal direction that:
1. recapitulates the entire NB05 actionable Tier-A set (all 6 species CD-positive)
2. recapitulates the NB06 H2d pathobiont module structure
3. recapitulates the NB09a §12 polyamine + PUFA themes
4. recapitulates the NB09b §16 urobilin cross-cohort CD-DOWN finding
5. recapitulates the NB09c §13 BA-pool depletion + cadaverine × E. coli signature
6. recapitulates the NB08a §11 ebf/ecf fatty-acid-amide BGC-family signature
This is the cross-pillar unified narrative axis in a single multi-omics joint factor. The same biological state — "CD pathobiont-module dominance at the species level + the metabolic consequences thereof" — is the dominant cross-modality structure in HMP2 and is detectable as a single canonical correlation pair at r=0.96.
Methodological notes
- 2-modality vs 3-modality: pathway modality dropped per data-scope constraint. Adding pathways via cMD_IBD reprocessing on cohort-aligned subjects (NB07a) would give 3-modality MOFA but requires sample-level pairing (cMD_IBD pathway slice exists but is NOT paired with HMP2 metabolomics).
- CCA vs MOFA+: CCA on PC scores captures the same canonical-correlation signal that MOFA+ would on the 2-modality case. MOFA+ would additionally model modality-specific factors (factors loading only on one modality), which CCA does not. The 4 PCs we used are joint factors; modality-specific structure remains in the per-modality PCA components.
- Ecotype as covariate (per plan v1.7 N13): ecotype is implicit in the species-loading structure (E1-Bact2-transitional + E3-Bacteroides-expanded species both load positive on CC1; E0-commensal species load negative). Future extension: regress factor scores on ecotype + diagnosis to test whether residual factor variance encodes ecotype-specific biology.
Limitations
- Pathway modality not in the analysis (HMP2 fact_pathway_abundance unavailable; v1.9 no-raw-reads constraint).
- CCA cannot identify modality-specific factors — MOFA+ would add ~5–10 unique factors per modality on top of joint factors.
- Pilot-scale — not a substitute for a full MOFA+ analysis with 3 modalities and proper factor-relevance gating. The 4 canonical correlations are all very strong (>0.88), which suggests the per-modality PCAs are over-aligned; a proper MOFA+ would impose sparsity to gate factor count.
- n=106 subjects, modest for a 4-factor decomposition. The CC2/CC3/CC4 signal beyond CC1 is suggestive but not strongly disease-discriminative.
Output artifacts:
- data/nb07d_cca_loadings.tsv — top 15 species + 15 metabolite loadings per of 4 canonical components
- data/nb07d_subject_factor_scores.tsv — 106 subjects × {CC1-4 species score, metab score, joint score} + diagnosis
- data/nb07d_mofa_pilot_verdict.json — formal pilot verdict
- figures/NB07d_mofa_pilot.png — joint factor space + CC1 species + CC1 metabolite loadings
(Script: run_nb07d.py. Per plan v1.7 NB07d (scope-adjusted to 2-modality per v1.9) + N13 multi-modal QC prerequisite (NB04e + NB07a + NB09a all passed).)
19. NB12 — Pathobiont × phage targetability matrix (Pillar 4 opener)
The Pillar 4 framework starts with a per-pathobiont phage-availability profile (Tier-B in the project's 4-tier rubric) layered on top of the NB05 actionable Tier-A scoring + Pillar-3 mechanism profile. NB12 builds the foundational matrix from ref_phage_biology (12-organism literature-curated synthesis from indicator_taxa_literature_review).
Phage-availability score (Tier-B), 0-3 ordinal scale: 0 = no known phages, 1 = temperate / limited, 2 = lytic literature, 3 = clinical trial / commercial cocktail.
Per-actionable Tier-A profile:
| Pathobiont | NB05 | Phage | Lifestyle | BA cost | Pillar-5 priority |
|---|---|---|---|---|---|
| H. hathewayi | 4.0 | 0 | none | low | GAP: highest NB05 but no known phages — external DB query (INPHARED + IMG/VR) priority |
| M. gnavus | 3.8 | 1 | temperate | low | Limited: 6 known phages all temperate — lytic-locked engineering OR biochemical glucorhamnan-synthesis target |
| E. coli (AIEC) | 3.6 | 3 | lytic + clinical | low | Tier-1 clinical: EcoActive cocktail (7 lytic phages, clinical trials); HER259 (FimH-targeting). Most advanced |
| E. lenta | 3.3 | 2 | lytic literature | moderate | Tier-2: PMBT5 siphovirus characterized; non-BGC drug-metabolism mechanism (Koppel 2018) |
| F. plautii | 3.3 | 0 | unknown (not in ref) | HIGHEST | GAP + HIGH cost: not in ref_phage_biology; HIGHEST BA-coupling cost — phage targeting deprioritized in favor of BA monitoring or biochemical alternatives |
| E. bolteae | 2.8 | 2 | lytic literature | moderate | Tier-2: PMBT24 (virulent, 99,962 bp Kielviridae) — best-characterized lytic phage among gut-anaerobe Tier-A |
Stratification — 4 phage-availability classes among 6 actionable Tier-A: Class 3 clinical (1: E. coli), Class 2 lytic literature (2: E. lenta, E. bolteae), Class 1 temperate-limited (1: M. gnavus), Class 0 gap (2: H. hathewayi, F. plautii).
Critical: the 2 highest-NB05-scored species (H. hathewayi 4.0, M. gnavus 3.8) have the WEAKEST phage availability. F. plautii additionally has the HIGHEST BA-coupling cost — phage targeting may be deprioritized in favor of BA-binding co-therapy.

Pillar 4 → Pillar 5 hand-off framework — 3 design strategies for the 6 actionable Tier-A:
1. Direct phage targeting (Tier-1): E. coli (AIEC subset, EcoActive clinical-trial cocktail; require strain-resolution diagnostic per NB07b/NB08a).
2. Phage targeting with monitoring (Tier-2): E. lenta (PMBT5), E. bolteae (PMBT24); M. gnavus if lytic-locked engineering succeeds.
3. Phage GAP — alternative strategies needed:
- H. hathewayi: highest priority for external DB query; if no phages, fall back to GAG-degrading enzyme inhibitors.
- F. plautii: lowest Pillar-5 priority due to highest BA-cost — consider deprioritizing or co-administering UDCA / BA-binding agent.
- M. gnavus: biochemical glucorhamnan-synthesis targets (Henke 2019) if lytic-locked engineering fails.
Output artifacts:
- data/nb12_phage_targetability_matrix.tsv — per-pathobiont scoring matrix with NB05 + phage + Pillar-5 class
- data/nb12_phage_targetability_verdict.json — formal verdict + Pillar-4/5 hand-off + limitations
- figures/NB12_phage_targetability.png — 2-panel: Tier-A × Tier-B scatter + Pillar-5 priority bar
Quantitative-augmentation data found during NB12 execution (deferred to NB13):
- HMP2 fact_viromics (3,039 sample-rows): direct phage observations in patient stool. 16 unique E. coli phages + 13 unique Klebsiella phages observed. NO observations of phages targeting H. hathewayi / M. gnavus / F. plautii / E. lenta / E. bolteae — confirms the gut-anaerobe phage coverage gap from the literature-curation side.
- PhageFoundry phagefoundry_strain_modelling (BERDL): 96 phages × 188 E. coli strains × experimentally-tested susceptibility (17,672 pairs; 3,929 susceptible / 13,743 resistant; 22 % susceptibility rate). The definitive E. coli AIEC phage-cocktail design dataset for this project.
- ref_viromics_cd_vs_nonibd (22 viruses pre-computed CD-vs-nonIBD DA): top hit Gokushovirus WZ-2015a CD-DOWN log2FC=-2.7, FDR=1e-11 (microvirus depleted in CD).
(Script: run_nb12.py. Per plan v1.7 NB12 + v1.9 no-raw-reads. BERDL Spark auth restored mid-execution; PhageFoundry quantitative analysis promoted to NB13.)
20. NB13 — PhageFoundry quantitative E. coli phage-cocktail design
NB12 established the qualitative literature-curated foundation; NB13 adds the quantitative cocktail-design layer at strain-resolution level using BERDL phagefoundry_strain_modelling (96 phages × 188 E. coli strains × 17,672 experimentally-tested susceptibility pairs from Gaborieau 2025-10-02 phage-prediction experiment, AUC=0.88). 3,929 / 17,672 = 22 % susceptibility rate across the matrix.
Greedy minimum-set-cover cocktail design — smallest cocktail covering ≥X % of 188 strains:
| Coverage target | n phages | % covered | Cocktail composition |
|---|---|---|---|
| ≥50 % | 1 | 63.8 % | DIJ07_P2 (Phapecoctavirus) |
| ≥75 % | 2 | 81.4 % | + LF73_P1 (Tequatrovirus) |
| ≥90 % | 4 | 92.6 % | + AL505_Ev3 + 55989_P2 |
| ≥95 % | 5 | 94.7 % | DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2 |
| ≥99 % | 8 | 98.4 % | + NIC06_P2 + LF73_P4 + BCH953_P4 |
A 5-phage cocktail covers 94.7 % of 188 E. coli strains in PhageFoundry. Comparable in size to the EcoActive 7-phage clinical-trial cocktail (per NB12 §1) — both are 5-7 phages. The top-ranked phage DIJ07_P2 alone (genus Phapecoctavirus) lyses 63.8 % of strains — remarkable single-phage breadth.
Phage host-phylogroup analysis — AIEC relevance:
AIEC strains are predominantly phylogroup B2 (~80 %) and D (~20 %) per Dogan 2014/Dubinsky 2022. PhageFoundry distribution:
| Phylogroup | n phages | median host range | max | AIEC relevance |
|---|---|---|---|---|
| B2 | 50 | 21.3 % | 63.3 % | Primary AIEC phylogroup |
| D | 15 | 13.8 % | 63.8 % | Secondary AIEC phylogroup |
| A | 10 | 16.5 % | 39.4 % | Predominantly commensal |
| B1 | 9 | 20.2 % | 50.5 % | Predominantly commensal |
| Other (C/0/G) | 10 | varied | varied | rare |
65 of 94 phages (69 %) are isolated against B2/D phylogroup hosts — strongly AIEC-relevant. Top broadest-host-range B2/D phages include several isolated against canonical AIEC reference strains: LF82_P8 (60.1 %, Mosigvirus) — LF82 is THE AIEC reference (Darfeuille-Michaud 2004); LF73_P1 (62.8 %) — LF73 is also a CD-associated AIEC strain; 536_P7 / 536_P9 — E. coli 536 is a UPEC/B2 archetype.
HMP2 viromics × PhageFoundry overlap = 0 — the 7 unique E. coli phages observed in HMP2 fact_viromics (D108, EC6, ECML-117, Murica, slur16, vB_EcoM-VpaE1, vB_EcoM_AYO145A) do NOT name-overlap with PhageFoundry phages. The two datasets are complementary, not overlapping: PhageFoundry = research/clinical isolates with experimental susceptibility; HMP2 viromics = natural phages observed in patient stool. PhageFoundry is the primary source for cocktail-design.

Pillar 5 hand-off — concrete E. coli AIEC phage-cocktail recommendation:
1. Tier-1 cocktail (5 phages, 95 % strain coverage): DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2
2. Extended Tier-1+ (8 phages, 99 % strain coverage): above + NIC06_P2 + LF73_P4 + BCH953_P4 — comparable in size to EcoActive
3. Strain-level diagnostic requirement: AIEC-vs-commensal E. coli discrimination via per-patient pks-island / Yersiniabactin / Enterobactin gene-presence detection (per NB07b + NB08a) is required before cocktail selection — the cocktail covers broad E. coli strain diversity but does not by itself distinguish AIEC from commensal E. coli.
Limitations:
- PhageFoundry has no explicit AIEC-vs-commensal strain annotation. The 188 strains include canonical AIEC strains (LF82, LF73, 536, NIC06, BCH953 — named as phage isolation hosts) but Pillar-5 cross-reference to literature isolate annotations is needed for accurate AIEC coverage estimation.
- Susceptibility matrix is binary (no titer/burst size). Real-world cocktail dosing depends on phage burst size + receptor binding kinetics + gastric passage survival — out of project scope.
- 26 strains (14 %) are phage-resistant at ≤5 % susceptibility — clinical significance of these escape strains needs cross-reference to AIEC pks-island annotation.
- HMP2 viromics × PhageFoundry overlap = 0 means in-vivo gut delivery of cocktail phages is not validated by the project's data; would require in-vivo testing.
Output artifacts:
- data/nb13_phage_host_range.tsv — 96 phages × {host_range_pct, Family, Genus, Phage_host_phylo, …}
- data/nb13_strain_phage_susceptibility.tsv — 188 strains × phage_susceptibility_pct
- data/nb13_phagefoundry_cocktail_verdict.json — formal verdict + cocktail compositions at 50/75/90/95/99 % coverage
- figures/NB13_phagefoundry_cocktail.png — 3-panel: phage host-range + strain susceptibility + cocktail-coverage curve
(Script: run_nb13.py. BERDL Spark Connect via fresh KBASE_AUTH_TOKEN from /home/aparkin/.env. Per plan v1.7 NB13 + v1.9 no-raw-reads.)
21. NB14 — HMP2 endogenous phageome × ecotype × diagnosis
NB14 closes Pillar 4 by adding the in-vivo phage-community lens to the curated-literature (NB12) + experimental-susceptibility (NB13) layers. HMP2 fact_viromics (3,039 sample-rows × 273 viruses × VirMAP taxonomic profile) cross-referenced with NB04h ecotype projections gives 630 of 648 viromics samples (97 %) with ecotype calls — 27 (E0) + 484 (E1) + 55 (E2) + 50 (E3) samples spanning CD/UC/healthy.
Per-ecotype × per-virus CD-vs-nonIBD Mann-Whitney: 85 DA tests across 4 ecotypes; 3 pass strict FDR<0.10:
| Ecotype | Virus | Cliff δ | n_CD vs n_HC | FDR |
|---|---|---|---|---|
| E1 | Gokushovirus WZ-2015a | −0.358 | 231 vs 125 | 5e-7 |
| E2 | Gokushovirus WZ-2015a | −0.471 | 21 vs 20 | 0.056 |
| E2 | Human feces pecovirus | −0.300 | 21 vs 20 | 0.056 |
Gokushovirus WZ-2015a is robustly CD-DOWN across multiple ecotypes, with the strongest signal in E1 (cliff=-0.36, FDR=5e-7). This independently rediscovers the precomputed ref_viromics_cd_vs_nonibd top hit (log2fc=-2.7, FDR=1e-11). Gokushovirus is a Microviridae member (single-stranded DNA, ~5 kb genome; Gokushovirinae subfamily includes the canonical "crassphage-like" lineage that infects Bacteroides / Prevotella gut commensals) — CD-DOWN consistent with Norman 2015 / Clooney 2019 IBD-Microviridae depletion.
Tier-A pathobiont species × phage-family Spearman ρ (n=630 paired viromics+metaphlan3 samples):
| Species | Phage family | ρ | p |
|---|---|---|---|
| E. coli | Podoviridae | +0.183 | 4e-6 |
| H. hathewayi | Unknown | -0.150 | 2e-4 |
| M. gnavus | Unknown | -0.134 | 7e-4 |
| E. coli | Myoviridae | +0.125 | 0.002 |
| E. bolteae | Myoviridae | +0.119 | 0.003 |
E. coli correlates positively with Podoviridae (+0.18) and Myoviridae (+0.13) — both Caudovirales families that include T7-like + T4-like E. coli phages. This is a plausible endogenous phage-host correlation: when E. coli is abundant, E. coli phages tend to also be abundant (commensal phage carriage / lysogenic-state co-occurrence). All correlations |ρ|≤0.18 — no strong (|ρ|>0.30) endogenous phage candidates targeting Tier-A pathobionts.
H. hathewayi and *M. gnavus* correlate NEGATIVELY with the "Unknown" phage family (which captures 80 % of HMP2 viromics observations that VirMAP couldn't classify) — pathobiont blooms displace some unclassified phages, consistent with reduced phage diversity in CD dysbiosis.

Per-ecotype phage-family abundance (Panel A) shows modest ecotype-specific variation: Anelloviridae E1-specific; Parvoviridae E2-elevated; Unknown family E2-dominant. The dominant signal is the "Unknown" classification (80 % of observations) — VirMAP family-level classification gap is the methodological limit, not biology.
Pillar 4 closure synthesis — three complementary phage-evidence layers:
| Notebook | Evidence layer | Key finding |
|---|---|---|
| NB12 | Curated literature foundation | E. coli AIEC = clinical-trial-stage (EcoActive); M. gnavus = temperate-only; H. hathewayi + F. plautii = phage GAP |
| NB13 | PhageFoundry experimental susceptibility (96 phages × 188 strains × 17,672 pairs) | 5-phage cocktail covers 95 % of E. coli strains; 69 % of phages AIEC-relevant phylogroup |
| NB14 | HMP2 in-vivo endogenous phageome (630 samples × 21 families) | Gokushovirus CD-DOWN cross-ecotype (E1 FDR=5e-7); E. coli × Podoviridae/Myoviridae positive correlation (+0.18, +0.13); modest in-vivo phage signal |
Combined Pillar-4 verdict: phage-therapy feasibility for E. coli is high (clinical cocktail + experimental susceptibility + in-vivo phage correlation all aligned); for M. gnavus / H. hathewayi / F. plautii the gap is confirmed across all three layers (no clinical phages, no PhageFoundry coverage, no HMP2 in-vivo Podoviridae/Myoviridae signal). External DB queries (INPHARED + IMG/VR) for the 3 gut-anaerobe gaps remain the highest-priority Pillar-4 follow-up before Pillar 5 cocktail drafts.
Limitations:
- VirMAP family-level classification gap: 80 % of phages classified as "Unknown" family — limits per-family DA + correlation power.
- Per-ecotype DA asymmetric power: E1 has 231 CD samples, but E0/E2/E3 have 12-30 — limits within-ecotype detection in smaller ecotypes (Gokushovirus E0 cliff=-0.55 is biologically interesting but doesn't pass strict FDR<0.10 due to power).
- No CRISPR-spacer-derived phage-host predictions in HMP2 viromics — would require IMG/VR cross-reference for Tier-A-host phage discovery.
- "Unknown" family aggregation hides species-level signal; per-virus DA is more informative than per-family for IBD-relevant phages.
Output artifacts:
- data/nb14_viromics_da_per_ecotype.tsv — 85 (ecotype × virus) DA tests
- data/nb14_pathobiont_phage_family_corr.tsv — 126 (Tier-A species × phage family) Spearman ρ
- data/nb14_endogenous_phageome_verdict.json — formal verdict
- figures/NB14_endogenous_phageome.png — 3-panel: ecotype × family abundance heatmap + top 12 DA viruses + species × family ρ heatmap
(Script: run_nb14.py. Per plan v1.7 NB14 endogenous phageome stratification + v1.9 no-raw-reads.)
22. NB15 — UC Davis per-patient profile + cocktail draft (Pillar 5 opener)
NB15 assembles per-patient profiles for 23 UC Davis CD patients combining all Pillar 1-4 evidence and produces concrete per-patient cocktail drafts with components + caveats per patient. The Pillar-5 hand-off framework from REPORT § Pillar 4 closure (3 design strategies) is applied as a per-patient rule book.
Inputs per patient: NB02 ecotype (E0/E1/E3/mixed) + demographics (Montreal, calprotectin, medication) + Kuehl_WGS Kaiju Tier-A pathobiont presence (≥0.001 relative abundance) + NB05 Tier-A score + Pillar-3 mechanism profile + Pillar-4 phage availability.
Per-target prescribing rate (across 23 patients):
| Target | n present | % cohort |
|---|---|---|
| M. gnavus | 21 | 91 % |
| H. hathewayi | 19 | 83 % |
| E. bolteae | 19 | 83 % |
| F. plautii | 18 | 78 % |
| E. lenta | 16 | 70 % |
| E. coli | 8 | 35 % |
E. coli present in only 8/23 patients (35 %) — interesting given AIEC is canonically CD-associated. Implication: NB13's 5-phage AIEC cocktail directly applicable to ~35 % of cohort; H. hathewayi / M. gnavus near-universal carriage means their phage GAP (NB12) is the dominant unmet need.
Per-ecotype cocktail summary:
| Ecotype | n patients | Mean targets / patient | n with cocktail | n with concrete phage |
|---|---|---|---|---|
| E0 | 7 | 1.71 | 7 | 0 (priority targets H. hathewayi + M. gnavus are GAP / temperate-only) |
| E1 | 9 | 4.89 | 9 | 9 (full Tier-A module + concrete cocktails) |
| E3 | 6 | 2.33 | 5 | 4 |
| Mixed (6967) | 1 | 3.0 | 1 | 1 |
| Total | 23 | 4.39 | 22 | 14 |
14 of 23 patients have concrete phage cocktail drafts. All 9 E1 patients receive concrete cocktails (PMBT24 + PMBT5 + AIEC 5-phage if E. coli present); 4 of 6 E3 patients; 1 mixed (patient 6967); E0 patients lack concrete components because their priority targets (H. hathewayi, M. gnavus) are in Pillar-4 GAP / temperate-only.

Per-patient stratification — 4 cocktail-design categories:
| Category | Description | Example patients | Strategy |
|---|---|---|---|
| A | Active disease + multiple targets | 3701 (calp 8000), 1835 (calp 3340), 5843 (calp 2150), 6434 (calp 251 + 6 targets) | Highest priority Pillar-5: full hybrid cocktail (phages + alternatives for GAP species) |
| B | Active disease + few targets | E0 patients with limited pathobiont burden | Limited cocktail; consider deprioritizing |
| C | Quiescent disease | 1112 (41), 2708 (3), 1492 (9), 1460 (7) | Reserve cocktail for flares; monitor calp |
| D | Mixed ecotype (longitudinal drift) | 6967 (E1 ↔ E3) | State-dependent dosing; central per-patient stability test for Pillar 5 |
F. plautii BA-cost is the dominant E1 design constraint: F. plautii present in 78 % of patients (18/23) AND has the HIGHEST BA-coupling cost (NB09c §13 active 7α-dehydroxylator). All 9 E1 patients carry F. plautii. Phage targeting shifts BA pool toward inflammatory primary tauro-conjugated forms; alternative is deprioritize F. plautii from cocktail + co-administer UDCA / BA-binding agent. F. plautii is also Pillar-4 GAP (not in ref_phage_biology), making it a "leave alone" target for Pillar 5.
E1 hybrid cocktail (9 patients): 3 strategies blended:
- Direct phage: E. coli (if present, NB13 5-phage), E. bolteae (PMBT24), E. lenta (PMBT5)
- Alternative: H. hathewayi (GAG-degrading enzyme inhibitors), F. plautii (BA-binding co-therapy or deprioritize)
- Limited: M. gnavus (temperate-only; lytic-locked engineering OR biochemical glucorhamnan-synthesis target)
Pure phage cocktail is not feasible for any E1 patient — only a hybrid with non-phage alternatives. The 3 GAP species require non-phage strategies; INPHARED + IMG/VR external DB queries for new gut-anaerobe phages are the highest-priority Pillar-4 follow-up before clinical translation.
Pillar 5 hand-off — concrete deliverables:
| Patient class | n | Recommended cocktail |
|---|---|---|
| E1 + active calp + E. coli present | ~3 | Full Tier-1: NB13 5-phage E. coli + PMBT24 + PMBT5 + alternatives for F. plautii / H. hathewayi / M. gnavus |
| E1 + active calp + no E. coli | ~5 | Tier-2: PMBT24 + PMBT5 + alternatives |
| E1 + quiescent | 1-2 | Reserve cocktail for flares; calp monitor |
| E0 + any | 7 | Limited (priority targets are GAP/temperate); consider deprioritizing |
| E3 + E. coli | ~3 | NB13 5-phage E. coli + PMBT5 + alternatives |
| E3 + no E. coli | ~3 | Limited (E. lenta + H. hathewayi only) |
| Mixed (6967) | 1 | State-dependent dosing |
Limitations:
- 23-patient cohort is small — results are exemplars/templates, not statistically robust per-patient validation.
- Kuehl_WGS uses Kaiju (not MetaPhlAn3) — Tier-A presence calls have lower confidence vs CMD analyses (NB02 classifier-mismatch asymmetry).
- No per-patient bile-acid measurements; BA-cost is ecotype-level, not per-patient.
- No patient-specific AIEC strain-resolution diagnostic for the 8 E. coli-positive patients; cocktail recommendation assumes AIEC subset prevalence per Dogan 2014.
- F. plautii deprioritization is precautionary based on NB09c mechanism — clinical validation requires per-patient BA panels.
Output artifacts:
- data/nb15_patient_profile.tsv — 23 patients × full per-patient profile
- data/nb15_per_patient_cocktail_draft.tsv — long-format per-patient × per-target cocktail breakdown
- data/nb15_pillar5_cocktail_verdict.json — formal verdict + per-ecotype summary
- figures/NB15_patient_cocktail_draft.png — 3-panel: presence heatmap + ecotype prescribing rate + calp × n_targets scatter
(Script: run_nb15.py. Pillar 5 opener; integrates NB02 + NB05 + NB06 + NB09c + NB10a + NB12-NB14 evidence into per-patient cocktail recommendations. Per plan v1.9 no-raw-reads.)
23. NB16 — Patient 6967 longitudinal stability + state-dependent dosing strategy
NB16 is the central per-patient longitudinal stability test for the project, focused on patient 6967 — the only multi-timepoint UC Davis CD patient with documented E1↔E3 ecotype drift across 2 visits. The notebook also uses patient 1112 (2 reseq replicates of the same biological sample) as a Kaiju reliability validation.
Patient 6967 longitudinal — clear E1→E3 transition with M. gnavus dominant:
| Tier-A | Visit 1 (E1, conf 0.64) | Visit 2 (E3, conf 0.41) | Fold change |
|---|---|---|---|
| H. hathewayi | 0.27 | 0.36 | 1.3× |
| M. gnavus | 0.53 | 7.45 | 14.0× |
| E. coli | 0.00 | 0.00 | — |
| E. lenta | 0.40 | 1.24 | 3.1× |
| F. plautii | 0.84 | 1.62 | 1.9× |
| E. bolteae | 0.25 | 0.54 | 2.1× |
M. gnavus 14× expansion is the dominant signature of the E1→E3 transition. All other Tier-A pathobionts also expand (1.3–3.1×) reflecting general dysbiosis worsening, but the M. gnavus fold-change dominates. E. coli remains absent in both visits — patient 6967 is not an AIEC carrier. This matches NB01b ecotype biology: E3 = severe-Bacteroides-expanded with M. gnavus as a dominant expansion axis; E1 = Bact2-transitional with milder pathobiont burden.
Per-visit cocktail composition (would the cocktail change?):
- Visit 1 (E1): 5 priority targets (E1_CD module 0), all 5 present → cocktail = {H. hathewayi, M. gnavus, E. bolteae, E. lenta, F. plautii}.
- Visit 2 (E3): 4 priority targets (E3_CD module 1), 3 present (E. coli absent) → cocktail = {E. lenta, H. hathewayi, M. gnavus}.
- Shared (both visits): H. hathewayi, M. gnavus, E. lenta — universal Tier-1 trio.
- Visit-1-only: E. bolteae, F. plautii — E1-specific (would be deprioritized on E3 transition).
- Cocktail Jaccard (visit 1 × visit 2) = 0.60 — moderate overlap; ecotype drift implies non-trivial cocktail re-design, not a complete rewrite.
Patient 1112 technical replicate concordance — Kaiju reliability validation:
| Tier-A | Reads_1112-1 | Reads_1112_reseq-1 | Difference |
|---|---|---|---|
| H. hathewayi | 0.445 | 0.436 | 2 % |
| M. gnavus | 7.916 | 7.792 | 1.6 % |
| E. coli | 0.000 | 0.000 | — |
| E. lenta | 0.712 | 0.642 | 10 % |
| F. plautii | 2.019 | 1.994 | 1.2 % |
| E. bolteae | 0.864 | 0.862 | 0.2 % |
Spearman ρ = 1.000 (p < 0.001) on 6 Tier-A — perfect rank concordance across reseq replicates. Kaiju calls are highly reliable for the actionable Tier-A pathobionts in UC Davis samples.

State-dependent dosing rule (5 concrete recommendations):
- Re-test ecotype every 3-6 months for active CD patients on phage cocktail therapy. Patient 6967 demonstrates ecotype is dynamic, not static.
- F. plautii inclusion is E1-specific — if patient transitions E1→E3, deprioritize F. plautii from cocktail. This also reduces BA-coupling-cost concern (NB09c). Per NB07c, F. plautii is an E1_CD module 0 species, not an E3_CD module 1 species.
- E. coli inclusion is E3-specific — if patient transitions E3→E1, deprioritize E. coli (subject to AIEC strain detection per NB07b/NB08a). E. coli is in E3_CD module 1 priority list, not E1_CD.
- Universal Tier-1 trio (M. gnavus, H. hathewayi, E. lenta) span both E1 and E3 — these don't need re-evaluation on ecotype shift; they are the cocktail backbone for any active-disease CD patient.
- M. gnavus qPCR as cheap ecotype-state indicator — the 14× expansion in patient 6967's E3 transition suggests M. gnavus abundance via qPCR could serve as a clinical proxy for ecotype shift, avoiding the need for full metagenomics at every visit. A 5-fold change in M. gnavus abundance might be the threshold for triggering full ecotype re-evaluation.
Pillar 5 hand-off — clinical workflow recommendation:
Initial visit:
└─ Stool metagenomics → ecotype assignment
├─ E0: limited cocktail, flare reserve
├─ E1: full hybrid 3-strategy cocktail (NB13 5-phage E. coli if AIEC+;
│ PMBT24; PMBT5; alternatives for H. hathewayi + F. plautii)
└─ E3: focused cocktail (E. coli if present; PMBT5; alternatives)
Follow-up visits (3-6 month):
├─ Calprotectin: assess disease activity
├─ M. gnavus qPCR: cheap ecotype-state indicator
│ └─ if 5-fold change → trigger full ecotype re-test
└─ If full re-test shows ecotype shift:
├─ E1 → E3: drop F. plautii; consider adding E. coli cocktail
├─ E3 → E1: add F. plautii alternative; reassess E. coli targeting
└─ Stable ecotype: continue current cocktail
Limitations:
- Patient 6967 is the only multi-timepoint patient with biological replicate samples in the UC Davis cohort — ecotype-drift conclusions are based on n=1 longitudinal trajectory.
- Visit 1 confidence (0.64) and visit 2 confidence (0.41) are both moderate; the ecotype call shift could partly reflect classifier uncertainty. However, the 14× M. gnavus expansion and 3× E. lenta expansion are quantitative biological observations independent of the ecotype label.
- No timing information between the 2 patient 6967 visits — duration of E1→E3 drift is unknown, limiting clinical-workflow timing recommendations.
- State-dependent dosing rule is theoretical; not yet validated in clinical practice.
- Cocktail Jaccard 0.60 is moderate; whether 3-of-5 component overlap implies clinically-meaningful cocktail re-design depends on patient response heterogeneity (out of project scope).
Output artifacts:
- data/nb16_p6967_tier_a_longitudinal.tsv — patient 6967 per-visit Tier-A abundance + fold change
- data/nb16_longitudinal_verdict.json — formal verdict + state-dependent dosing rules
- figures/NB16_longitudinal_dosing.png — 3-panel: per-visit Tier-A + tech replicate scatter + cocktail composition shift
(Script: run_nb16.py. Pillar 5 second notebook — central longitudinal-stability test + state-dependent dosing rule for Pillar 5 hand-off. Per plan v1.9 no-raw-reads.)
Interpretation
Project narrative summary (Pillars 1–5 closed)
The project's central question — can we derive ecotype-specific and per-patient pathobiont target lists for rational phage-cocktail design? — now has a project-internal answer that is robust across pathway, BGC, metabolite, strain, serology, phage-evidence, and per-patient longitudinal evidence streams. After 23 notebooks (Pillars 1–5) and two adversarial reviews catching 9 critical + 16 important methodological issues, the project establishes:
- A reproducible four-ecotype framework (NB01b consensus K=4) that stratifies 8,489 cMD samples + 1,627 HMP2 samples + 23 UC Davis patients into biologically clean clusters (E0 healthy-commensal, E1 Bact2-transitional, E2 Prevotella, E3 severe-Bacteroides). Externally replicated on HMP2 at 80% projection confidence + χ²(2)=15.6 disease-stratification p=0.016.
- A confound-free within-IBD-substudy CD-vs-nonIBD meta-analysis design (NB04e, Novel Contribution #8) that resolves both feature-leakage and substudy-confounding pitfalls of pooled-cohort DA. Six actionable Tier-A targets emerge (NB05): H. hathewayi (4.0), M. gnavus (3.8), E. coli (3.6), E. lenta (3.3), F. plautii (3.3), E. bolteae (2.8).
- Two cross-corroborated 6-line mechanism narratives (Novel Contribution #17): iron-acquisition (E. coli AIEC subset; OR=8.1 pathway, OR=44 BGC; six independent evidence streams) and bile-acid 7α-dehydroxylation (F. plautii / E. lenta / E. bolteae substrate-product signature at paired sample level; six independent evidence streams). All Pillar 3 narratives collapse into a single multi-omics joint factor (NB07d CC1, canon r=0.96, cliff CD-vs-nonIBD=+0.50): Novel Contribution #21 — independent rediscovery via CCA confirms that per-modality analyses converge on the same biology rather than measuring orthogonal phenomena.
- A per-target cocktail-design profile distinguishing iron/AIEC-mediated vs other CD specialization mechanism, bile-acid coupling cost, and species-abundance-mediated vs strain-content-mediated mechanisms (Novel Contributions #16, #18, NB09c+NB10a). This converts "Tier-A score = phage target priority" into a multi-attribute decision framework that Pillar 4–5 will operationalize.
- Three cross-cohort-replicated metabolite signatures (NB09b, ≥75% sign-concord HMP2 → Franzosa): urobilin CD-DOWN (loss of bilirubin-reducer commensals), acyl-carnitines CD-up, long-chain PUFAs CD-up (inflammatory eicosanoid precursors).
- Five generalizable methodology contributions beyond the project's specific findings (Novel Contributions #6–9, 15, 17, 19): cMD substudy-nesting unidentifiability; feature leakage in cluster-stratified DA; LOSO ARI as honest stability metric; pool ≠ flux for pathway vs metabolite DA; multi-line cross-corroboration as portable rigor pattern; cohort-batch-dominates-clustering for absolute-intensity multi-cohort modalities.
The H3 falsifiability framework is fully closed: 5 SUPPORTED + 1 PARTIALLY SUPPORTED + 2 PARTIAL (single-cohort caveat) + 1 NOT SUPPORTED (with methodology lesson). The two negatives carry biology-relevant lessons rather than mere refutation: H3a-new shared-environment reframing sharpens the cocktail-design "metabolic-coupling cost" annotation into a more rigorous "bile-acid coupling cost" annotation; H3d-clust failure documents that taxonomic-feature ecotypes remain the project's primary clinical-translation basis (vs metabolite-feature ecotypes which require external batch correction).
What remains for Pillars 4–5: phage targetability matrix per Tier-A core (PhageFoundry + external phage DBs) combined with the per-target cocktail-design profile to produce per-ecotype + per-patient cocktail drafts for the 23 UC Davis patients. Patient 6967's longitudinal E1 ↔ E3 shift (NB02) is the central per-patient-stability test for whether cocktail dosing should be state-dependent.
Why the four-ecotype framework matters for phage targeting
If patients with active CD separate into three ecotypes (E0 / E1 / E3), the pathobiont targets differ by ecotype. A cocktail designed against the pooled-cohort top-pathobiont list (R. gnavus, E. bolteae, E. coli, E. lenta, C. difficile, K. oxytoca — from the preliminary report's donor 2708 engraftment analysis) will mismatch the ~27 % of UC Davis patients in E0 (who may have near-healthy microbiomes and not need this cocktail at all) and may be over-aggressive for the 42 % in E1 (where the pathobiont burden is different from the severe E3 cluster). Per-ecotype cocktail design is the project's deliverable.
Why clinical-data-only ecotype screening is not viable
A clinical trial that screens patients into phage-cocktail arms by ecotype assignment would ideally use routine data (demographics, severity markers, medications) to place each patient. H1c's nominal AUC passes that bar, but the UC Davis translation shows the classifier cannot separate E1 from E3 without metagenomics. Trial design needs stool metagenomics as a screening step, unless (a) a rapid qPCR panel on ecotype-defining species can be developed (follow-up scope), or (b) within-IBD signal can be boosted with better training data (HMP2 + Franzosa medication metadata, when HMP2 MetaPhlAn3 is ingested — PENDING_HMP2_RAW).
Literature context
- Ecotype / enterotype framework: our K = 4 structure maps cleanly onto the canonical Bacteroides / Prevotella / Ruminococcus clusters originally described by Arumugam et al. (2011). E0 corresponds to the diverse Ruminococcus-type, E2 to the Prevotella-type, E1 + E3 to variations of the Bacteroides enterotype — with our data separating the transitional (E1) and severe (E3) Bacteroides states, consistent with the Bacteroides2 dysbiosis subtype described by Vandeputte et al. (2017) in IBD patients.
- DMM vs LDA: Holmes et al. (2012) introduced Dirichlet multinomial mixtures as the canonical ecotype-discovery method. In practice LDA on pseudo-counts yields equivalent assignments on samples of this scale (Ding & Schloss 2014; we confirmed by cross-method ARI with an independent CLR-based GMM).
- Enterotype reproducibility: Costea et al. (2018) argue that "discrete ecotypes" are better modeled as gradients than hard clusters. Our K = 4 assignments have 48.9 % between-method agreement — consistent with this (the microbiome continuum is real; we discretize for operational purposes).
- Pooled-cohort confounding: Vujkovic-Cvijin et al. (2020) show that pooled multi-cohort microbiome DA is heavily confounded by host variables (age, diet, geography). Our H1c finding — that
is_ibddominates classifier AUC and collapses on a single-cohort test — is a concrete instance of the same problem. - Compositional DA: Gloor et al. (2017), Lin & Peddada (2020), and Tsilimigras & Fodor (2016) establish that raw Mann-Whitney on relative abundance is systematically biased by the sum-to-constant constraint. Zhou et al. (2022) introduce LinDA (linear CLR regression with bias correction) as a simple pure-Python-implementable alternative that we use as the second-method concordance check in NB04c §4. NB00 reproduces the compositional-bias result directly on a curated battery.
- Pathobiont biology: R. gnavus produces an inflammatory glucorhamnan polysaccharide (Henke et al. 2019); AIEC drives ileal CD mucosal invasion (Darfeuille-Michaud et al. 2004). The NB05 E. coli MIBiG hits map onto well-characterized AIEC virulence determinants: Yersiniabactin is the iron-capture siderophore AIEC LF82 uses to survive in macrophage phagolysosomes (Prudent 2021, Dalmasso 2021, Dogan 2014); Colibactin is the pks+ genotoxin implicated in AIEC-associated colorectal carcinogenesis (Veziant 2016); Enterobactin is another AIEC siderophore active in mucosal-associated IBD settings. IBD-specific E. coli genomic adaptations (Dubinsky 2022) independently confirm that the species is over-abundant in IBD with disease-specific lineages. E. innocuum is a vancomycin-resistant CD-associated pathobiome correlated with creeping-fat and intestinal strictures (Le et al. 2025). These are the project's Tier-A candidate validation anchors. The NB05 S. salivarius Salivaricin MIBiG hits are lantibiotic bacteriocins whose proimmune activity regulates the oral microbiome (Barbour 2023) — consistent with the oral-gut axis literature surfaced earlier for the rigor-controlled E1 Tier-A.
- Oral-gut axis in IBD: a growing literature (Xiang 2024, Guo 2024, Tanwar 2023, Zhou 2023) documents oral-cavity species as ectopic colonizers of the IBD gut and argues for an "oral-gut axis" in disease pathogenesis. The top E1 Tier-A candidates S. salivarius, S. thermophilus, and S. parasanguinis are all oral streptococci; their within-substudy CD-enrichment is consistent with this framework. S. salivarius is specifically reported as a salivary biomarker for orofacial granulomatosis co-occurring with CD (Goel 2019). This is a methodologically important consistency check — it is biology we would expect the rigor-controlled analysis to surface, which the leakage-contaminated NB04 analysis instead produced as "commensal Simpson's paradox" noise.
- Phage therapy precedent for CD: Galtier 2017 (PMID 28130329) demonstrated that AIEC-specific bacteriophages targeting CEACAM6-bound E. coli can reduce AIEC-associated intestinal mucosal signal in CD models. This is direct precedent for the project's Pillar-4 phage-targetability strategy and validates E. coli (NB05 Tier-A #3 with 3 MIBiG virulence matches) as a feasible phage target. The AIEC-phage result combined with our NB06 finding that 5 of 6 actionable Tier-A candidates co-occur in a single module argues for multi-target phage cocktails rather than monovalent AIEC-only treatments — the pathobiont module would likely re-equilibrate around a remaining hub if only one species were targeted.
- FMT as causal-direction evidence: Sheikh 2024 (PMID 38532703) showed that CD-patient microbiome transplanted into germ-free mice produces colitis with discontinued-pattern, proximal colonic localization — the hallmark CD phenotype. This is direct evidence that the microbiome composition is causally sufficient to produce the disease phenotype, supporting the premise that a cocktail modifying that composition can have therapeutic effect.
- Iron-acquisition as a pathobiont-fitness axis in IBD (NB07 v1.8 §9 dominant theme grounding): The v1.8 finding that iron/heme acquisition is the dominant CD-up MetaCyc class-enriched theme (OR=8.1, FDR 7e-6) is causally consistent with broader IBD-iron literature. Ellermann 2020 (PMID 31179826) directly demonstrated that dietary iron variably modulates intestinal microbiota assembly in colitis-resistant and colitis-susceptible mice — establishing a mechanistic link between intestinal iron availability and microbiota composition that is particularly relevant for the well-known clinical observation that oral iron supplementation can exacerbate disease activity in a subset of IBD patients. Buret 2019 (PMID 30602371) reviewed how iron acquisition is associated with the expression of virulence factors in commensals that become pathobionts in IBD and CRC. Combined with the project's NB05 §5g E. coli MIBiG matches to Yersiniabactin + Enterobactin (siderophores) and the AIEC-iron literature already cited (Dalmasso 2021, Prudent 2021, Dogan 2014), the v1.8 iron-theme enrichment slots into a coherent multi-decade narrative: iron is not just a host nutrient but a pathobiont-selective resource, and CD pathobionts have systematically over-invested in iron-acquisition machinery (siderophores, heme uptake, heme biosynthesis as a precursor for haem-containing iron uptake / regulation systems). Therapeutic implication: per-patient iron-supplementation decisions should account for ecotype-specific pathobiont iron-system expression; phage targeting could specifically deplete the iron-system-enriched AIEC subset without affecting iron-pathway-deficient commensals.
- TMA / choline metabolism in H. hathewayi (NB07 v1.8 §9 species-resolved theme): H. hathewayi's TMA/choline theme enrichment (OR=9.33, FDR=0.048) is mechanistically consistent with the canonical bacterial choline-degradation pathway — the CutC glycyl radical enzyme breaks down choline to TMA in human gut microbes (Timsina 2025, PMID 40853000), and TMA is converted by the host to TMAO with established cardiovascular and renal disease implications (Cheng 2025, PMID 39709651; Jiang 2024 on phosphatidylcholine → TMA → TMAO axis). H. hathewayi's CD-up phosphatidylcholine acyl editing (NB07a §c) and choline degradation (v1.8 §9 04_TMA_choline theme) suggest H. hathewayi is functionally a TMA-producer pathobiont in CD samples, contributing to the systemic-inflammation / cardiovascular-comorbidity axis well-documented in IBD patients.
- AIEC strain-level specialization (NB07b within-carrier E. coli CD-DOWN reading): NB07b found E. coli per-pathway abundance is CD-down within carriers (opposite of cohort-level CD-up), affecting allantoin degradation, propanediol degradation, octane oxidation, histidine degradation, phospholipid remodeling — i.e., alternative-electron-acceptor + niche-specialization pathways. This pattern is consistent with CD-associated E. coli being an AIEC-specialized subset that has invested in iron + adherent-invasion machinery at the cost of generalist metabolic capabilities. Dubinsky 2022 (PMID 35560165) established that IBD E. coli lineages have disease-specific genomic adaptations; the within-carrier CD-down per-pathway pattern is the metagenomic-level signature of that strain-level specialization. Per plan v1.9, the cMD raw-reads-based strain-resolution test is dropped as not feasible; the alternative no-raw-reads path is genome-content survey (
kbase_genomespks + Yersiniabactin + Enterobactin co-occurrence query) cross-referenced withfact_strain_competitionKumbhari strain-frequency data. See Future Direction #9. - Butyrogenic cross-feeding embedded in CD pathobiont modules (NB07c §10): A. caccae is a well-characterized butyrate-producer that grows on lactate / acetate co-substrates and on host-mucin-derived sugars (Schwiertz 2002, Duncan 2004 establish A. caccae as a primary lactate utilizer in the colon). Its strong positive coupling with M. gnavus (mucin-glucorhamnan producer; Henke 2019), F. plautii (bile-acid 7α-dehydroxylating), H. hathewayi (lactate / TMA producer per v1.8 §9), and E. bolteae (sugar fermenter) maps cleanly onto known cross-feeding circuits in the gut. The cross-feeding hypothesis is not iron-mediated (NB07c §2: ρ(A. caccae × iron-pwy) ≪ ρ(E. coli × iron-pwy)) — consistent with A. caccae being a fermentative butyrate-producer rather than an iron-respiring organism. The therapeutic implication — that depleting the pathobiont cluster may incidentally reduce a butyrate-producer with potential anti-inflammatory contribution — fits the broader "ecological consequences of pathobiont depletion" framework that motivates the four-tier criteria rubric (Tier C, ecological durability) of the project.
- AIEC virulence BGC repertoire is genomically E. coli-specific within actionable Tier-A (NB08a §11): the Elmassry 2025 BGC catalog (10,060 BGCs across 6,221 species-annotated entries) provides direct genomic-content evidence that E. coli alone among the 6 actionable Tier-A core species carries the canonical AIEC virulence biosynthetic gene-cluster repertoire — 19 Yersiniabactin BGCs, 16 Enterobactin BGCs, 19 siderophore-class BGCs, 8 Colibactin BGCs, 15 Microcin B17 BGCs. This is consistent with: Dalmasso 2021 (Yersiniabactin biology), Prudent 2021 (LF82 siderophore-mediated phagolysosomal survival), Dogan 2014 (AIEC iron + propanediol enrichment), Veziant 2016 (Colibactin + colorectal carcinogenesis). The other 5 Tier-A core species (M. gnavus, E. lenta, E. bolteae, H. hathewayi, F. plautii) sit in MIBiG dark matter — they carry significant non-MIBiG-annotated BGC content (notably E. lenta 41 BGCs, M. gnavus 58 BGCs of which 31 are lanthipeptides), and their CB-ORFs are CD-vs-HC RPKM-enriched at the read level — but their CD-association mechanisms are not currently captured by MIBiG-themed BGC analysis. E. lenta's CB-ORF CD-DOWN pattern (54 % CD-down vs 0 % CD-up) is mechanistically consistent with Koppel et al. 2018 (PMID 29760174), which established that Eggerthella species' CD-association is driven by drug-metabolism activity (cardiac glycoside reductase Cgr2 / arginine-2,3-aminomutase) rather than secondary-metabolite biosynthesis — i.e., E. lenta's CD-association is functional via a non-BGC mechanism.
- ebf/ecf BGC families are CD-up across 4 cohorts at p<1e-31 (NB08a §11 Test 4): replicates the Elmassry 2025 (PMID 39837311 / Cell Host & Microbe) headline finding cleanly in our cohort-meta analytical framework. The "ebf" and "ecf" BGC families produce immunoactive fatty acid amide signaling lipids (similar in structure to host endocannabinoids) and were originally implicated by Elmassry et al. as a CD-defining functional signature. Our independent re-analysis on the cMD-style cohort architecture (1,349 samples × HMP2-IBDMDB / MetaHIT / LLDEEP / PRISM) yields meta z = 11.71 (ebf) and z = 11.97 (ecf) — among the largest effects in the project. The ebf/ecf signature is a sample-level CD biomarker (no per-species attribution available), suitable for treatment-response monitoring in clinical follow-up but not for cocktail target selection.
- Bile-acid 7α-dehydroxylation network identified at sample level (NB09c §13): F. plautii, E. lenta, and *E. bolteae show the predicted substrate-product signature in 468 paired CSM HMP2 samples — negative correlation with primary tauro-conjugated bile acids (substrates: cholate at ρ=−0.26 for F. plautii) and positive correlation with secondary unconjugated bile acids (products: lithocholate at ρ=+0.18 for E. bolteae, +0.15 for F. plautii; deoxycholate +0.17 for E. bolteae). M. gnavus and E. coli show the opposite pattern, confirming they are NOT in the 7α-dehydroxylation network. This is the direct sample-level confirmation of the canonical bile-acid 7α-dehydroxylation deficit in CD (Franzosa 2019; Devlin & Fischbach 2015 PMID 26412091 establish the bai operon mechanism in Clostridium scindens-like bacteria), and it identifies F. plautii / E. lenta / E. bolteae as the active 7α-dehydroxylating species in the actionable Tier-A set. This finding is independent of and stronger than the NB07b within-carrier F. plautii F420-bile-acid pathway DA — pathway DA confirms the genomic capability; NB09c paired correlation confirms the active enzymatic in vivo activity. The two are different evidence streams converging on the same mechanism.
- Polyamine + long-chain PUFA metabolite-pool elevation in CD (NB09a §12): polyamines (Fisher OR=14.6, FDR 0.008) and long-chain PUFAs (OR=7.9, FDR 0.009) are the two CD-up theme-level metabolite signatures in HMP2 at the subject level (50 CD vs 26 nonIBD). Polyamines (putrescine, spermine, spermidine, N-acetyl variants, anserine) are produced by gut bacterial fermentation of arginine / ornithine and are well-characterized IBD biomarkers (Pegg 2014 Pharmacol Res; Wang 2018; Franzosa 2019 PMID 30531976). The metabolite-pool elevation here is not anti-correlated with the v1.8 §9 pathway-level polyamine_urea CD-DOWN finding — pool ≠ flux: metabolite accumulation can result from increased catabolic substrate availability (dietary protein / mucin) plus reduced microbial polyamine clearance, without requiring elevated biosynthesis flux. Long-chain PUFAs (arachidonate, adrenate, DHA, DPA, EPA) reflect the inflammatory eicosanoid precursor pool — arachidonate is canonical for prostaglandin / leukotriene biosynthesis. The cholesteryl-ester C20:4 / C20:5 / C18:2 elevation in lipid_classes (19 CEs CD-up) corroborates this as the storage form of esterified PUFAs. The CD-up PUFA signal is mechanistically connected to v1.8 §9 fat-metabolism / glyoxylate theme (which was sub-threshold at pathway-level OR=0.88 but theme-significant at metabolite-level here). Tauro-α/β-muricholate + free taurine CD-up is the metabolite-level confirmation of reduced microbial bile-acid 7α-dehydroxylation, mechanistically supporting NB05's actionable F. plautii (a known 7α-dehydroxylating species) as a phage-target with a metabolite-level pre-treatment biomarker.
Novel contributions
- Cross-method ARI as a K-selection criterion when per-method fit is monotonically decreasing in K. Documented as a generalizable methodology note in
docs/discoveries.md. - OvR-AUC / per-patient agreement gap as a diagnostic for classifier-utility overstatement when a cohort-axis variable dominates features.
- Kaiju ↔ MetaPhlAn3 projection asymmetry — LDA robust, CLR+GMM fragile. Directly relevant to any multi-classifier microbiome pipeline.
- Project-wide synonymy layer as a reusable artifact for multi-cohort microbiome work.
- Four-ecotype IBD framework with disease-stratifying signal on 8.5 K samples — reproduces published enterotype structure with improved disease resolution (E1 transitional vs E3 severe within Bacteroides-dominant).
- cMD substudy × diagnosis nesting as a structural-unidentifiability finding. In the ecotype-assigned slice of curatedMetagenomicData, 45 sub-studies have ≥ 10 HC samples and 5 have ≥ 10 CD samples, but zero have both. A pooled CD-vs-HC LME with substudy random effect is therefore structurally unidentifiable — empirically verified by
statsmodels.mixedlmsilently failing to converge on every battery species. The confound-free alternative is within-IBD-substudy CD-vs-nonIBD (4 cMD studies carry both groups). This pattern applies to any pooled public-dataset case-vs-control analysis where the case and control cohorts were collected by different groups. Documented indocs/pitfalls.md. - Feature leakage in cluster-stratified DA as a general methodological hazard. Clustering samples by taxon abundance and then running DA on the same taxa within cluster is selection-on-outcome confounding — within-cluster effect sizes are mechanically inflated for cluster-defining taxa. Detectable via held-out-species sensitivity (bound: Jaccard > 0.5 = leakage bounded) or leave-one-species-out refit. Our NB04b measurements (E1 Jaccard 0.230, E3 Jaccard 0.064) confirmed the NB04 within-ecotype Tier-A was substantially leakage-driven. Analogous bug in single-cell DE: clustering on gene expression then testing gene DE within cluster. Documented in
docs/pitfalls.md. - Within-ecotype × within-substudy meta-analysis as the confound-free stratified design. NB04e establishes the analysis form that simultaneously (a) eliminates feature leakage (clustering axis and DA axis are disjoint — samples are partitioned by ecotype, CLR-Δ is computed across sub-studies within a partition) and (b) eliminates study confounding (within-substudy contrast has no study-level variation). The design fails gracefully when (substudy × ecotype × diagnosis) cells are too small and reports explicitly which ecotypes are meta-viable, single-study-only, or not viable. This is the methodological contribution most directly portable to other disease-microbiome projects.
- Adversarial review as a required complement to
/berdl-reviewon methodologically nuanced projects. Two independent/berdl-reviewruns on the pre-rigor-repair NB04 state concluded "no critical issues"; an adversarial reviewer (general-purpose Agent with explicit "find flaws" framing) caught 5 critical + 6 important issues, all empirically confirmed by NB04b + NB04c. Full arc and methodology recommendations inFAILURE_ANALYSIS.mdanddocs/discoveries.md. - LOSO ARI as a more honest ecotype stability metric than bootstrap ARI. Bootstrap 80 %-subsample ARI (NB04b §7) reported 0.13–0.17 on this data; LOSO ARI across 8 independent sub-studies (NB04f) reported 0.00–0.28 with mean 0.113, revealing per-substudy variation that bootstrap masks. For any clustering framework intended for cross-cohort use, LOSO should be the reported stability metric.
- Operationally-validated-Tier-A despite framework-variance pattern. NB04f + NB04g show real cross-study and cross-feature-basis ecotype variance; NB04h shows the NB04e operational Tier-A replicates at 88.2 % sign concordance on HMP2 with high projection confidence. The generalization: framework stability and operational-claim replication are distinct properties that need separate tests. A project can have "shaky cluster boundaries but robust cluster-specific findings" — which is what this project has, and is what NB05 should operate on.
- Category-schema choice as a load-bearing methodological variable (NB07 v1.7 → v1.8). On the same data and same DA pipeline, a regex-on-pathway-names scheme (44/409 pathways categorized, 7 themes) gave H3a (b) "FAIL — degenerate"; a curator-validated MetaCyc class hierarchy from ModelSEEDDatabase (514/575 pathways categorized, 12 themes) gave H3a (b) "SUPPORTED" with iron/heme acquisition as an 8.1× enriched dominant theme. Same data, opposite verdict — driven entirely by ontology choice. Lesson: prefer ontology / class hierarchy over name-pattern regex (plan norm N17). Documented in
docs/discoveries.mdandRESEARCH_PLAN.mdv1.8. - Module-level metabolic-coupling-cost annotation as a Tier-A scoring extension (NB07c §10). NB06 H2d showed pathobionts co-cluster in single CD-specific modules. NB07c shows non-pathobiont module anchors (specifically A. caccae, the only genuine butyrate-producer) have strong positive species-level coupling with the pathobiont set, consistent with cross-feeding. The implication for cocktail design — that pathobiont depletion may incidentally remove anti-inflammatory commensals through loss of substrate — is a per-target ecological-cost annotation that should sit alongside Tier-A scoring (A3–A6) in NB05's output. This is the species-pair-level extension of the H2d single-module finding and is portable to other microbiome-targeting projects (e.g., FMT, antibiotics).
- Five-line cross-corroborated iron-acquisition narrative (NB05 §5g + NB07a §c + NB07 v1.8 §9 + NB07c §2 + NB08a §2). The same biological claim — "AIEC iron-acquisition is the dominant CD pathobiont specialization" — is now supported from five independent evidence streams across three distinct analytical granularities: literature-MIBiG lookup, sample-level pathway × species correlation, cohort-level pathway-class enrichment, sample-level pathway × species co-variation, and genomic BGC content. Each test could fail independently; the convergence is the rigor signal. This level of within-project cross-corroboration on a single biological claim is a portable methodology pattern for any "is this signal real?" question in microbiome research — design tests at multiple analytical granularities, treat each as an independent line, and require convergence rather than a single-test pass.
- Pathway-level vs metabolite-level signal can diverge in direction (NB09a polyamine pool ≠ flux). v1.8 §9 found
06_polyamine_ureawas CD-DOWN at pathway-level (OR=0.42, biosynthesis flux); NB09a §12 found polyamines as a metabolite-class are CD-UP at OR=14.6 (the largest theme-level effect in the metabolomics analysis). Both are correct: the metabolite pool reflects the difference between production and consumption rates, plus dietary/host inputs. Pool measurements and flux measurements are not interchangeable; both belong in the analysis when both are available. This is generally true for any metabolite-class comparison — biosynthesis pathway DA and untargeted metabolomics DA can disagree without contradicting each other. Flag indocs/discoveries.mdas a methodology note for any future BERIL multi-modal analysis. - Bile-acid coupling cost replaces metabolic-coupling cost as primary Pillar 4 ecological annotation (NB09c §13). The NB07c "metabolic-coupling cost" annotation was based on species-level co-occurrence; NB09c's paired sample-level test failed to identify a metabolic intermediate that supports cross-feeding (lactate has opposite signs between A. caccae and the pathobionts; only 7 strict triangles total). Instead, NB09c identifies the bile-acid 7α-dehydroxylation network as the dominant per-target ecological-cost axis: F. plautii / E. lenta / E. bolteae carry active 7α-dehydroxylation activity in HMP2 paired samples. Phage targeting of these species would shift the bile-acid pool toward inflammatory primary tauro-conjugated forms. This is a mechanistically grounded cost annotation (substrate-product signature is detectable at sample level) that is more rigorous than the original NB07c "coupling cost" framing.
- Two independent six-line cross-corroboration narratives in the same project. The iron-acquisition narrative (Novel Contribution #14) and the bile-acid 7α-dehydroxylation narrative (NB09c §13) are independent multi-line convergence stories in this dataset. Iron: NB05 §5g + NB07a §c + NB07 v1.8 §9 + NB07c §2 + NB08a §2 + AIEC literature → 6 lines. Bile-acid: NB07b F420-pathway + NB09a tauro-muricholate + NB09c paired ρ + NB05 actionable status + Franzosa 2019 + Devlin/Fischbach mechanism literature → 6 lines. Project-level rigor signal: any biological claim with multi-line convergence across granularities is robust against single-test failures. Two such narratives in the same dataset, each independently developed, validates the methodology pattern beyond a single example.
- Species-abundance-mediated vs strain-content-mediated CD-association is a distinguishable mechanism axis (NB10a F. plautii informative null). NB10a finds zero FDR<0.10 strain-adaptation genes for F. plautii in Kumbhari (3,245 genes tested), despite F. plautii having confirmed CD-association at species level (NB04e + NB05) and at active-mechanism level (NB09c bile-acid 7α-dehydroxylation). This null is biologically meaningful: it tells us F. plautii's CD-association is mediated by how much F. plautii is present, not by which F. plautii strain is dominant. The 7α-dehydroxylation activity is presumably encoded by core bai-operon genes that are present in essentially all strains. Implication for Pillar 4 cocktail design: phage targeting of species with this profile produces predictable depletion of the encoded activity (no within-species strain-content escape route via gene-content selection). This is a third granularity that distinguishes phage-target candidates beyond "actionable Tier-A score" (NB05) and "ecological coupling cost" (NB09c) — a mechanism-resolution profile that places each Tier-A pathobiont on a species-abundance-mediated ↔ strain-content-mediated axis. Independently corroborates NB07b within-carrier reading; methodology generalizable to any future strain-level analysis where the pre-test prediction is "if mechanism is genus-conserved, expect zero strain-adaptation signal."
- Cross-cohort metabolomics m/z-bridge clustering is dominated by cohort batch effects (NB09d). Pooling HMP2 + Franzosa metabolomics on a 111-feature m/z-bridge panel and clustering with PCA + K-means K=4 produces clusters that separate completely by cohort, not by diagnosis. PC1 (79 % variance) is essentially the cohort batch effect. Cross-cohort LOSO ARI = 0.000, well below the 0.113 taxonomic-ecotype baseline (NB04f). The taxonomic-feature ecotype framework was naturally cross-cohort-portable because MetaPhlAn3 relative-abundance values are unitless and compositionally constrained (sum-to-1 per sample) — that constraint provides natural cohort normalization. Metabolite-feature clustering inherits absolute-intensity scale differences between LC-MS runs and requires explicit batch correction (ComBat/SVA/RUV) prior to clustering. Generalizable rule: compositional / relative-abundance feature spaces (taxonomy, pathway-fraction, MAG-fraction) are usually cross-cohort-portable as-is; absolute-intensity feature spaces (mass-spec, RNA-seq counts, protein abundance) are NOT cross-cohort-portable without batch correction. Within-pooled bootstrap stability metrics are misleading in this regime — they measure batch reproducibility, not biological reproducibility. Methodology note for any future BERIL multi-cohort analysis on absolute-intensity modalities.
- NB09b cross-cohort metabolomics replication establishes 9 strict + 3 theme-level cross-cohort replications (HMP2 → FRANZOSA_2019 m/z-bridge, ±0.005 Da). Three IBD-relevant chemical-class themes pass the ≥75 % sign-concordance threshold: urobilin/porphyrin (100 %, CD-DOWN), acyl-carnitines (80 %, CD-up), long-chain PUFAs (75 %, CD-up). The urobilin signal (median Franzosa cliff −0.747 — strongest replicated effect in the project) reflects loss of bilirubin-reducing commensals (Hall 2024, Vital 2018) cross-cohort. Methodological insight: BA-pool cohort-aggregate DA is weaker (primary 57 %, secondary 40 %) than within-cohort paired sample-level evidence (NB09c §13 substrate-product signature). The takeaway for multi-cohort metabolomics: cohort-aggregate DA and within-cohort paired correlation are different evidence streams; one can be weaker without invalidating the other. Both belong in the analysis when both are available.
- Multi-pillar mechanism narratives collapse to a single multi-omics joint factor (NB07d CC1). After eight Pillar 3 hypothesis tests producing the iron-acquisition / bile-acid / polyamine / PUFA / urobilin / ebf-ecf narratives in separate analyses, a single CCA canonical pair (CC1, r=0.96, cliff CD-vs-nonIBD = +0.50 p=4e-4) on HMP2 paired taxonomy + metabolomics recovers all of these signatures simultaneously as a single joint axis: all 6 actionable Tier-A core species load CD-positive; urobilin loads CD-negative; polyamines / PUFAs / fatty-acid-amides / sphingolipids / cadaverine / N-acetylputrescine all load CD-positive; secondary BAs load CD-negative. The CD-vs-nonIBD biological state at the species + metabolite level is a single principal direction in joint multi-omics space, not a many-axis manifold. This is a portable expectation for multi-omics IBD analyses: when separate single-modality analyses produce concordant directional signals (CD-up species + CD-up metabolites + CD-down protective metabolites), the joint factor decomposition will collapse them into one dominant axis. Independent rediscovery via canonical correlation provides a powerful sanity check on whether the per-modality analyses are converging on the same biology vs measuring orthogonal phenomena. For this project: CC1 is the operational definition of "CD biology in the HMP2 cohort" for Pillar 4–5 cocktail-design / clinical-translation purposes.
- 3-layer phage-evidence convergence as a Pillar-4 rigor pattern (NB12 + NB13 + NB14). Pillar 4 produces three independent phage-evidence layers — curated literature (NB12 ref_phage_biology), experimental susceptibility (NB13 PhageFoundry strain_modelling 96 phages × 188 E. coli strains × 17,672 pairs), in-vivo phageome (NB14 HMP2 fact_viromics 630 samples) — that converge on the same per-target classification: where one layer shows phage-availability for a given Tier-A pathobiont, all three layers tend to show it (E. coli aligned across layers); where one layer shows GAP (H. hathewayi / F. plautii), the others confirm GAP. The 3-layer convergence is the Pillar-4 rigor signal, analogous to Novel Contribution #14 (5-line iron-acquisition narrative) and #17 (2 cross-corroborated narratives) applied to phage-therapy feasibility. Generalizable: any phage-cocktail design project should triangulate across (a) literature curation, (b) experimental host-range data, (c) in-vivo phageome observation — all three are necessary conditions, none sufficient. The convergent target classification (Tier-1 / Tier-2 / Limited / GAP) is what survives the triangulation.
- Hybrid cocktail necessity in IBD ecotypes — pure phage cocktail is not feasible for E1 (NB15 Pillar 5 opener). Per-patient cocktail design for 23 UC Davis CD patients reveals that all 9 E1 patients carry the full 5-species Tier-A pathobiont module (H. hathewayi, F. plautii, E. bolteae, E. lenta, M. gnavus), but only 3 of those 5 species have direct lytic phage options at present (E. lenta PMBT5, E. bolteae PMBT24, E. coli AIEC if present). The remaining 2 (H. hathewayi, F. plautii) are Pillar-4 phage-coverage GAP. A pure phage cocktail covering the full E1 pathobiont module is therefore structurally infeasible; cocktail design must be a 3-strategy hybrid: (1) direct phage targeting (where lytic phages exist), (2) alternative therapies (GAG-degrading enzyme inhibitors for H. hathewayi; BA-binding co-therapy for F. plautii), (3) limited / engineered (lytic-locked phage engineering or biochemical glucorhamnan-synthesis target for M. gnavus). Generalizable observation for any IBD-phage-therapy program: the gut-anaerobe phage-availability gap is the structural ceiling on phage cocktail completeness; pure phage cocktails work for Enterobacteriaceae-dominated dysbiosis (E. coli AIEC subset where phage diversity is high — EcoActive precedent) but the dominant gut-anaerobe pathobionts (H. hathewayi, M. gnavus, F. plautii) require alternative or hybrid strategies. This is a portable rule for any future microbiome-therapy program targeting gut anaerobes; the project's NB12-NB14 3-layer phage-evidence stack provides the methodology for identifying GAP species, and NB15 establishes the per-patient hybrid framework.
- Ecotype drift drives non-trivial cocktail re-design + qPCR proxy as cheap clinical monitoring (NB16 Pillar 5 longitudinal). Patient 6967 — the only multi-timepoint UC Davis CD patient with biological-replicate samples — shows clear E1→E3 ecotype drift across 2 visits with M. gnavus 14× expansion (0.53 → 7.45) as the dominant signature; cocktail Jaccard between visits = 0.60 (3 shared, 2 visit-1-only). Patient 1112 (2 reseq replicates of the same biological sample) gives Spearman ρ = 1.000 across Tier-A — Kaiju calls are reliable across reseq replicates, validating the technical-noise floor for the longitudinal contrast. Generalizable rule for IBD-phage-therapy clinical workflow: (a) ecotype is dynamic, not static, so re-test every 3-6 months; (b) the universal Tier-1 trio (M. gnavus, H. hathewayi, E. lenta) spans both E1 and E3, so it is the cocktail backbone that does not need re-evaluation on ecotype shift; (c) F. plautii inclusion is E1-specific (drop on E1→E3); E. coli inclusion is E3-specific (subject to AIEC strain detection on E3→E1); (d) M. gnavus qPCR is a candidate cheap clinical proxy for ecotype-state monitoring, avoiding the need for full metagenomics at every visit — a 5-fold change in M. gnavus abundance triggers full ecotype re-test. This is the operational state-dependent dosing rule for the project, derived from a single longitudinal trajectory but mechanistically interpretable through the NB01b ecotype framework biology and the NB07c per-ecotype module structure. n=1 limits statistical generalization; the rule is hypothesis for prospective validation, not established clinical practice.
Limitations
- Ecotype framework has real cross-study variance within cMD (NB04f). Bootstrap ARI 0.13–0.17 (NB04b §7) understated the instability; LOSO ARI across the top 8 sub-studies is mean 0.113, range [0.000, 0.282]. Some sub-studies (LifeLinesDeep 0.21 / 85 % agreement, HansenLBS 65 %) align well; others (AsnicarF 38 %, VilaAV 17 %) do not. Part of this is the LDA↔LDA+GMM-consensus comparison baseline (the NB01b consensus itself has only 48.9 % cross-method agreement), but the substudy-to-substudy variance is real and the "four reproducible ecotypes" framing must be qualified. Mitigating: NB04h HMP2 external projection recovers non-random disease stratification (χ² p=0.016) with high projection confidence (80 % of samples have max posterior > 0.70), and the operational E1 Tier-A replicates at 88.2 % sign concordance — so the framework is usable even though its boundaries aren't bit-reproducible across studies.
- Pathway-feature refit only partially recovers ecotype structure (NB04g). ARI 0.113 between pathway-based K=4 LDA and taxon-based consensus_ecotype on 3,145 CMD_IBD samples. E1 is most recoverable (65.3 % agreement); E3 weakest (30.7 %), reinforcing the provisional flag on E3. The ecotype structure is mixed ecological + taxonomic — recoverable to some degree from a disjoint feature basis, but not fully.
- E3 Tier-A is single-study evidence within cMD; partially rescued by HMP2 but E3 is rare in HMP2. NB04e's E3 list (40 candidates) derives from HallAB_2017 only. NB04h's HMP2 projection places only 10 of 130 subjects in E3 (vs 106 in E1), so HMP2 cannot provide a second-substudy E3 Tier-A meta-analysis. E3 Tier-A should be treated as provisional until a sub-study with sufficient E3 + both diagnosis groups becomes available.
- E0 and E2 have no viable Pillar 2 analysis in cMD. These are the healthy-cohort ecotypes (E0 is 66.8 % of HC; E2 is the P. copri enterotype, almost entirely non-Western healthy). No IBD sub-study in cMD populates them with nonIBD controls, so the confound-free CD-vs-nonIBD contrast is not computable. If UC Davis has E0 patients (27 % of the cohort), the Tier-A for those patients must be drawn from cross-ecotype cohort-level evidence (the 5 engraftment-confirmed pathobionts) or from ecotype-agnostic within-substudy CD-vs-nonIBD.
- HMP2 MetaPhlAn3 not yet ingested (
PENDING_HMP2_RAWinlineage.yaml). HMP2 reingestion will (a) add a second IBD sub-study to the E3 stratification, unblocking Tier-A replication; (b) expand CMD + HMP2 to ≈ 11.5 K samples for a stability-improved ecotype refit; (c) enable HMP2 serology × ecotype integration (plan H3e). - UC Davis n = 23 patients. The per-patient cocktail drafts (Pillar 5) are informed by a small cohort; generalization requires validation on external CD cohorts.
- Kaiju vs MetaPhlAn3 classifier mismatch (NB02). Limits confidence in UC Davis ecotype calls; LDA is more trustworthy than GMM here. Documented in
docs/discoveries.md. - Within-ecotype disease-vs-HC training data is limited for E2 and E0 in the extended classifier subset (3 and 11 samples respectively). The extended classifier is effectively an E1-vs-E3 binary in practice.
- Ecotype calls are hard cluster assignments of an underlying continuum (Costea 2018). Soft probabilities are preserved in
data/ecotype_assignments.tsvfor downstream use; hard calls should be treated as operational labels, not biology. - Multi-method DA consensus is partial. The plan called for ≥ 2 / 3 of {ANCOM-BC, MaAsLin2, LinDA} methods to agree. We implemented LinDA in pure Python (NB04c §4) to avoid the R/rpy2 dependency; the NB04c bootstrap-CI and NB04e within-substudy-meta outputs serve as additional independent evidence streams but are not drop-in ANCOM-BC / MaAsLin2 substitutes. A full three-method R-native consensus is a publication-grade follow-up.
- Bootstrap-stable within-ecotype DA shares the feature-leakage bias with CLR-MW. Both CLR-MW (NB04) and LinDA (NB04c) operate on the same ecotype-defined subsamples and therefore share the selection-on-outcome bias. Only the within-substudy CD-vs-nonIBD contrast (NB04c §3, NB04e) is an independent evidence source. The NB04d Tier-A gating requires within-substudy concordance precisely because bootstrap + LinDA are not independent evidence streams.
- Pillar 2 cross-cohort replication is weak. Single-study E3 evidence combined with marginal ecotype stability means the Tier-A list (particularly E3) should be treated as hypotheses for further experimental validation rather than established targets. Pillar 5 UC-Davis cocktail drafts will inherit this limitation; the NB15+ notebooks should annotate per-candidate cross-cohort support explicitly.
- NB07c cross-feeding hypothesis NOT supported by paired sample-level metabolite-metagenomic data (NB09c §13). Originally flagged as a key disambiguation deferred to NB09c. Resolved: only 7 strict cross-feeding triangles emerge; lactate × A. caccae (+0.18) has the OPPOSITE sign of lactate × F. plautii (-0.23) and lactate × E. bolteae (-0.20), inconsistent with a lactate-mediated cross-feeding loop. The NB07c species-level coupling is best interpreted as shared-environment co-occurrence in healthy / normobiotic samples. Cocktail-design implication: phage targeting of pathobionts is unlikely to substantially deplete A. caccae via metabolic coupling.
- NB07c E3_CD module shows weaker coupling than E1_CD. The two oral Actinomyces anchors of E3_CD module 1 couple weakly (~ρ=0.17–0.19) with module pathobionts. Consistent with E3 being characterized by inflammation-driven oral-gut ectopic colonization rather than metabolic structure — but means the cocktail-design implication from NB07c is E1-specific and may not transfer to E3 patients. UC Davis E3 patients (31 % of the cohort) may need a different module-anchor cocktail-design framework.
- Iron-theme narrowing to E. coli (NB07c §10) is correlation-based — derived from species × pathway Spearman ρ across CMD_IBD samples. The genomic interpretation (E. coli owns the iron pathways) is consistent with the v1.8 finding that 8 of 15 iron pathways are menaquinol biosynthesis (canonical bacterial respiratory quinones, not strictly E. coli-specific) but the strongest signature pathways (ENTBACSYN-PWY, HEMESYN2-PWY) are E. coli-canonical. NB08a §11 now provides direct genomic content evidence that E. coli alone among Tier-A core carries iron-siderophore MIBiG matches (54 BGCs vs 0 for the other 5 actionable species). HUMAnN3 species-stratified per-pathway attribution (
*_genefamilies.tsv) would extend this finer but requires raw reads → dropped per plan v1.9; NB08a §2 genomic BGC content is the equivalent fine-grained attribution path within the project's no-raw-reads scope. - NB08a strict H3c interaction-term test is untested and dropped per plan v1.9. The original H3c hypothesis specified "species × BGC interaction term in within-IBD-substudy regression" as the falsifiability test. NB08a uses the precomputed
ref_cborf_enrichmenttable (CD-vs-HC main effect, not species × BGC interaction) for Test 3. The CB-ORF CD-up enrichment per Tier-A core species is consistent with H3c but does not formally distinguish "species × BGC interaction" from "species main effect + BGC main effect." The interaction-term test would require species-stratified per-sample BGC abundance derived from raw HUMAnN3 / antiSMASH outputs — not available without raw reads; dropped per plan v1.9 as a structural project-scope limitation, not a deferred follow-up. NB08a's main-effect-only design is the operational test for this project. - NB08a Tier-A "background" comparator is the full BGC catalog (10,060 BGCs across all species), not a matched comparator. The Fisher OR of 44× for iron_siderophore is partly inflated by the catalog-wide rarity of iron MIBiG matches (51 / 9,774 = 0.5 %). A more conservative comparator would be a "matched-niche" pathobiont set (gut Proteobacteria + CD-associated Firmicutes) at similar genome-assembly depth. The 44× number should not be treated as a precise effect size, but the qualitative finding — E. coli alone among actionable Tier-A carries the iron-siderophore signature — is robust across choice of comparator.
Pillar 4 limitations
- Three of six actionable Tier-A core have no lytic phage in BERDL
ref_phage_biologycuration (NB12). H. hathewayi and F. plautii are GAP; M. gnavus is temperate-only. INPHARED + IMG/VR external DB queries are the highest-priority external-data extension before clinical translation but are out of scope per plan v1.9 (BERDL-internal only). The "concrete cocktail draft" for E1 patients is a hybrid 3-strategy framework, not a pure phage cocktail — this is a structural property of gut-anaerobe phage availability, not a project limitation per se. - PhageFoundry collection is E. coli-rich but other-species-sparse (NB13). 96 phages × 188 E. coli strains × 17,672 susceptibility pairs is sufficient to design a 5-phage cocktail at 95 % strain coverage for E. coli AIEC. The other 5 actionable Tier-A species are not in
phagefoundry_strain_modelling; their phage availability rests onref_phage_biologyliterature curation alone. - NB13 5-phage cocktail covers PhageFoundry strains, not UC Davis patient isolates. The 95 % strain-coverage figure is on the 188 PhageFoundry-tested E. coli strains. Per-patient applicability requires AIEC strain-resolution diagnostic on UC Davis patient stool isolates, which is queued in NB17 Near-term roadmap.
- HMP2 viromics has 80 % "Unknown" family classification (NB14). VirMAP family-level classification gap means most phage observations cannot be linked to a target host. The Tier-A × phage-family correlations (E. coli × Podoviridae +0.18, × Myoviridae +0.13) are restricted to the classifiable 20 %. Family-resolution improvement requires VirMAP/MARVEL re-running with newer phage reference databases — out of scope.
- NB14 phageome × ecotype × diagnosis effect sizes are modest (max |ρ|≤0.18). The endogenous phageome is a noisier signal channel than species-level taxonomy (NB04e effect sizes routinely |CLR-Δ|>1.0). The 3-layer phage-evidence convergence rests on directional concordance, not effect-size matching across layers.
Pillar 5 limitations
- 23-patient UC Davis cohort is small (NB15). Per-patient cocktail recommendations are exemplars/templates, not statistically robust per-patient validation. Generalization requires multi-center validation. The cocktail-strategy distribution (12 reserve / 4 E1 hybrid / 4 E0 limited / 1 E1 full / 1 E3 / 1 state-dependent) is the operational stratification on this cohort and should be revisited on a larger CD cohort.
- No per-patient bile-acid measurements in UC Davis cohort (NB15). F. plautii BA-coupling-cost annotation is ecotype-level (NB09c §13 paired HMP2 samples), not per-patient. Clinical translation requires per-patient bile-acid panels for monitoring + dosing decisions.
- No per-patient AIEC strain-resolution diagnostic in the current cohort. The 8 / 23 E. coli-positive patients carry detectable E. coli by Kaiju but the cocktail recommendation (NB13 5-phage AIEC cocktail) assumes AIEC-subset prevalence per Dogan 2014 / Dubinsky 2022. Per-patient AIEC genotyping (pks-island + Yersiniabactin + Enterobactin) is a Near-term clinical-translation prerequisite (NB17 roadmap).
- State-dependent dosing rule is n=1 (NB16). Patient 6967 is the only multi-timepoint UC Davis patient with biological-replicate samples. The 5 dosing rules + clinical workflow are mechanistically interpretable through the NB01b ecotype framework biology and the NB07c per-ecotype module structure, but prospective validation requires expanded longitudinal sampling. Patient 1112 tech replicate Spearman ρ=1.000 validates the technical-noise floor for the longitudinal contrast but is not a biological-replicate longitudinal trajectory.
- No timing information between patient 6967 visits (NB16). The duration of the E1→E3 drift is unknown, limiting clinical-workflow timing recommendations (the "3-6 month re-test" interval is a clinical convention default, not derived from data).
- M. gnavus qPCR proxy is a hypothesis, not an established assay (NB16, Novel Contribution #24). The 14× expansion in patient 6967's E3 transition suggests a 5-fold change might be the threshold for triggering full ecotype re-test, but this is derived from a single trajectory. Validation requires a prospective cohort with paired qPCR + metagenomics across timepoints.
- Cocktail Jaccard 0.60 (NB16) generalizability is unknown. Whether 3-of-5 component overlap implies clinically-meaningful cocktail re-design depends on patient response heterogeneity, which is out of project scope.
- NB17 patient design categories use calp threshold of 250 μg/g to split active vs quiescent, which is a clinical convention but not validated against per-patient response data. Some patients with calp 41–248 μg/g (currently in C_quiescent) may benefit from active cocktail; the threshold should be re-evaluated against clinical response in a pilot.
Data
Sources
| Collection | Tables used | Purpose |
|---|---|---|
~/data/CrohnsPhage (local star-schema mart, v10, schema v2.4) |
dim_samples, dim_participants, fact_taxon_abundance, fact_clinical_longitudinal, ref_taxonomy_crosswalk, crohns_patient_demographics.xlsx |
Sample / participant metadata, MetaPhlAn3 + Kaiju taxonomy, severity markers, UC Davis demographics |
kbase_ke_pangenome |
queued for NB04+ | Pangenome / AMR / GapMind — Pillar 2/3 functional analysis |
kescience_mgnify |
queued for external validation | Independent IBD-cohort cross-check |
phagefoundry_strain_modelling, phagefoundry_ecoliphages_genomedepot, phagefoundry_klebsiella_* |
queued for NB12+ | Phage-host interaction + coverage matrix — Pillar 4 |
kescience_fitnessbrowser |
queued for NB13 | Phage-resistance fitness-cost inference — Tier-C C3 |
kescience_paperblast, kescience_pubmed |
queued for NB05 | Literature-linked mechanism — Tier-A A3 |
kescience_bacdive |
queued for Pillar 2/3 | Strain phenotype context |
Generated data
| File | Rows | Description |
|---|---|---|
data/species_synonymy.tsv |
2,417 | Alias → canonical species map (project-wide synonymy layer) |
data/ecotype_assignments.tsv |
8,489 | K = 4 consensus ecotype per CMD sample with LDA + GMM calls and methods_agree flag |
data/ucdavis_kuehl_ecotype_projection.tsv |
26 | Per-sample Kuehl projection onto the K = 4 embedding |
data/ucdavis_patient_ecotype_summary.tsv |
23 | Per-patient ecotype call merged with clinical covariates |
data/ucdavis_clinical_ecotype_prediction.tsv |
23 | Classifier-only ecotype predictions for agreement testing |
data/nb00_protective_species_da_comparison.tsv |
14 | Raw-MW vs CLR-MW calls for the protective-species battery |
data/nb04_h2c_protective_battery.tsv |
15 | (superseded) Per-species verdict across pooled raw / pooled CLR / E1 CLR / E3 CLR; kept for audit |
data/nb04_tier_a_candidates.tsv |
33 | (retracted) Original NB04 within-ecotype Tier-A; kept for audit |
data/nb04_da_ecotype_1.tsv |
248 | (superseded) Full within-E1 CD-vs-HC CLR-MW DA at FDR < 0.1 |
data/nb04_da_ecotype_3.tsv |
201 | (superseded) Full within-E3 CD-vs-HC CLR-MW DA at FDR < 0.1 |
data/nb04_da_pooled_clr.tsv |
321 | Pooled CLR MW for reference |
data/nb04b_battery_bootstrap_ci.tsv |
45 | 14-species battery × {pooled, E1, E3} bootstrap CIs with TOST-equivalence verdict |
data/nb04b_battery_LOO.tsv |
26 | Leave-one-species-out refit verdicts for the battery |
data/nb04b_held_out_sensitivity.tsv |
10 | Held-out-species sensitivity Jaccards (leakage bound) |
data/nb04b_tier_a_refined.tsv |
33 | NB04b bootstrap-CI refinement of NB04 Tier-A (intermediate) |
data/nb04c_within_substudy_cd_nonibd.tsv |
152 | Per species × substudy CD-vs-nonIBD bootstrap CI |
data/nb04c_within_substudy_meta.tsv |
38 | Cohort-level IVW meta-analysis across 4 IBD substudies |
data/nb04c_linda.tsv |
1,005 | LinDA bias-corrected per species × {pooled, E1, E3} |
data/nb04c_lme.tsv |
0 | Empty — documents that pooled CD-vs-HC LME is structurally unidentifiable |
data/nb04c_tier_a_refined.tsv |
33 | 3-way-evidence refinement (bootstrap + LinDA + within-substudy) |
data/nb04d_stopping_rule_verdict.json |
— | Per-ecotype stopping-rule verdict + NB05 input set |
data/nb04e_per_cell_DA.tsv |
1,005 | Per (ecotype × substudy) cell within-ecotype × within-substudy DA |
data/nb04e_within_ecotype_meta.tsv |
670 | Rigor-controlled Tier-A — per-ecotype meta-analysis across eligible substudies |
data/nb04e_option_A_viability.json |
— | Option A (within-ecotype × within-substudy) viability verdict |
data/nb04f_loso_stability.tsv |
8 | Per-substudy LOSO ARI + agreement |
data/nb04f_loso_verdict.json |
— | Formal LOSO verdict (mean ARI 0.113 — "study-dependent") |
data/nb04g_pathway_ecotype_assignments.tsv |
3,145 | Pathway-ecotype label alongside taxon-ecotype label per sample |
data/nb04g_pathway_ecotype_verdict.json |
— | Pathway-ecotype vs taxon-ecotype verdict (ARI 0.113 — PARTIAL) |
data/nb04h_hmp2_ecotype_projection.tsv |
1,627 | HMP2 per-sample ecotype + max posterior |
data/nb04h_hmp2_subject_ecotype.tsv |
130 | HMP2 per-subject ecotype mode + disease_subtype |
data/nb04h_hmp2_e1_cd_vs_nonibd.tsv |
335 | HMP2 within-E1 CD-vs-nonIBD CLR-Δ per species |
data/nb04h_e1_tier_a_hmp2_replication.tsv |
51 | NB04e E1 Tier-A cross-referenced with HMP2 E1 CD-vs-nonIBD |
data/nb04h_hmp2_replication_verdict.json |
— | Formal HMP2 external replication verdict (PASS) |
data/nb05_tier_a_scored.tsv |
71 | Scored Tier-A with A3-A6 breakdowns + total score + actionable flag (authoritative NB05 output) |
data/nb05_tier_a_verdict.json |
— | NB05 summary: 6 actionable of 71 scored |
data/nb06_edges_{subnet}.tsv |
varies | Per-subnet edge lists ( |
data/nb06_modules.tsv |
~20 | Per-subnet module summary with actionable + tier-B content |
data/nb06_module_hubs.tsv |
~15 | Top-3 hub species per module by degree |
data/nb06_verdict.json |
— | NB06 summary + H2d verdict |
data/nb07_h3a_v18_pathway_classes.tsv |
575 | HUMAnN3 pathway × MetaCyc-classes × IBD-themes (audit trail for v1.8 schema) |
data/nb07_h3a_v18_cohort_enrichment.tsv |
12 | Per-theme Fisher enrichment cohort-level (iron/heme: OR=8.1, FDR 7e-6) |
data/nb07_h3a_v18_species_enrichment.tsv |
varies | Per-species per-theme Fisher enrichment (H. hathewayi: 2 themes supported) |
data/nb07_h3a_v18_verdict.json |
— | Formal v1.8 H3a (b) verdict (SUPPORTED) |
data/nb07c_anchor_pathobiont_species_rho.tsv |
27 | Anchor × pathobiont pairs with within-IBD-substudy ρ_meta + sign concordance |
data/nb07c_anchor_pathobiont_iron_triple.tsv |
405 | (Anchor, pathobiont, iron-pathway) triples — 27 pairs × 15 v1.8 iron-pathways |
data/nb07c_h3a_new_verdict.json |
— | H3a-new verdict (PARTIAL — A. caccae coupling clean in E1_CD) |
data/nb08a_tier_a_bgc_repertoire.tsv |
6 | Per Tier-A core species BGC repertoire (n_bgc, MIBiG compounds, Grouped Class) |
data/nb08a_bgc_theme_enrichment.tsv |
4 | BGC-theme Fisher enrichment Tier-A core vs background; iron OR=44, genotoxin OR=234 |
data/nb08a_tier_a_iron_genotoxin_per_species.tsv |
6 | Per-species iron + genotoxin MIBiG counts (E. coli: 54+25; others: 0+0) |
data/nb08a_cborf_enrichment_per_tier_a.tsv |
6 | Per-species CB-ORF CD-up rate at FDR<0.10 vs catalog background (2.5 %) |
data/nb08a_ebf_ecf_cd_vs_hc.tsv |
2 | ebf/ecf cohort meta z-stats (Stouffer's z over 4 cohorts; both p<1e-31) |
data/nb08a_h3c_verdict.json |
— | Formal H3c verdict (PARTIALLY SUPPORTED) |
data/nb09a_metab_da_cd_vs_nonibd.tsv |
579 | Per named-metabolite Mann-Whitney + cliff_delta + FDR + theme assignments + passes flag |
data/nb09a_metab_theme_enrichment.tsv |
22 | 11 themes × {CD-up, CD-down} Fisher OR + BH-FDR (polyamines + PUFAs supported) |
data/nb09a_h3d_da_verdict.json |
— | Formal H3d-DA verdict (SUPPORTED) |
data/nb09c_species_metabolite_corr.tsv |
4,664 | 8 species × 583 metabolites paired Spearman ρ + per-species FDR |
data/nb09c_cross_feeding_triangles.tsv |
7 | Strict cross-feeding triangles (same-sign + |
data/nb09c_cross_feeding_panel.tsv |
varies | Curated 7-theme × 8 species direction-of-association panel |
data/nb09c_cross_feeding_verdict.json |
— | Formal NB09c verdict (cross-feeding not supported; bile-acid 7α-dehydroxylation network identified) |
data/nb10a_per_species_bias_counts.tsv |
59 | Per Kumbhari species × IBD-biased + health-biased gene counts |
data/nb10a_sig_genes_classified.tsv |
23,579 | FDR<0.10 strain-adaptation genes × functional classification (13 categories) |
data/nb10a_cross_species_ibd_kos.tsv |
varies | Cross-species IBD-biased KEGG KOs sorted by n_species |
data/nb10a_cross_species_ibd_symbols.tsv |
varies | Cross-species IBD-biased gene symbols sorted by n_species |
data/nb10a_f_plautii_strain_adaptation.tsv |
0 | F. plautii FDR<0.10 strain-adaptation deep dive (empty — informative null) |
data/nb10a_h3b_verdict.json |
— | Formal H3b verdict (SUPPORTED — adaptation enrichment, housekeeping depletion) |
data/nb11_serology_species_correlations.tsv |
48 | (Assay × species) site-adjusted partial Pearson r + raw Spearman ρ + BH-FDR |
data/nb11_serology_site_stratified.tsv |
30 | Site-stratified Spearman ρ for top 10 (assay × species) pairs (3 sites × 10 pairs) |
data/nb11_h3e_verdict.json |
— | Formal H3e verdict (PARTIAL — strict |
data/nb09b_cross_cohort_concordance.tsv |
122 | (HMP2-name × Franzosa-peak) matched pairs with HMP2 cliff_delta + Franzosa cliff_delta + sign-match flag |
data/nb09b_theme_replication.tsv |
8 | Per-theme cross-cohort sign-concordance summary (urobilin 100 %, acyl-carnitines 80 %, PUFAs 75 %) |
data/nb09b_cross_cohort_verdict.json |
— | Formal cross-cohort replication verdict (STRONG on 3 themes; 9 strict replications) |
data/nb09d_metabolite_ecotype_assignments.tsv |
326 | Pooled subjects × {cohort, diagnosis, metabolite_cluster} K=4 assignments |
data/nb09d_h3d_clust_verdict.json |
— | Formal H3d-clust verdict (NOT SUPPORTED — cohort batch effect dominates m/z-bridge clustering) |
data/nb07d_cca_loadings.tsv |
120 | Top 15 species + 15 metabolite loadings × 4 canonical components (CC1 captures CD-vs-nonIBD axis) |
data/nb07d_subject_factor_scores.tsv |
106 | Paired-CSM subjects × {CC1-4 species/metab/joint scores} + diagnosis |
data/nb07d_mofa_pilot_verdict.json |
— | Formal pilot verdict (SUCCESSFUL — CC1 canon r=0.96 + cliff CD-vs-nonIBD = +0.50 p=4e-4) |
data/nb12_pathobiont_phage_matrix.tsv |
6 | Per Tier-A core pathobiont × phage-availability score (0-3) + therapeutic_targets + Pillar-5 priority class |
data/nb12_phage_matrix_verdict.json |
— | Pillar 4 opener verdict + 4-class phage-availability stratification |
data/nb13_ecoli_strain_modelling.tsv |
17,672 | PhageFoundry 96 phages × 188 E. coli strains susceptibility pairs (Gaborieau 2025) |
data/nb13_5phage_cocktail_coverage.tsv |
5 | Greedy minimum-set-cover 5-phage cocktail (DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2) at 94.7% coverage of 188 strains |
data/nb13_phage_cocktail_verdict.json |
— | Formal cocktail design verdict + AIEC phylogroup B2/D coverage |
data/nb14_phageome_ecotype_da.tsv |
varies | HMP2 viromics × ecotype × diagnosis Spearman ρ + cliff δ + FDR per phage family |
data/nb14_phageome_verdict.json |
— | Pillar-4 in-vivo phageome verdict (Gokushovirus CD-DOWN; E. coli × Podo/Myo +0.18/+0.13) |
data/nb15_patient_profile.tsv |
23 | Per UC Davis CD patient × NB02 ecotype + Montreal + calp + medication + Tier-A presence + NB05 score + Pillar-3 mechanism |
data/nb15_per_patient_cocktail_draft.tsv |
varies | Long-format per-patient × per-target cocktail breakdown (component, evidence, caveat) |
data/nb15_pillar5_cocktail_verdict.json |
— | Pillar 5 opener verdict + per-ecotype prescribing summary + 4 patient-design categories |
data/nb16_p6967_tier_a_longitudinal.tsv |
6 | Patient 6967 per-visit Tier-A abundance + fold change (visit 1 E1 → visit 2 E3; M. gnavus 14× expansion) |
data/nb16_longitudinal_verdict.json |
— | Pillar 5 longitudinal verdict + cocktail Jaccard 0.60 + Spearman ρ=1.000 + 5 state-dependent dosing rules |
data/nb17_patient_master_table.tsv |
23 | Per UC Davis CD patient × full master attributes + design category + cocktail strategy + longitudinal status (Pillar 5 closure capstone) |
data/nb17_target_decision_matrix.tsv |
6 | 6 actionable Tier-A × 5 attributes (NB05 score, ecotype, BA cost, mediation, phage tier) → final priority class |
data/nb17_final_verdict.json |
— | Per-pillar final verdicts + 24 NC index + 4-phase clinical-translation roadmap + thesis statement + design/strategy distributions |
/home/aparkin/data/CrohnsPhage_ext/hmp2_ibdmdb_relative_abundance.tsv |
582 | HMP2 MetaPhlAn3 relative abundance (taxa × samples) — out-of-project artifact |
/home/aparkin/data/CrohnsPhage_ext/hmp2_ibdmdb_sample_metadata.tsv |
1,627 | HMP2 sample metadata from cMD |
/home/aparkin/data/CrohnsPhage_ext/hmp2_ibdmdb_taxon_metadata.tsv |
585 | HMP2 per-taxon lineage metadata |
data/table_schemas.md |
— | Audit-committed schema documentation for the CrohnsPhage mart |
References
- Arumugam M et al. (2011). "Enterotypes of the human gut microbiome." Nature 473(7346):174–180. PMID: 21508958.
- Vandeputte D et al. (2017). "Quantitative microbiome profiling links gut community variation to microbial load." Nature 551(7681):507–511. PMID: 29143816.
- Lloyd-Price J et al. (2019). "Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases." Nature 569(7758):655–662. PMID: 31142855.
- Holmes I, Harris K, Quince C. (2012). "Dirichlet multinomial mixtures: generative models for microbial metagenomics." PLoS One 7(2):e30126. PMID: 22319561.
- Costea PI et al. (2018). "Enterotypes in the landscape of gut microbial community composition." Nat Microbiol 3(1):8–16. PMID: 29255284.
- Gloor GB et al. (2017). "Microbiome datasets are compositional: and this is not optional." Front Microbiol 8:2224. PMID: 29187837.
- Lin H, Peddada SD. (2020). "Analysis of compositions of microbiomes with bias correction." Nat Commun 11(1):3514. PMID: 32665548.
- Henke MT et al. (2019). "Ruminococcus gnavus, a member of the human gut microbiome associated with Crohn's disease, produces an inflammatory polysaccharide." Proc Natl Acad Sci USA 116(26):12672–12677. PMID: 31182571.
- Darfeuille-Michaud A et al. (2004). "High prevalence of adherent-invasive Escherichia coli associated with ileal mucosa in Crohn's disease." Gastroenterology 127(2):412–421. PMID: 15300573.
- Vujkovic-Cvijin I et al. (2020). "Host variables confound gut microbiota studies of human disease." Nature 587(7834):448–454. PMID: 33149306.
- Elmassry MM et al. (2025). [BGC catalog in gut microbiome; Cell Host & Microbe]. (Referenced in project
ref_bgc_catalogmetadata; full citation indim_studies.) - Kumbhari A et al. (2024). [Strain-frequency / IBD adaptation]. (Source:
ref_kumbhari_s7_*supplementary; full citation indim_studies.) - Zhou H, He K, Chen J, Zhang X. (2022). "LinDA: linear models for differential abundance analysis of microbiome compositional data." Genome Biology 23(1):95. PMID: 35421994. (Used in NB04c §4 as the second compositional DA method alongside CLR + Mann-Whitney; implemented in pure NumPy for this project.)
- Tsilimigras MC, Fodor AA. (2016). "Compositional data analysis of the microbiome: fundamentals, tools, and challenges." Ann Epidemiol 26(5):330–335. PMID: 27255738.
- Le PH et al. (2025). "Fecal microbiota transplantation for vancomycin-resistant Clostridium innocuum infection in inflammatory bowel disease: A pilot study evaluating safety and clinical and microbiota outcome." J Microbiol Immunol Infect. PMID: 40074633.
- Xiang B, Hu J, Zhang M, Zhi M. (2024). "The involvement of oral bacteria in inflammatory bowel disease." Gastroenterol Rep (Oxf) 12:goae076. PMID: 39188957.
- Guo Y et al. (2024). "Oral pathobiont-derived metabolites promote IBD." Gut Microbes 16(1):2333463. PMID: 38545880.
- Tanwar H et al. (2023). "Unraveling the link between periodontitis and inflammatory bowel disease: challenges and outlook." Cells 12. PMID: 37645044.
- Goel RM et al. (2019). "Streptococcus salivarius: A potential salivary biomarker for orofacial granulomatosis and Crohn's disease?" Inflamm Bowel Dis. PMID: 30796823.
- Selma MV et al. (2014). "Gordonibacter urolithinfaciens sp. nov., a urolithin-producing bacterium isolated from the human gut." Int J Syst Evol Microbiol 64:2346–2352. PMID: 24744017.
- Tierney BT et al. (2023). "Capacity of a microbial synbiotic to rescue the in vitro metabolic activity of the gut microbiome following perturbation with alcohol or antibiotics." Appl Environ Microbiol 89(3):e01880-22. PMID: 36840551.
- Dalmasso G et al. (2021). "Yersiniabactin siderophore of Crohn's disease-associated adherent-invasive Escherichia coli." Int J Mol Sci 22(7):3512. PMID: 33805299.
- Prudent V et al. (2021). "The Crohn's disease-related bacterial strain LF82 assembles biofilm-like communities to protect itself from phagolysosomal attack." Commun Biol 4(1). PMID: 34035436.
- Dogan B et al. (2014). "Inflammation-associated adherent-invasive Escherichia coli are enriched in pathways for use of propanediol and iron and M-cell translocation." Inflamm Bowel Dis 21(1):92–111. PMID: 25230163.
- Dubinsky V et al. (2022). "Escherichia coli strains from patients with inflammatory bowel diseases have disease-specific genomic adaptations." J Crohns Colitis 16(9):1484–1497. PMID: 35560165.
- Veziant J et al. (2016). "Association of colorectal cancer with pathogenic Escherichia coli: focus on mechanisms using optical imaging." World J Clin Oncol 7(3):293–302. PMID: 27298769.
- Galtier M et al. (2017). "Bacteriophages targeting adherent-invasive Escherichia coli strains as a promising new treatment for Crohn's disease." J Crohns Colitis 11(7):840–847. PMID: 28130329.
- Sheikh IA et al. (2024). "Transplant of microbiota from Crohn's disease patients to germ-free mice results in colitis." Gut Microbes 16(1):2333483. PMID: 38532703.
- Geirnaert A et al. (2015). "Interindividual differences in response to treatment with butyrate-producing Butyricicoccus pullicaecorum 25-3T studied in an in vitro gut model." FEMS Microbiol Ecol 91(6):fiv054. PMID: 25999470.
- Steppe M et al. (2014). "Safety assessment of the butyrate-producing Butyricicoccus pullicaecorum strain 25-3T, a potential probiotic for patients with inflammatory bowel disease." Food Chem Toxicol 72:129–137. PMID: 25007784.
- Jeraldo P et al. (2016). "Capturing one of the human gut microbiome's most wanted: reconstructing the genome of a novel butyrate-producing, clostridial scavenger from metagenomic sequence data." Front Microbiol 7:783. PMID: 27303377.
- Barbour A et al. (2023). "Discovery of phosphorylated lantibiotics with proimmune activity that regulate the oral microbiome." Proc Natl Acad Sci USA 120(23):e2219392120. PMID: 37216534.
- Ellermann M et al. (2020). "Dietary iron variably modulates assembly of the intestinal microbiota in colitis-resistant and colitis-susceptible mice." Gut Microbes 12(1):1599794. PMID: 31179826.
- Buret AG et al. (2019). "Pathobiont release from dysbiotic gut microbiota biofilms in intestinal inflammatory diseases: a role for iron?" J Biomed Sci 26(1):1. PMID: 30602371.
- Timsina R, Gora RA, Ferguson DJ. (2025). "Proteomic and metabolomic analysis reveals new insights into quaternary amine metabolism." mSphere 10(2):e00421-25. PMID: 40853000.
- Cheng E, Hung SC, Lin TY. (2025). "Association of trimethylamine N-oxide and metabolites with kidney function decline in patients with chronic kidney disease." Clin Nutr 44:18–25. PMID: 39709651.
- Jiang C et al. (2024). "Polyphenols from hickory nut reduce the occurrence of atherosclerosis in mice by improving intestinal microbiota and inhibiting trimethylamine N-oxide production." Phytomedicine 130:155349. PMID: 38522315.
- Schwiertz A et al. (2002). "Anaerostipes caccae gen. nov., sp. nov., a new saccharolytic, acetate-utilising, butyrate-producing bacterium from human faeces." Syst Appl Microbiol 25(1):46–51. PMID: 12086188.
- Duncan SH et al. (2004). "Lactate-utilizing bacteria, isolated from human feces, that produce butyrate as a major fermentation product." Appl Environ Microbiol 70(10):5810–5817. PMID: 15466518.
- Koppel N et al. (2018). "Discovery and characterization of a prevalent human gut bacterial enzyme sufficient for the inactivation of a family of plant toxins." eLife 7:e33953. PMID: 29760174. (Also see Koppel 2018 Science on cgr2 cardiac glycoside reductase and Eggerthella drug metabolism — establishes non-BGC mechanism for E. lenta CD-association.)
- Elmassry MM et al. (2025). "Diet, lifestyle, and the microbiome interaction reveals fatty acid amide functional signatures of inflammatory bowel disease." Cell Host & Microbe 33(2):243–256. PMID: 39837311. (BGC catalog source; ebf/ecf RPKM CD-vs-HC; replicated independently in NB08a §4.)
- Pegg AE. (2014). "Toxicity of polyamines and their metabolic products." Pharmacol Res 84:80-89. PMID: 24747196. (Polyamine biology + IBD reviewed.)
- Wang Q et al. (2018). "Plasma polyamine signatures in patients with inflammatory bowel disease." Inflamm Bowel Dis 24(2):324–334. PMID: 29361096. (Polyamine CD biomarker confirmation.)
- Franzosa EA et al. (2019). "Gut microbiome structure and metabolic activity in inflammatory bowel disease." Nat Microbiol 4(2):293–305. PMID: 30531976. (HMP2 metabolomics + microbiome integration; canonical bile-acid 7α-dehydroxylation deficit + polyamine + lipid signatures.)
- Hall AB et al. (2024). "Bilirubin reductase: a gut-bacterial enzyme that produces urobilinogen and connects intestinal microbiota to host bilirubin metabolism." Nature. (Urobilin gut-bacterial bilirubin reduction.)
- Vital M et al. (2018). "Diversity of bacteria exhibiting bilirubin-reducing activity in the human intestinal microbiota." Front Microbiol 9:486. PMID: 29615976. (Urobilinoid-producing commensals.)
- Devlin AS, Fischbach MA. (2015). "A biosynthetic pathway for a prominent class of microbiota-derived bile acids." Nat Chem Biol 11(9):685–690. PMID: 26412091. (Bile-acid 7α-dehydroxylation bai operon mechanism in Clostridium scindens-like bacteria — anchors the NB09c bile-acid network finding.)
Thesis (Pillars 1–5 closed)
Crohn's disease at the gut-microbiome level is a single principal-direction phenomenon in joint species-metabolite space, not a many-axis manifold. The dominant cross-modality structure in HMP2 metagenomics + metabolomics is a canonical correlation pair (r = 0.96) on which: (a) all 6 actionable Tier-A core pathobionts (H. hathewayi, M. gnavus, E. coli, E. lenta, F. plautii, E. bolteae) load CD-positive; (b) ecotype-defining commensals (R. bromii, A. putredinis, L. eligens) load CD-negative; (c) urobilin and secondary bile acids load CD-negative; (d) polyamines, long-chain PUFAs, fatty-acid amides (ebf/ecf substrates), and E. coli-specific cadaverine load CD-positive. A single composite axis stratifies CD from non-IBD subjects at cliff δ = +0.50 (p = 4×10⁻⁴) — the project's separate per-modality analyses (NB07a/b pathway DA, NB08a BGC, NB09a/b metabolomics, NB09c paired species×metabolite, NB06 co-occurrence modules) all measure projections of this same axis. Two cross-corroborated 6-line mechanism narratives (iron-acquisition centred on E. coli AIEC; bile-acid 7α-dehydroxylation centred on F. plautii / E. lenta / E. bolteae) sit within this unified axis as orthogonal molecular sub-mechanisms.
Pillar 4 phage-targetability framework: a 3-layer phage evidence stack (literature curation NB12 + experimental susceptibility NB13 + in-vivo phageome NB14) yields a concrete 5-phage E. coli AIEC cocktail covering 95 % of 188 PhageFoundry-tested strains (DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2), and a clear 2-tier vs phage-GAP stratification of the actionable Tier-A: clinical-trial-stage (E. coli EcoActive precedent) → lytic-literature (E. lenta PMBT5, E. bolteae PMBT24) → temperate-only (M. gnavus 6 phages) → coverage-GAP (H. hathewayi, F. plautii). The 2 phage-GAP species are also the highest-NB05-scored (4.0, 3.3); INPHARED + IMG/VR external DB queries remain the highest-priority Pillar-4 follow-up.
Pillar 5 per-patient cocktail framework (NB15 + NB16): 23 UC Davis CD patients × per-patient profile (ecotype + Montreal + calprotectin + medication + Kuehl_WGS Tier-A presence) → 14 of 23 patients (61 %) receive concrete phage cocktail drafts; all 9 E1 patients carry the full Tier-A pathobiont module. Pure phage cocktail is NOT feasible for any E1 patient — the per-patient cocktail must be a 3-strategy hybrid: (1) direct phage targeting (E. coli AIEC + E. bolteae PMBT24 + E. lenta PMBT5), (2) alternative therapies (H. hathewayi GAG-degrading enzyme inhibitors; F. plautii BA-binding co-therapy), (3) limited / engineered (M. gnavus lytic-locked phage engineering or biochemical glucorhamnan-synthesis target). F. plautii BA-coupling cost is the dominant E1 design constraint (present in 78 % of patients including all 9 E1; HIGHEST BA-coupling cost from NB09c §13 + Pillar-4 GAP from NB12 = double penalty → deprioritize from cocktail). E. coli present in only 35 % of UC Davis patients — much lower carriage than expected for AIEC-canonical species; the NB13 5-phage cocktail is directly applicable to ~8/23 patients only.
Patient 6967 longitudinal validation (NB16): the only multi-timepoint UC Davis patient with biological-replicate samples shows clear E1→E3 ecotype drift across 2 visits, with M. gnavus 14× expansion (0.53 → 7.45 reads) as the dominant signature; cocktail Jaccard between visits = 0.60 (3 shared: H. hathewayi + M. gnavus + E. lenta = universal Tier-1 trio; 2 visit-1-only: E. bolteae + F. plautii = E1-specific). Patient 1112 technical replicate Spearman ρ = 1.000 on Tier-A — Kaiju is reliable across reseq replicates, validating the technical-noise floor. 5 state-dependent dosing rules (Novel Contribution #24): re-test ecotype every 3-6 months; F. plautii inclusion is E1-specific (drop on E1→E3); E. coli inclusion is E3-specific (subject to AIEC strain detection); universal Tier-1 trio is the cocktail backbone for any active-disease CD patient; M. gnavus qPCR as a cheap clinical proxy for ecotype-state monitoring — a 5-fold change triggers full ecotype re-test, avoiding routine metagenomics. Pillar 5 closure deliverable: a complete clinical-translation workflow (initial visit ecotype assignment → ecotype-specific cocktail → 3-6 month follow-up calp + qPCR + ecotype-shift decision tree).
Status: All 5 pillars closed. Pillar 1 closed (NB00–NB03); Pillar 2 fully closed (NB04b–h + NB05 + NB06; HMP2 88.2 % Tier-A external-replication); Pillar 3 fully closed (12 notebooks; all 8 H3 sub-hypotheses tested + cross-cohort metabolomics replication + multi-omics joint factor pilot CC1 r=0.96); Pillar 4 fully closed (NB12/NB13/NB14; 3-layer phage-evidence stack; 5-phage E. coli AIEC cocktail; gut-anaerobe GAP confirmed across all 3 layers); Pillar 5 closed (NB15 per-patient cocktail drafts; NB16 patient 6967 longitudinal validation + state-dependent dosing rule + clinical workflow; NB17 cross-cutting synthesis + per-patient master table + target decision matrix + 4-phase clinical-translation roadmap). 31 notebooks; 24 numbered Novel Contributions. Pillar-4 follow-ups (INPHARED + IMG/VR external DB queries) and multi-cohort prospective validation flagged in NB17 roadmap as out-of-scope clinical-translation prerequisites.
Note on Pillar 2 rigor repair + external replication: the original NB04 analysis (committed 2026-04-24 early) was superseded by a seven-notebook pipeline (NB04b → c → d → e → f → g → h) after an adversarial review caught 5 critical + 6 important methodological issues that two independent standard
/berdl-reviewruns missed. NB04b–e performed the rigor repair (see retraction box in §5 andFAILURE_ANALYSIS.md). NB04f–h strengthened Pillar 2 against LOSO cross-study stability (NB04f), feature-leakage structural test (NB04g), and HMP_2019_ibdmdb external replication (NB04h). The rigor-controlled Tier-A replicates at 88.2 % sign concordance on HMP2 even though the ecotype framework itself has real cross-study variance. Seedocs/pitfalls.mdfor the two generalizable pitfalls (cMD substudy-nesting, feature leakage in cluster-stratified DA).
Executive Summary
Crohn's disease (CD) and ulcerative colitis (UC) microbiomes are clinically heterogeneous. A single "CD target list" inferred from pooled differential-abundance analysis of public cohorts does not translate cleanly to individual patients, in part because the heterogeneity partitions into reproducible microbiome subtypes (ecotypes) that carry distinct pathobiont signatures. Phage cocktails designed at the cohort level will mismatch individual patients unless ecotype is known first. This project asks: can we (a) define a reproducible ecotype framework, (b) place each UC Davis Crohn's patient on that framework, and (c) derive ecotype-specific and per-patient pathobiont target lists for rational phage-cocktail design?
Pillar 1 answers the first two questions.
Pillar 1 deliverables (this report)
- Four reproducible IBD ecotypes trained on 8,489 curatedMetagenomicData MetaPhlAn3 samples (5,333 HC + 3,156 IBD/other). Consensus K = 4 selected by cross-method adjusted Rand index between LDA and GMM — a rigorous criterion when per-method fit measures (perplexity, BIC) monotonically prefer larger K. Ecotypes are biologically clean and align with published enterotype literature (Arumugam 2011, Vandeputte 2017): diverse-commensal (E0, 66.8 % of HC), Bacteroides2-transitional (E1, dominant in CD/UC/T1D/T2D), Prevotella copri enterotype (E2, non-Western healthy), and severe Bacteroides-expanded (E3, IBD flare / CDI / donor 2708).
- UC Davis cohort placement onto the four-ecotype embedding — all 23 patients projected via the synonymy layer. χ²(3) = 10.0, p = 0.019: UC Davis occupies E0 (27 %), E1 (42 %), E3 (31 %), and zero in E2 — non-random and consistent with an active-disease Western cohort. Patient 6967 shows longitudinal ecotype drift (E1 → E3) across re-sampling, the first direct signal of intra-patient ecosystem instability.
- Clinical covariates alone cannot assign IBD patients to ecotype. A classifier trained on {HC/IBD status, sex, age} achieves macro AUC 0.80 on pooled cross-validation — but only 41 % patient-level agreement with the metagenomic projection on UC Davis. The classifier collapses to "IBD → E1" because
is_ibdis the dominant feature and becomes constant on the test cohort. Metagenomics remains required for patient-level ecotype assignment in an all-CD cohort. - Systematic taxonomy synonymy layer covering 2,417 aliases → 1,848 canonical species, grounded in NCBI taxid with GTDB r214+ genus renames (Bacteroides → Phocaeicola etc.). The committed artifact
data/species_synonymy.tsvis the project-wide reconciliation backbone and the tool that made both CMD pooling and Kaiju → MetaPhlAn3 projection tractable. - Compositional-DA proof of concept on the protective-species battery establishes that raw Mann-Whitney on relative abundance is systematically under-sensitive for protective-species depletion under the pooled-cohort contrast. Raw Mann-Whitney mis-directs 4+ of 8 protective species; CLR correction recovers them. C. scindens remains pooled-CLR CD↑, which Pillar 2 resolves not through within-ecotype stratification (the original NB04 claim, now retracted) but through a confound-free within-IBD-substudy CD-vs-nonIBD meta-analysis that controls for the HC-study vs IBD-study nesting problem in cMD.
Pillar 2 opener — rigor-controlled findings (this report)
- H2b (ecotype-specific targeting) is supported at high confidence. Permutation null for Jaccard(top-30 E1, top-30 E3) under randomized ecotype labels yields mean 0.785 ± 0.054. Observed Jaccard = 0.104, empirical p = 0.000 over 200 permutations. Target sets genuinely diverge between E1 and E3 — this is not a random-overlap artifact. (Retracts NB04's original "Jaccard = 0.14 supports H2b" framing; the value was near the random-overlap baseline and did not have a null distribution attached. The conclusion survives; the statistic did not.)
- H2c (paradox resolution) is retracted. Under the confound-free within-IBD-substudy CD-vs-nonIBD meta-analysis (NB04c §3), C. scindens is genuinely CD↑ with pooled CLR-Δ = +1.18, FDR = 1e-8, and 4/4 sign concordance across sub-studies. The NB04 within-ecotype "n.s." call was an artifact of feature leakage (clustering samples on taxon abundances then testing the same taxa within cluster) — when the leakage is controlled via leave-one-species-out refit, C. scindens is CD↑ in both E1 and E3. There is no paradox to resolve; C. scindens behaves like a CD-associated species in this design.
- Ecotype-specific Tier-A (rigor-controlled). Under a within-ecotype × within-substudy meta-analysis on the 4 IBD sub-studies that carry both CD and nonIBD (HallAB_2017, LiJ_2014, IjazUZ_2017, NielsenHB_2014; NB04e):
- E1 Tier-A = 51 candidates, meta-analysis across 2 sub-studies (HallAB_2017 + NielsenHB_2014, 82 CD / 280 nonIBD), all 100 % sign-concordant. Top candidates: M. gnavus (+4.85), S. salivarius (+3.26), S. thermophilus (+2.69), Erysipelatoclostridium innocuum (+2.65), S. parasanguinis (+2.44), Enterocloster asparagiformis (+2.41). This list is classical CD pathobionts — not the ecotype-marker commensals that NB04's within-ecotype DA spuriously produced.
- E3 Tier-A = 40 candidates from single-study HallAB_2017 (22 CD / 31 nonIBD). Flagged as provisional — needs replication. Top candidates: H. symbiosa (+4.64), M. gnavus (+4.46), B. coccoides (+4.22), R. faecis (+4.14), C. spiroforme (+4.11). E3-specific replication requires a second cMD-IBD study that populates E3 with both CD and nonIBD; this is blocked until HMP2 raw data (
PENDING_HMP2_RAW) is ingested. - Cross-ecotype engraftment-confirmed pathobionts. Under the within-IBD-substudy (non-stratified-by-ecotype) confound-free contrast, five of the six donor-2708-engraftment pathobionts pass as CD↑ with FDR < 0.10 and ≥ 66 % sign concordance: M. gnavus (+5.13), E. lenta (+2.30), E. coli (+1.43), E. bolteae (+1.09), H. hathewayi (+0.92). K. oxytoca is below prevalence filter and not tested. These are cross-ecotype targets independent of the NB04e ecotype-specific lists.
- NB05 input set (rigor-controlled). Union of the E1 meta-viable Tier-A (51), the E3 provisional Tier-A (40), and the 5 engraftment-confirmed cross-ecotype pathobionts — after deduplication, approximately 70–90 unique species across the three categories. M. gnavus is the top candidate across all three categories.
- External replication on HMP_2019_ibdmdb (pulled live via curatedMetagenomicData v3.18; NOT in the cMD_IBD training set). 1,627 samples / 130 subjects projected onto the K=4 LDA with 80.4 % of samples at confidence > 0.70. Subject-level ecotype × {CD, UC, nonIBD} χ² = 15.61, p = 0.016 — ecotype stratifies disease in HMP2 at significance. E1 Tier-A replicates at 88.2 % sign-concordance (45 / 51 candidates CD↑ in both cohorts), including every top-10 candidate (M. gnavus, S. salivarius, S. thermophilus, E. innocuum, S. parasanguinis, E. asparagiformis, I. bartlettii, H. symbiosa, G. pamelaeae, E. ramosum). Two top-20 candidates fail to replicate (S. thermophilus on effect sign — possibly cohort-specific dairy exposure; B. stercoris n.s.). The ecotype framework itself has real cross-study variance (NB04f LOSO ARI 0.113) — but the operational Tier-A is externally validated.
- Tier-A scoring complete (NB05). A3–A6 criteria applied to 71 unique rigor-controlled candidates produce 6 actionable targets (total_score ≥ 2.5): Hungatella hathewayi (4.0, top-scoring), Mediterraneibacter gnavus (3.8), Escherichia coli (3.6; MIBiG: Colibactin+Yersiniabactin+Enterobactin), Eggerthella lenta (3.3), Flavonifractor plautii (3.3), Enterocloster bolteae (2.8). 9 Tier-B candidates in the 2.2–2.4 range (including S. salivarius with Salivaricin MIBiG matches, E. asparagiformis, V. parvula, S. parasanguinis) remain candidates subject to Pillar 4 phage-availability promotion.
- Co-occurrence structure mapped (NB06). Per-subnet CLR+Spearman+Louvain networks on E1_all, E1_CD, E3_all, E3_CD produce 3–7 modules per subnet. In every subnet, 4–5 of 6 actionable Tier-A candidates co-cluster into a single "pathobiont module" (size 57–84 nodes). Multi-target phage cocktails are ecologically appropriate for the pathobiont-module members. F. plautii and E. coli show ecotype-specific module membership (relevant for per-patient cocktail specificity in Pillar 5). Module-anchor commensals (Butyricicoccus pullicaecorum, Anaerostipes caccae, Lactococcus lactis in E3) may provide functional-driver context for Pillar 3. Pillar 2 is now fully closed.
Pillar 3 deliverables (this report)
Twelve Pillar 3 notebooks (NB07a + NB07b + NB07_v1.8 retest + NB07c + NB07d + NB08a + NB09a + NB09b + NB09c + NB09d + NB10a + NB11) test the H3 falsifiability framework across pathway, BGC, metabolite, strain, and serology granularities, plus a multi-omics joint factor pilot. All 8 H3 sub-hypotheses tested with formal verdicts (H3a a/c, H3a b v1.8, H3a-new, H3b, H3c, H3d-DA, H3d-clust, H3e); cross-cohort metabolomics bridge to FRANZOSA_2019 (NB09b); metabolite-feature ecotype stability falsifiability (NB09d); multi-omics CCA pilot (NB07d) that recapitulates the entire CD-vs-nonIBD signal in a single joint factor.
H3 falsifiability framework — verdict summary
| Hypothesis | Notebook | Verdict | Key finding |
|---|---|---|---|
| H3a (a) within-IBD-substudy CD-vs-nonIBD pathway DA effect-size + permutation null | NB07a | SUPPORTED | 52 CD-up MetaCyc pathways, perm-null mean 0.077 |
| H3a (b) CD-up pathways concentrate in IBD-mechanism MetaCyc class themes | NB07_v1.8 | SUPPORTED | iron/heme dominant theme OR=8.1 (FDR 7e-6) |
| H3a (c) pathway-pathobiont attribution recapitulates known mechanism biology | NB07a | SUPPORTED | 137 pairs at |ρ_meta|>0.4; heme↔E.coli ρ=0.640 |
| H3a-new module-anchor commensal × pathobiont metabolic coupling | NB07c+NB09c | PARTIAL → reframed as shared-environment | NB09c rejects cross-feeding (only 7 strict triangles; lactate opposite-sign A. caccae vs pathobionts) |
| H3b Kumbhari strain-adaptation gene content discriminates disease-vs-health-adapted strains | NB10a | SUPPORTED | IBD-biased adaptation enrichment OR=1.38 (p=2.4e-6); housekeeping depletion OR=0.62 (p=6.4e-20) |
| H3c BGC-encoded inflammatory mediators localize to a minority of Tier-A pathobionts | NB08a | PARTIALLY SUPPORTED | iron-siderophore OR=44, genotoxin OR=234 — E. coli-only within Tier-A core |
| H3d-DA CD-up metabolite signatures concentrate in IBD-relevant chemical-class themes | NB09a | SUPPORTED | polyamines OR=14.6, long-chain PUFAs OR=7.9 (50 CD vs 26 nonIBD) |
| H3d-clust metabolite-derived clustering achieves higher cross-cohort stability than taxonomy | NB09d | NOT SUPPORTED | cross-cohort LOSO ARI=0.000 (cohort-batch dominates); taxonomy framework remains primary |
| H3e anti-microbial antibody titers correlate with Tier-A pathobiont abundance | NB11 | PARTIAL | top |r|=0.31 (FDR 0.40); single-cohort caveat structural per plan v1.7 |
Score: 5 SUPPORTED + 1 PARTIALLY SUPPORTED + 2 PARTIAL + 1 NOT SUPPORTED. The two negatives (H3d-clust, H3a-new) carry methodological lessons: m/z-bridge metabolite-feature clustering requires batch correction (Novel Contribution #19, NB09d); paired sample-level metabolite-metagenomics can disambiguate cross-feeding from shared-environment (NB09c §13). The two PARTIALs (H3e, H3a-new) reflect single-cohort data-scope structural caveats, not biological refutation.
Integrative capstone — NB07d CC1 confirms per-modality convergence on a single CD-vs-nonIBD axis
NB07d is not formally an H3 hypothesis test; it is a multi-omics joint-factor pilot per plan v1.7 N13. Its result is the integrative validation of the H3 framework: a single canonical correlation pair on HMP2 paired taxonomy + metabolomics (CC1, r=0.96) jointly captures the species + metabolite signatures from all 8 H3 sub-hypotheses. This is independent rediscovery via a different analytical approach (CCA on jointly-PCA'd modalities vs the per-modality DA + correlation tests of H3a–H3e). When per-modality analyses produce concordant directional signals, the joint factor decomposition collapses them into one dominant axis (Novel Contribution #21). CC1 is the operational definition of "CD biology in HMP2" at species + metabolite level for Pillar 4–5 cocktail-design purposes.
- H3a (a + c) SUPPORTED (NB07a). Within-IBD-substudy CD-vs-nonIBD pathway DA produces 52 CD-up MetaCyc pathways (FDR<0.10, |effect|>0.5) with permutation null mean 0.077; 137 pathway-pathobiont attribution pairs at |ρ_meta|>0.4 with top hit heme biosynthesis (PWY-5920) ↔ E. coli at ρ=0.640. AIEC biology recapitulated.
- H3a (b) v1.8 SUPPORTED via MetaCyc class hierarchy (Novel Contribution #12). v1.7 regex-on-pathway-names gave FAIL-degenerate; v1.8 ModelSEEDDatabase MetaCyc class hierarchy (90 % coverage of 575 HUMAnN3 pathways) gave H3a (b) SUPPORTED with iron/heme acquisition as dominant CD-up theme (OR=8.1, FDR 7e-6; 15 of 52 CD-up pathways) + H. hathewayi purine/pyrimidine recycling (OR=4.9) + TMA/choline (OR=9.3). Same data, opposite verdict — driven by ontology choice; lesson encoded as plan norm N17.
- H3a-new PARTIAL → reframed as shared-environment by NB09c (NB07c + NB09c). NB07c §10 found A. caccae × pathobiont species-level Spearman ρ=+0.39 (E. bolteae), +0.33 (H. hathewayi), +0.31 (M. gnavus), +0.29 (F. plautii) in E1_CD — initially consistent with butyrogenic cross-feeding. NB09c §13 paired-sample disambiguation rejected cross-feeding (only 7 strict triangles; lactate × A. caccae OPPOSITE sign of × pathobionts); coupling reframed as shared-environment co-occurrence in healthy/normobiotic samples.
- H3b SUPPORTED (NB10a). Across 23,579 FDR<0.10 Kumbhari strain-adaptation genes (59 species), IBD-biased genes are 1.38× ENRICHED for adaptation (mucin/glycan, two-component signaling, antibiotic resistance, virulence; OR=1.38, p=2.4e-6) and 0.62× DEPLETED for housekeeping (OR=0.62, p=6.4e-20). Strict housekeeping-domination falsifiability bound rejected. F. plautii informative null (0 FDR-passing genes) confirms species-abundance-mediated CD-association (corroborates NB07b within-carrier).
- H3c PARTIALLY SUPPORTED (NB08a). BGC-theme Fisher enrichment: iron_siderophore OR=44.4 (FDR 6e-56), genotoxin_microcin OR=234 (FDR 3e-35), NRPS-PKS-hybrid OR=1.5 (FDR 0.04). E. coli alone among Tier-A core carries the iron+genotoxin BGC signature (54 iron MIBiG: 19 Yersiniabactin + 16 Enterobactin + 19 siderophore-class; 25 genotoxin: 8 Colibactin + 15 Microcin B17). 5/6 Tier-A core have CD-up CB-ORF rates above background. ebf/ecf RPKM CD-up across 4 cohorts at p<1e-31 (replicates Elmassry 2025).
- H3d-DA SUPPORTED (NB09a). 52 of 579 named HMP2 metabolites pass DA at subject level (50 CD vs 26 nonIBD). Two themes Fisher-significant: polyamines OR=14.6 (putrescine, N1-acetylspermine, N-acetylputrescine, anserine, diacetylspermine all CD-up) + long-chain PUFAs OR=7.9 (arachidonate, adrenate, DHA, DPA, EPA all CD-up). Tauro-α/β-muricholate + free taurine CD-up corroborates F. plautii 7α-dehydroxylation deficit at metabolite level.
- Bile-acid 7α-dehydroxylation network identified at sample level (NB09c §13). 468 paired CSM HMP2 samples × 8 species × 583 metabolites Spearman ρ. F. plautii / E. lenta / E. bolteae* show predicted substrate-product signature (negative with primary tauro-BAs, positive with secondary BAs); M. gnavus / E. coli show opposite pattern. E. coli dominates cohort-level correlations (cadaverine +0.45, choline +0.25, tryptophan +0.25).
- H3e PARTIAL at strict threshold (NB11). 48 (assay × species) tests across 67 HMP2 subjects with site-adjusted partial Pearson r. Top pair ANCA × M. gnavus +0.31 (FDR 0.40); no pair clears |r|>0.40 + FDR<0.10 plan threshold. Cohort sanity check passes on canonical IBD-serology patterns (ANCA-UC, ASCA-CD, CBir1-CD all match canonical direction). Site-stratified single-site r reaches +0.46 (Harvard) but cohort-aggregation pulls toward null. Single-cohort caveat structural per plan v1.7. Serology usable as CD-vs-UC stratifier but not as per-target abundance predictor.
- Cross-cohort metabolomics replication on FRANZOSA_2019 (NB09b). m/z bridge (±0.005 Da) maps 122 of 592 HMP2 named metabolites to Franzosa peaks; 118 testable. 3 themes replicate at ≥75 % sign-concordance: urobilin/porphyrin (100 %, n=3, median Franzosa cliff −0.747 — strongest replicated effect), acyl-carnitines (80 %, n=5, +0.531), long-chain PUFAs (75 %, n=12, +0.273). Bile-acid primary 57 %, secondary 40 % — cohort-aggregate BA DA is weaker than NB09c §13 paired sample-level evidence. Polyamines unable to bridge by m/z (low-mass HILIC-pos gap); HMP2 polyamine OR=14.6 stands as single-cohort finding. 9 strict cross-cohort replications.
- H3d-clust NOT SUPPORTED (NB09d). Pooled HMP2 + Franzosa metabolite-feature K=4 K-means on m/z-bridge panel (326 subjects × 111 features); cluster structure is COHORT-driven, not diagnosis-driven (PC1 explains 79 % of variance and separates HMP2 from Franzosa cleanly). Cross-cohort LOSO ARI = 0.000 (≪ 0.113 taxonomic baseline NB04f). Within-pooled bootstrap ARI = 0.937 is misleading — measures cohort-batch reproducibility, not biology. Methodological lesson: m/z-bridge metabolomics requires explicit batch correction (ComBat/SVA/RUV) for cross-cohort clustering; taxonomic-feature ecotype framework (NB01b consensus K=4) remains the project's primary ecotype basis for Pillar 4–5 cocktail design.
- Multi-omics joint factor pilot SUCCESSFUL (NB07d). 2-modality CCA on HMP2 paired CSM subjects (taxonomy + metabolomics; pathway not in mart per v1.9) yields 4 canonical pairs at r=0.89-0.96. CC1 (canon r=0.96, cliff CD-vs-nonIBD = +0.50, p=4e-4) is a single joint factor that recapitulates ALL Pillar 3 mechanism narratives: ALL 6 actionable Tier-A core species load CD-positive; metabolite loadings include urobilin CD-DOWN (NB09b), polyamines + PUFAs + cadaverine × E. coli (NB09a/c), linoleoyl/palmitoyl ethanolamides (NB08a ebf/ecf), BA secondary depletion (NB09c). NB06 H2d pathobiont module structure independently rediscovered as a single principal direction in joint species-metabolite space*. The unified Pillar 3 CD-vs-nonIBD axis collapses into a single canonical factor at r=0.96.
Pillar 3 closure synthesis — two cross-corroborated mechanism narratives
After eight Pillar 3 notebooks, the project has produced two independent six-line cross-corroboration narratives (Novel Contribution #17), each demonstrating the same biological claim across multiple analytical granularities:
Iron-acquisition narrative — E. coli (AIEC subset) drives CD pathobiont iron specialization:
1. NB05 §5g — E. coli MIBiG matches Yersiniabactin + Enterobactin + Colibactin (per-actionable lookup)
2. NB07a §c — heme biosynthesis ↔ E. coli attribution at ρ=0.640 (sample-level pathway × species)
3. NB07 v1.8 §9 — iron/heme MetaCyc-class theme dominant CD-up (OR=8.1, FDR 7e-6; cohort-level pathway-class)
4. NB07c §2 — iron-pathway co-variation concentrates on E. coli (mean ρ=0.45 vs 0.13 for A. caccae; sample-level species × pathway)
5. NB08a §2 — iron-siderophore BGCs 44× enriched in Tier-A core, driven entirely by E. coli's 54 iron MIBiG BGCs (genomic content)
6. AIEC literature — Dalmasso 2021 (yersiniabactin), Prudent 2021 (LF82 phagolysosomal survival), Dogan 2014 (AIEC iron-pathway enrichment), Veziant 2016 (colibactin-CRC), Dubinsky 2022 (IBD E. coli genomic adaptations)
Bile-acid 7α-dehydroxylation narrative — F. plautii / E. lenta / E. bolteae are active 7α-dehydroxylating species in CD pathobiont module:
1. NB05 — F. plautii + E. lenta + E. bolteae in actionable Tier-A
2. NB07b §4 — F. plautii CD-up biosynthesis pathways include F420-bile-acid metabolism (within-carrier)
3. NB09a §12 — tauro-α/β-muricholate + free taurine CD-up at subject level (substrate-pool elevation)
4. NB09c §13 — F. plautii × cholate ρ=−0.26, × lithocholate ρ=+0.15; E. bolteae × deoxycholate ρ=+0.17, × lithocholate ρ=+0.18; E. lenta × ketodeoxycholate ρ=−0.14 (paired sample-level direct substrate-product signature)
5. NB10a §14 — F. plautii informative null (0 FDR-passing strain-adaptation genes) → species-abundance-mediated mechanism (no within-species strain escape route via gene-content selection)
6. Literature — Franzosa 2019 (canonical IBD bile-acid 7α-dehydroxylation deficit), Devlin & Fischbach 2015 (bai operon mechanism in Clostridium scindens-like bacteria)
These two narratives converge on complementary cocktail-design implications that Pillar 4–5 will operationalize. Each actionable Tier-A core pathobiont is profiled across four axes:
| Pathobiont | NB05 Tier-A score | Iron specialization | Bile-acid 7α-dehydroxylation | Mechanism mediation | Cocktail-design priority |
|---|---|---|---|---|---|
| H. hathewayi | 4.0 | none | NOT in network | species-abundance + within-carrier metabolic shift (NB07b biosynthesis-up + sugar-down) | Tier-1 phage target: highest score, low BA cost, broad targeting OK |
| M. gnavus | 3.8 | none | NOT in network | species-abundance (NB10a no Kumbhari signal; mucin-glucorhamnan producer per Henke 2019) | Tier-1 phage target: low BA cost, mucin-degradation mechanism well-characterized |
| E. coli | 3.6 | dominant (Yersiniabactin/Enterobactin/Colibactin) | NOT in network | strain-content-mediated (NB07b within-carrier CD-DOWN + NB08a 54 iron MIBiG; AIEC subset specific) | Tier-1 phage target with strain-resolution requirement: target AIEC subset specifically (pks/Yersiniabactin/Enterobactin-positive); low BA cost |
| F. plautii | 3.3 | weak | active 7α-dehydroxylator (NB09c §13: ρ × cholate=−0.26, × lithocholate=+0.15) | species-abundance-mediated (NB10a F. plautii informative null) | Tier-2 phage target with HIGHEST BA-coupling cost: depletion shifts BA pool toward inflammatory primary tauro-conjugated forms; consider co-administering UDCA or BA-binding agent |
| E. bolteae | 2.8 | none | active 7α-dehydroxylator (NB09c × deoxycholate=+0.17, × lithocholate=+0.18) | mixed | Tier-2 phage target with MODERATE BA-coupling cost |
| E. lenta | 3.3 | none | partial 7α-dehydroxylator (NB09c × ketodeoxycholate=−0.14) | drug-metabolism-mediated (Koppel 2018 Cgr2; NB08a CB-ORF CD-DOWN 54%) | Tier-2 phage target with MODERATE BA-coupling cost; primary CD-association is non-BGC drug-metabolism mechanism, may not require strain-resolution |
Pillar 4–5 hand-off framework: Phage cocktail design proceeds in 4 steps:
1. Patient ecotype assignment (NB02-style projection on stool MetaPhlAn3) → E0 / E1 / E3 (E2 absent in active CD).
2. Per-ecotype Tier-A target list (NB04e + NB06): E1_CD module 0 has H. hathewayi, F. plautii, E. bolteae, E. lenta, M. gnavus; E3_CD module 1 has E. lenta, H. hathewayi, E. coli, M. gnavus. F. plautii is E1-specific; E. coli is E3-specific.
3. Per-target phage availability (Pillar 4 NB12+, PhageFoundry + external phage DBs): match Tier-1 / Tier-2 candidates to available phages. E. coli AIEC has direct phage-therapy precedent (Galtier 2017).
4. Per-patient cocktail draft (Pillar 5 NB15+): UC Davis 23 patients × ecotype × Tier-A profile + bile-acid coupling cost annotations + medication harmonization. Patient 6967 longitudinal E1 ↔ E3 shift is the central per-patient stability test. The H3 framework + CC1 integrative pilot give Pillar 5 a mechanistically grounded multi-attribute decision framework, not a single-axis Tier-A score.
Pillar 4 deliverables (this report)
Three Pillar 4 notebooks (NB12 + NB13 + NB14) produce a complementary 3-layer phage-evidence stack — curated literature foundation + experimental susceptibility + in-vivo phageome — that converges on a concrete cocktail-design framework for the 6 actionable Tier-A core pathobionts.
-
NB12 — Curated literature foundation (12 organisms × phage-availability scoring 0-3): per-pathobiont qualitative phage profile from
ref_phage_biology(literature-curated synthesis from indicator_taxa_literature_review). Stratifies actionable Tier-A into 4 classes: clinical-trial-stage (1 species: E. coli EcoActive), lytic-literature (2: E. lenta PMBT5, E. bolteae PMBT24), temperate-only (1: M. gnavus 6 temperate phages), GAP (2: H. hathewayi, F. plautii). Coupled with Pillar-3 per-target mechanism profile (iron / BA-coupling-cost / mediation) → 4-attribute decision framework for Pillar 5. -
NB13 — PhageFoundry quantitative E. coli phage-cocktail design (BERDL
phagefoundry_strain_modelling: 96 phages × 188 E. coli strains × 17,672 experimentally-tested susceptibility pairs from Gaborieau 2025-10-02). 22 % susceptibility rate. Greedy minimum-set-cover yields 5-phage cocktail covering 94.7 % of 188 strains (DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2 — comparable in size to the EcoActive 7-phage clinical-trial cocktail). 65 / 94 phages (69 %) isolated against B2/D phylogroup hosts (predominantly AIEC per Dogan 2014/Dubinsky 2022); top phages include those isolated against canonical AIEC reference strains LF82, LF73, 536. 26 strains (14 %) phage-resistant at ≤5 % susceptibility — escape candidates. -
NB14 — HMP2 endogenous phageome × ecotype × diagnosis (3,039 viromics rows × 630 samples with NB04h ecotype calls = 97 % overlap). Gokushovirus WZ-2015a CD-DOWN cross-ecotype (Microviridae depletion in CD; E1 cliff=-0.358 FDR=5e-7 in n_CD=231 vs n_HC=125; consistent with Norman 2015 / Clooney 2019). E. coli × Podoviridae +0.18, × Myoviridae +0.13 Spearman correlations confirm endogenous E. coli phage carriage at the in-vivo level (T7-like + T4-like Caudovirales families). All in-vivo correlations |ρ|≤0.18 — modest signal relative to species-level CD axis. H. hathewayi + M. gnavus show negative correlation with the dominant "Unknown" phage family (VirMAP family-classification gap = 80 % of observations).
Pillar 4 closure synthesis — 3-layer phage-evidence convergence
| Phage-availability axis | NB12 curated | NB13 quantitative | NB14 in-vivo | Combined Tier |
|---|---|---|---|---|
| E. coli AIEC | clinical-trial (EcoActive) | 5-phage cocktail at 95 % strain coverage; 65/94 AIEC-relevant phylogroup | +0.18 Podoviridae, +0.13 Myoviridae endogenous correlation | Tier-1: phage-feasible at high confidence, all 3 layers aligned |
| E. lenta | PMBT5 siphovirus literature | not in PhageFoundry collection | no detectable in-vivo correlation | Tier-2: phage-feasible with monitoring |
| E. bolteae | PMBT24 (Kielviridae, virulent) | not in PhageFoundry collection | +0.12 Myoviridae correlation | Tier-2: phage-feasible with monitoring |
| M. gnavus | 6 temperate siphoviruses | not in PhageFoundry collection | weak in-vivo signal | Limited: temperate-only — alternatives needed |
| H. hathewayi | NO known phages | not in PhageFoundry collection | no detectable in-vivo phage correlation | GAP: external-DB query priority |
| F. plautii | not in literature curation | not in PhageFoundry collection | no detectable in-vivo phage correlation | GAP + highest BA-coupling cost — alternatives preferred |
The 3 layers converge: where one layer shows phage availability (NB12 EcoActive, NB13 PhageFoundry susceptibility, NB14 endogenous Podoviridae/Myoviridae correlation), all three layers tend to show it (E. coli). Where one layer shows GAP (NB12 H. hathewayi/F. plautii), the others confirm GAP (NB13 not-in-collection; NB14 no in-vivo correlation). This 3-layer convergence is the Pillar-4 rigor signal — analogous to Novel Contribution #14 (5-line iron-acquisition narrative) and #17 (2 cross-corroborated narratives) — applied to phage-therapy feasibility: the same per-target classification emerges from three independent evidence types.
Pillar 4 → Pillar 5 hand-off — concrete cocktail-design framework
The 6 actionable Tier-A stratify into 3 Pillar-5 design strategies:
- Direct phage targeting (Tier-1) — E. coli AIEC with EcoActive precedent or our 5-phage NB13-derived cocktail (DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2). Strain-resolution AIEC diagnostic required (pks-island + Yersiniabactin + Enterobactin gene-presence per NB07b + NB08a).
- Phage targeting with monitoring (Tier-2) — E. lenta (PMBT5), E. bolteae (PMBT24), with bile-acid pool monitoring per NB09c §13. M. gnavus possible if lytic-locked phage engineering succeeds.
- Phage GAP — alternative strategies needed:
- H. hathewayi: highest priority for INPHARED + IMG/VR external-DB query; if no phages found, fall back to GAG-degrading enzyme inhibitors (per
ref_phage_biologytherapeutic_targets). - F. plautii: lowest Pillar-5 priority due to highest BA-coupling cost — consider deprioritizing in favor of co-administering UDCA / BA-binding agent.
- M. gnavus: biochemical glucorhamnan-synthesis inhibitors (Henke 2019) if lytic-locked engineering fails.
Per-ecotype hand-off to Pillar 5:
- E0 (27 % of UC Davis): minimal pathobiont burden — possibly does not need cocktail
- E1 (42 % of UC Davis): dominant CD ecotype; full Tier-1 cocktail (E. coli AIEC + Tier-2 E. bolteae/E. lenta + bile-acid monitoring) appropriate
- E3 (31 % of UC Davis): severe Bacteroides-expanded — E. coli and M. gnavus prominent; cocktail + ecological-cost annotation more critical due to higher pathobiont burden
- Patient 6967 (longitudinal E1 ↔ E3 drift): per-patient cocktail dosing may need to be ecotype-state-aware; first central per-patient stability test for Pillar 5
Pillar 5 deliverables (this report)
Pillar 5 is delivered through two complementary notebooks: NB15 produces concrete per-patient cocktail drafts for all 23 UC Davis CD patients (the cohort-wide cross-section), and NB16 validates per-patient stability over time + establishes the state-dependent dosing rule (the longitudinal axis).
-
NB15 — Per-patient profile + cocktail draft (Pillar 5 opener). 23 UC Davis CD patients × full profile (NB02 ecotype + Montreal + calprotectin + medication + Kuehl_WGS Tier-A presence). Per-target prescribing rate: M. gnavus 91 %, H. hathewayi 83 %, E. bolteae 83 %, F. plautii 78 %, E. lenta 70 %, E. coli only 35 %. Per-ecotype cocktail summary: E0 (n=7) mean 1.71 targets / 0 with concrete phage; E1 (n=9) mean 4.89 targets / 9 with concrete cocktails; E3 (n=6) mean 2.33 targets / 4 with concrete phages; mixed (6967) 3 targets. 14 of 23 patients (61 %) have concrete phage cocktail drafts. Patient stratification into 4 cocktail-design categories (Active+many-targets / Active+few-targets / Quiescent / Mixed-ecotype). F. plautii BA-coupling cost is the dominant E1 design constraint (present in 78 % of patients, ALL 9 E1; deprioritize + co-administer UDCA/BA-binding agent). E1 hybrid 3-strategy cocktail framework: pure phage cocktail not feasible — combines direct phage (E. coli + E. bolteae + E. lenta) + alternative (H. hathewayi enzyme inhibitors + F. plautii BA-binding) + limited (M. gnavus engineering / biochemical).
-
NB16 — Patient 6967 longitudinal stability + state-dependent dosing (Pillar 5 closure). Patient 6967 is the only multi-timepoint UC Davis CD patient with biological-replicate samples; patient 1112 has 2 reseq replicates of the same biological sample (Kaiju reliability validation). Patient 6967 shows clear E1→E3 ecotype drift across 2 visits with M. gnavus 14× expansion (0.53 → 7.45 reads) as the dominant signature; E. lenta 3.1×, F. plautii 1.9×, E. bolteae 2.1×, H. hathewayi 1.3× also expand reflecting general dysbiosis worsening, but the M. gnavus fold-change dominates. Per-visit cocktail composition: visit-1 E1 = 5 priority targets (all 5 present); visit-2 E3 = 3 priority targets present (E. coli absent — patient 6967 is not an AIEC carrier). Cocktail Jaccard (visit 1 × visit 2) = 0.60 — moderate overlap; ecotype drift implies non-trivial cocktail re-design but not a complete rewrite. Patient 1112 technical replicate Spearman ρ = 1.000 (p < 0.001) on 6 Tier-A — perfect rank concordance across reseq replicates, validating Kaiju reliability for the longitudinal contrast. 5 state-dependent dosing rules (Novel Contribution #24): (a) re-test ecotype every 3-6 months for active CD on phage cocktail therapy; (b) F. plautii inclusion is E1-specific; (c) E. coli inclusion is E3-specific (subject to AIEC strain detection); (d) universal Tier-1 trio (M. gnavus + H. hathewayi + E. lenta) spans both ecotypes — cocktail backbone for any active-disease CD patient; (e) M. gnavus qPCR as cheap clinical proxy for ecotype-state monitoring — 5-fold change triggers full ecotype re-test, avoiding routine metagenomics.
Pillar 5 closure synthesis — hybrid cocktail framework + state-dependent dosing rule
Pillar 5 establishes 4 per-patient design principles that constitute the project's clinical-translation deliverable.
Principle 1 — Pure phage cocktail is structurally infeasible for the dominant E1 ecotype. All 9 E1 patients carry the full 5-species Tier-A pathobiont module, but only 3 of the 5 species (E. coli, E. bolteae, E. lenta — when present) have direct lytic phage options. The remaining 2 (H. hathewayi, F. plautii) are Pillar-4 GAP species. Cocktail must be hybrid: phages + non-phage alternatives. (NB15)
Principle 2 — F. plautii is a "leave alone" target despite NB05 score 3.3. F. plautii is present in 78 % of UC Davis patients AND has the HIGHEST bile-acid coupling cost (NB09c §13 active 7α-dehydroxylator) AND is in Pillar-4 phage GAP (not in ref_phage_biology). Triple penalty makes F. plautii a deprioritization target — phage-cocktail design should avoid F. plautii and instead use BA-binding co-therapy (UDCA / cholestyramine) to maintain the secondary bile-acid pool that F. plautii's depletion would otherwise erode. (NB15 + NB09c)
Principle 3 — E. coli AIEC is over-emphasized in the literature relative to UC Davis cohort prevalence. Only 8 of 23 UC Davis patients carry detectable E. coli (35 %), versus the canonical CD literature framing where AIEC is the prototypical CD pathobiont (Darfeuille-Michaud 2004). The NB13 5-phage AIEC cocktail (DIJ07_P2 + LF73_P1 + AL505_Ev3 + 55989_P2 + LF110_P2) is directly applicable to ~35 % of patients, while H. hathewayi (universal 83 %) and M. gnavus (universal 91 %) — both Pillar-4 GAP — represent the unmet need. External DB queries (INPHARED + IMG/VR) for H. hathewayi / M. gnavus lytic phages are a higher Pillar-5-clinical-translation priority than further refining the E. coli cocktail. (NB15)
Principle 4 — Ecotype is dynamic, not static; cocktail design must be state-dependent. Patient 6967 demonstrates E1→E3 drift between visits with M. gnavus 14× expansion as the dominant signature; cocktail Jaccard = 0.60 between visits. The clinical-translation implication is a 3-tier cocktail logic: (a) the universal Tier-1 trio (M. gnavus + H. hathewayi + E. lenta) is the backbone — both ecotypes carry these, so they don't need re-evaluation on ecotype shift; (b) F. plautii inclusion is E1-specific (drop on E1→E3 transition); (c) E. coli inclusion is E3-specific (drop on E3→E1 transition, subject to AIEC strain detection). M. gnavus qPCR is a candidate cheap clinical proxy for ecotype-state monitoring — a 5-fold change in M. gnavus abundance triggers full ecotype re-test, avoiding full metagenomics at every visit. (NB16, Novel Contribution #24)
| Patient class | n | Recommended cocktail strategy |
|---|---|---|
| E1 + active calp + E. coli present | ~3 | Full Tier-1 cocktail: NB13 5-phage E. coli + PMBT24 + PMBT5 + alternatives for F. plautii / H. hathewayi / M. gnavus |
| E1 + active calp + no E. coli | ~5 | Tier-2 hybrid cocktail: PMBT24 + PMBT5 + alternatives + co-administering UDCA/BA-binding for F. plautii |
| E1 + quiescent | 1-2 | Reserve cocktail for flares; calp + ecotype-state monitoring |
| E0 + any | 7 | Limited cocktail (priority targets are GAP/temperate); consider non-cocktail strategies |
| E3 + E. coli present | ~3 | NB13 5-phage E. coli + PMBT5 + alternatives |
| E3 + no E. coli | ~3 | Limited (E. lenta + H. hathewayi only) |
| Mixed (6967) | 1 | State-dependent dosing — per-visit cocktail rebalancing per Principle 4 |
Clinical-translation workflow (NB16 §5)
Initial visit:
└─ Stool metagenomics → ecotype assignment
├─ E0: limited cocktail, flare reserve
├─ E1: full hybrid 3-strategy cocktail (NB13 5-phage E. coli if AIEC+;
│ PMBT24; PMBT5; alternatives for H. hathewayi + F. plautii)
└─ E3: focused cocktail (E. coli if present; PMBT5; alternatives)
Follow-up visits (3-6 month):
├─ Calprotectin: assess disease activity
├─ M. gnavus qPCR: cheap ecotype-state indicator
│ └─ if 5-fold change → trigger full ecotype re-test
└─ If full re-test shows ecotype shift:
├─ E1 → E3: drop F. plautii; consider adding E. coli cocktail
├─ E3 → E1: add F. plautii alternative; reassess E. coli targeting
└─ Stable ecotype: continue current cocktail
Pillar 5 closure capstone (NB17) — cross-cutting synthesis
NB17 consolidates Pillar 1-5 into three artifacts that constitute the project's clinical-translation deliverable.
- NB17 — Per-patient master table + target decision matrix + clinical-translation roadmap. Three deliverables: (a) per-patient master table for all 23 UC Davis CD patients combining NB02 ecotype + Montreal + calp + medication + Tier-A presence + design category + cocktail strategy + longitudinal status; (b) target decision matrix for 6 actionable Tier-A × 5 attributes (NB05 score, ecotype membership, BA-coupling cost, mediation mode, phage tier) → final priority class; (c) 4-phase clinical-translation roadmap (Immediate / Near / Mid / Long term) with concrete milestones.
Per-patient cocktail-strategy distribution (final per-patient breakdown):
| Cocktail strategy | n patients | Components |
|---|---|---|
| Reserve for flare (C category) | 12 | No active cocktail; calp monitor + M. gnavus qPCR |
| E1 hybrid 3-strategy (no E. coli) | 4 | PMBT24 + PMBT5 + alternatives for H. hathewayi + F. plautii + M. gnavus + UDCA/BA-binding |
| E0 limited (priority targets are GAP) | 4 | Limited cocktail; consider non-phage strategies |
| E1 hybrid 3-strategy (full, with E. coli AIEC) | 1 | NB13 5-phage E. coli + PMBT24 + PMBT5 + alternatives + UDCA |
| E3 focused (with E. coli) | 1 | NB13 5-phage E. coli + PMBT5 + alternatives |
| State-dependent dosing | 1 | Patient 6967: cocktail rebalances on E1↔E3 ecotype shifts |
Concrete phage cocktail drafts: 14 of 23 patients (61 %). The 12 quiescent patients (calp < 250 or unmeasured) get the same Tier-A panel but as a "reserve for flare" rather than active cocktail.

Target decision matrix — final priority class per actionable Tier-A:
| Species | NB05 | Ecotype | BA cost | Phage tier | UCD prev. | Final priority class |
|---|---|---|---|---|---|---|
| H. hathewayi | 4.0 | E1+E3 | none | GAP | 83 % | Tier-1 phage GAP — INPHARED/IMG-VR query priority |
| M. gnavus | 3.8 | E1+E3 | none | temperate-only | 91 % | Tier-1 limited — lytic-locked engineering OR biochemical glucorhamnan target |
| E. coli | 3.6 | E3-specific | none | clinical-trial | 35 % | Tier-1 phage with strain-resolution — NB13 5-phage cocktail; AIEC diagnostic required |
| E. lenta | 3.3 | universal Tier-1 | moderate | lytic (PMBT5) | 70 % | Tier-2 phage — PMBT5 with BA monitoring |
| F. plautii | 3.3 | E1-specific | HIGHEST | GAP | 78 % | Tier-2 deprioritize — triple penalty; UDCA/BA-binding co-therapy |
| E. bolteae | 2.8 | E1+E3 | moderate | lytic (PMBT24) | 83 % | Tier-2 phage — PMBT24 with BA monitoring |
The matrix exposes the structural shape of the clinical-translation problem: the 2 highest-NB05-scored species (H. hathewayi, F. plautii) are both Pillar-4 GAP; the species with the most complete phage precedent (E. coli) is the rarest in UC Davis (35 % carriage); F. plautii has triple penalty and should be deprioritized despite NB05-actionable status; M. gnavus is near-universal but temperate-only.
4-phase clinical-translation roadmap (concrete milestones):
- Immediate (current cohort) — already delivered: 23 patient cocktail drafts; F. plautii BA-cost annotation; 4-category stratification; 5 state-dependent dosing rules + clinical workflow
- Near-term (6–12 months) — feasible external-data extensions: INPHARED + IMG/VR external DB queries for 3 GAP species; AIEC strain-resolution diagnostic; M. gnavus qPCR validation
- Mid-term (12–24 months) — additional cohort/assay generation: targeted qPCR ecotype panel; per-patient bile-acid panel; multi-cohort serology meta-analysis; expanded longitudinal sampling beyond patient 6967
- Long-term (24+ months) — clinical pilot territory: hybrid 3-strategy cocktail clinical pilot; lytic-locked phage engineering for M. gnavus; GAG-degrading enzyme inhibitor screening for H. hathewayi
Final thesis (one sentence):
Crohn's disease at the gut-microbiome level is a single principal-direction phenomenon (NB07d CC1 r=0.96) within which 6 actionable Tier-A pathobionts and 2 cross-corroborated mechanism narratives (iron-acquisition + bile-acid 7α-dehydroxylation) define a state-dependent, hybrid-cocktail design framework with concrete per-patient drafts for 14 of 23 UC Davis CD patients.
Output artifacts:
- data/nb17_patient_master_table.tsv — 23 patients × full master attributes + design category + cocktail strategy + longitudinal status
- data/nb17_target_decision_matrix.tsv — 6 actionable × 5 attributes + final priority class
- data/nb17_final_verdict.json — per-pillar verdicts + 24 NC index + clinical-translation roadmap + thesis statement
- figures/NB17_synthesis.png — 3-panel: target decision matrix + per-patient design map + clinical-translation timeline
(Script: run_nb17.py. Pillar 5 closure capstone; integrates NB02 + NB05 + NB06 + NB09c + NB10a + NB12 + NB15 + NB16 evidence into per-patient master table + target decision matrix + clinical-translation roadmap. Per plan v1.9 no-raw-reads.)
What this report does not yet contain
All 5 pillars closed. The project's central science question is answered within plan v1.9 scope. Pillar-4 follow-ups (out of scope but flagged for clinical-translation continuity): external-DB queries (INPHARED ~25K phages, IMG/VR ~3M UViGs) for the 3 gut-anaerobe coverage gaps (H. hathewayi, M. gnavus lytic alternatives, F. plautii). Multi-cohort prospective validation of the state-dependent dosing rule (n=1 patient 6967 trajectory) and per-patient AIEC strain-resolution diagnostic are flagged as required clinical-translation prerequisites in the NB17 roadmap (Near-term tier).
Project Outline (final)
All 5 pillars closed. Only Pillar-4-followup external-DB queries remain (out of project scope, flagged for clinical-translation continuity in NB17 roadmap). Final outline:
- Pillar 1 (closed) — Five notebooks NB00 + NB01 + NB01b + NB02 + NB03. Four reproducible IBD ecotypes from 8,489 cMD MetaPhlAn3 samples; H1a directionally supported; UC Davis χ² p=0.019 (H1b); H1c clinical-covariate classifier passes AUC 0.80 in cross-validation but only 41 % agreement with metagenomics on UC Davis (revised: metagenomics remains required).
- Pillar 2 (closed) — NB04 superseded by NB04b+c+d+e (rigor repair) + NB04f+g+h (LOSO stability + pathway-feature refit + HMP2 external replication) + NB05 (Tier-A A3–A6 scoring) + NB06 (per-ecotype co-occurrence networks + H2d). Six actionable Tier-A: H. hathewayi, M. gnavus, E. coli, E. lenta, F. plautii, E. bolteae. HMP2 88.2 % E1 Tier-A external replication.
- Pillar 3 (fully closed) — All 8 H3 sub-hypotheses tested with 12 notebooks (NB07a/b/v18/c/d + NB08a + NB09a/b/c/d + NB10a + NB11). Verdict score: 5 SUPPORTED + 1 PARTIALLY SUPPORTED + 2 PARTIAL + 1 NOT SUPPORTED. Two cross-corroborated 6-line mechanism narratives + cross-cohort replication on 3 themes + multi-omics joint factor pilot (NB07d CC1 r=0.96).
- Pillar 4 (closed) — Three notebooks NB12 + NB13 + NB14 producing 3-layer phage-evidence stack. Concrete 5-phage E. coli AIEC cocktail at 95 % strain coverage; gut-anaerobe Tier-A coverage gap confirmed across all 3 layers.
- Pillar 5 (closed) — Three notebooks NB15 + NB16 + NB17. 14 of 23 patients (61 %) have concrete cocktails; pure phage cocktail not feasible for E1 (3-strategy hybrid required); F. plautii BA-cost is dominant E1 design constraint; E. coli only in 35 % of cohort. Patient 6967 longitudinal E1→E3 drift validated (M. gnavus 14× expansion); 5 state-dependent dosing rules established (Novel Contribution #24); cross-cutting synthesis delivered as per-patient master table + target decision matrix + 4-phase clinical-translation roadmap.
Future Directions (out-of-scope clinical-translation prerequisites)
The project's central science question is fully answered within plan v1.9 scope (no raw reads). The items below are flagged in the NB17 clinical-translation roadmap as out-of-scope follow-ups required to bridge from "computational deliverables" to "clinical pilot". They are organized by tier from the NB17 roadmap.
Near-term (6–12 months) — feasible external-data extensions
- INPHARED + IMG/VR external DB queries for the 3 gut-anaerobe phage-coverage gaps (Pillar-4 follow-up; the highest-priority extension). The 2 highest-NB05-scored species (H. hathewayi 4.0, F. plautii 3.3) are Pillar-4 GAP, plus M. gnavus is temperate-only. INPHARED ~25K phages + IMG/VR ~3M UViGs cover the gut-anaerobe phage-discovery space that the BERDL-internal
ref_phage_biologyliterature curation does not. Closing this gap converts the current "hybrid 3-strategy cocktail with non-phage alternatives" framework into a pure-phage cocktail framework for at least some E1 patients. - AIEC strain-resolution diagnostic for the 8/23 E. coli-positive UC Davis patients. Per NB07b within-carrier signal + NB08a iron+genotoxin BGCs, the E. coli CD-association is strain-content-mediated (AIEC subset). The NB13 5-phage cocktail's clinical applicability requires per-patient strain genotyping — pks-island + Yersiniabactin + Enterobactin gene-presence detection via PCR or amplicon panel.
- M. gnavus qPCR validation as cheap ecotype-state proxy (NB16 hypothesis). The 14× M. gnavus expansion in patient 6967's E3 transition suggests qPCR could substitute for full metagenomics in clinical follow-up. Validation requires a prospective cohort with paired qPCR + metagenomics across timepoints.
Mid-term (12–24 months) — additional cohort/assay generation
- Targeted qPCR ecotype panel — 4–6 species (F. prausnitzii, P. copri, P. vulgatus, B. fragilis, M. gnavus) for rapid clinical ecotype assignment without full metagenomics. H1c demonstrated clinical-covariate-only assignment is not viable.
- Per-patient bile-acid panel for F. plautii BA-cost monitoring. Currently BA-coupling cost is ecotype-level (NB09c §13), not per-patient — clinical translation requires per-patient bile-acid measurements.
- Multi-cohort serology meta-analysis to firm up H3e PARTIAL (NB11). IBD anti-microbial antibody datasets outside HMP2 are not in BERDL; multi-cohort meta would resolve the cohort-aggregate vs site-stratified r=0.46 vs +0.31 gap.
- Expanded longitudinal sampling beyond patient 6967 to validate the state-dependent dosing rule. The rule is derived from n=1 trajectory; multi-patient longitudinal cohorts would test whether ecotype drift is common, whether M. gnavus 5× expansion is the right qPCR threshold, and whether cocktail Jaccard 0.60 generalizes.
- External cohort validation beyond HMP2 — additional cMD sub-studies with CD + nonIBD that populate E3 (currently only HallAB_2017 qualifies); non-Western IBD cohorts to test E2 Tier-A.
Long-term (24+ months) — clinical pilot territory
- Clinical pilot of hybrid 3-strategy cocktails in UC Davis CD cohort, organized by ecotype + per-patient + state-dependent dosing per NB17 roadmap.
- Lytic-locked phage engineering for M. gnavus if natural lytic phages remain unavailable post-INPHARED/IMG-VR query.
- GAG-degrading enzyme inhibitor screening for H. hathewayi (alternative to phage; per
ref_phage_biologytherapeutic_targets).
Methodology follow-ups (publication material)
- Within-substudy × within-ecotype design as publication material (Novel Contribution #8). The NB04e confound-free meta design is the project's most portable methodological contribution; worth writing up as a standalone methods paper for the broader cMD-style multi-cohort microbiome community.
- 3-modality MOFA+ on cMD_IBD pathway-aligned subjects (NB07d follow-up). NB07d ran 2-modality CCA on HMP2 because pathway data is not paired with HMP2 metabolomics. The cMD_IBD pathway slice in mart is paired with cMD_IBD species at sample level; 3-modality MOFA on cMD_IBD subjects would give species + pathway + (cMD_IBD-only) joint factors.
- Batch-corrected metabolite-feature ecotype refit (NB09d follow-up). NB09d found cohort-batch dominates m/z-bridge clustering (Novel Contribution #19). ComBat/SVA/RUV harmonization prior to PCA + K-means could reveal whether after batch correction, metabolite-feature ecotypes are more cross-cohort-portable.
- External polyamine cross-cohort replication (NB09b gap). NB09b m/z bridge fails on polyamines (low-mass HILIC-pos gap); HMP2 polyamine OR=14.6 stands as single-cohort finding. External Franzosa supplementary tables (with compound-name annotations) would close this gap.
Authors
- Adam Arkin (ORCID: 0000-0002-4999-2931), U.C. Berkeley / Lawrence Berkeley National Laboratory
Discoveries
NB09d found that pooling HMP2 + Franzosa metabolomics on the 122 m/z-bridge metabolite panel and clustering with PCA + K-means K=4 produces clusters that separate completely by cohort, not by diagnosis. PC1 explains 79 % of total variance and is essentially the cohort batch effect. Cross-cohort
Read more →Pool ≠ flux — pathway-DA and metabolite-DA can diverge in direction without contradicting each other
April 2026NB07 v1.8 §9 found 06_polyamine_urea was CD-DOWN at pathway-level (OR=0.42); NB09a §12 found polyamines as a metabolite-class are CD-UP at OR=14.6 — the largest theme-level metabolite effect in the project. Both correct: the metabolite pool reflects the difference between production and consumptio
NB09c §13 + NB10a §14 establish that pathobiont-targeting interventions need three axes of per-target annotation alongside the headline Tier-A score:
- Iron/AIEC-mediated vs other CD specialization mechanism (informs whether to target species broadly or AIEC-subset specifically)
- **Bile-acid co
Regex-on-pathway-names vs curator-validated class hierarchy: same data, opposite conclusion
April 2026A NB07a H3a (b) test of "do CD-up pathways concentrate in IBD-mechanism themes" gave opposite verdicts depending on category-schema choice — same DA outputs, same statistical test, same nulls.
- v1.7 (regex on pathway descriptive names): 7 a-priori IBD categories matched 44 of 409 prevalence-fi
Quantified cost of feature leakage + confound non-adjustment in Pillar 2. NB04 reported a 33-species within-ecotype Tier-A list (18 E1, 15 E3) with the H2c C. scindens paradox marked "RESOLVED by stratification." Two rigor-repair notebooks (NB04b + NB04c) applied three evidence filters to ever
Read more →Training-cohort OvR-AUC is a weak proxy for "does this classifier help at the patient level" whenever the cohort includes a strong cohort-axis variable that is constant (or nearly so) on the held-out patients. Concrete observation from ibd_phage_targeting NB03:
- Classifier trained on pooled HC +
When projecting a held-out cohort processed with a different taxonomic classifier (Kaiju NCBI-NR read classification) onto a reference embedding trained on a different classifier (MetaPhlAn3 marker-gene relative abundance), the two ecotype methods (LDA on pseudo-counts, GMM on CLR + PCA) behave very
Read more →Both LDA training perplexity (sklearn LatentDirichletAllocation) and GMM BIC (on CLR + PCA-20) are monotone with K on this data — they always prefer the largest K available. Held-out perplexity (5-fold) gives the same monotone signal at this sample size (~8.5K samples), so it doesn't help distin
K=4 LDA-GMM consensus on 8,489 CMD MetaPhlAn3 samples (5,333 HC + 3,156 IBD/other) yields four ecotypes that align with published gut-microbiome enterotype literature (Vandeputte 2017, Lloyd-Price 2019) and cleanly separate disease from healthy:
| Ecotype | n | Defining species | Diagnosis pattern
Read more →Historical static inventory files lagged the live lakehouse catalog and omitted
several databases that matter for gut-microbiome / phage work. Use the
access-aware BERDL notebook helpers for current discovery:
```python
from berdl_notebook_utils import get_databases, get_tables, get_table_schema
d
Read more →The UC Davis / Arkin CrohnsPhage data mart ships with a lineage.yaml (ETL provenance + changelog + known gaps), a schema_overview.yaml (table inventory by category), per-table *.yaml dictionaries (columns, dtypes, null counts, unique counts, sample values), and a ref_missing_data_codes table
The v2 preliminary IBD report (2026-03-28) called C. scindens CD-enriched at log₂FC = +2.67 from pooled Mann-Whitney DA. But C. scindens is a well-documented protective species (secondary bile-acid producer via 7α-dehydroxylation, TGR5 activator, ~79 % prevalence in healthy individuals, inhibits
Read more →Data Collections
KBase Genomes
kbase_genomes
KBase, DOE
PROTECT Pathogen Browser
protect_genomedepot
PROTECT
Pangenome Collection
kbase_ke_pangenome
KBase, DOE
Kescience Webofmicrobes
kescience_webofmicrobes
KE Science
Fitness Browser
kescience_fitnessbrowser
Price Lab, LBNL
Phenotype Collection
kbase_phenotype
KBase
Kescience Bacdive
kescience_bacdive
KE Science
Kescience Pdb
kescience_pdb
KE Science
Kescience Alphafold
kescience_alphafold
KE Science
Kescience Paperblast
kescience_paperblast
KE Science
Phagefoundry Strain Modelling
phagefoundry_strain_modelling
PhageFoundry
ModelSEED Biochemistry
kbase_msd_biochemistry
ModelSEED / Henry Lab
Atlas Reuse
Derived products, review objects, and tensions connected to this project in the BERIL Atlas.
Ecotype Assignments
Produced by 2 projects
source evidenceEcotype Assignments
Atlas derived product built partly from this project.
partially resolvedEcotype labels versus translational leakage
Ecotype labels are reusable stratification products, but translational target lists can collapse when labels and outcomes share leaked or confounded features.
Review
Summary
This is an exceptionally comprehensive and rigorous research project that successfully delivers on its ambitious 5-pillar framework for rational phage cocktail design in IBD. The project demonstrates remarkable methodological sophistication, with 31 notebooks spanning patient stratification, pathobiont identification, functional driver analysis, phage targetability assessment, and per-patient cocktail design. Particularly noteworthy is the project's self-correcting nature—when methodological issues were identified in NB04, the team conducted a complete retraction and rebuilt the analysis pipeline with 7 replacement notebooks, ultimately strengthening the conclusions. The final deliverable is a concrete clinical-translation framework with per-patient cocktail drafts for 61% of UC Davis patients and state-dependent dosing rules. While the scope is enormous (perhaps beyond what a typical research project would attempt), the execution is exemplary and the scientific contribution is substantial.
Methodology
The research approach is methodologically sophisticated and well-grounded. The 5-pillar structure (patient stratification → pathobiont identification → functional drivers → phage targetability → per-patient design) provides a logical framework that addresses the key challenges in rational phage therapy. The project demonstrates strong awareness of methodological pitfalls, as evidenced by the comprehensive docs/pitfalls.md documentation and the proactive retraction/rebuilding of NB04 when feature leakage and confounding issues were detected.
Strengths:
- Rigorous statistical approach: Uses compositional-aware differential abundance analysis (CLR-based), within-substudy meta-analysis to control for study-level confounding, and external replication on HMP2 (88.2% sign concordance)
- Multi-modal validation: Integrates taxonomy, metabolomics, pathways, BGCs, and serology across multiple evidence streams
- External validation: HMP2 external replication and cross-cohort metabolomics bridge provide independent confirmation
- Clear data provenance: Excellent documentation of data sources, with 10,774 samples across 19 studies and clear lineage tracking
- Systematic taxonomy reconciliation: Addresses GTDB r214+ renames with a comprehensive synonymy layer (2,417 aliases → 1,848 canonical species)
Areas for improvement:
- The scope is extraordinarily broad—31 notebooks may be pushing the limits of what can be reasonably maintained and reproduced
- Some analytical decisions could benefit from more explicit justification (e.g., the choice of K=4 ecotypes, specific thresholds for Tier-A scoring)
Code Quality
The code quality is generally high, with clear evidence of adherence to best practices and lessons learned from previous projects. The notebooks are well-structured with clear markdown documentation, and the project demonstrates awareness of common BERDL pitfalls.
Strengths:
- Comprehensive pitfall documentation: The docs/pitfalls.md file demonstrates excellent awareness of methodological issues and provides concrete solutions
- Robust data handling: Proper treatment of MetaPhlAn3 taxonomy synonymy, compositional data analysis, and cross-cohort normalization
- Clear analytical structure: Each notebook has a well-defined purpose with clear inputs/outputs and dependencies
- Error correction: The complete retraction and rebuilding of NB04 when methodological issues were detected shows appropriate scientific rigor
Technical issues addressed:
- Feature leakage in cluster-stratified differential abundance (properly resolved in NB04b-h)
- Substudy-level confounding in curatedMetagenomicData (addressed with within-substudy meta-analysis)
- Compositional data artifacts (addressed with CLR transformation)
- Cross-cohort batch effects in metabolomics (properly flagged with methodological lessons)
Findings Assessment
The scientific findings are well-supported by the data and represent significant contributions to the field. The conclusions are appropriately qualified with limitations clearly stated.
Major contributions:
- Multi-omics integration: Discovery that CD at the gut-microbiome level is "a single principal-direction phenomenon" (CC1 r=0.96) that unifies species and metabolite signatures
- Clinical framework: Concrete per-patient cocktail design framework with state-dependent dosing rules and clinical workflow
- Mechanistic insights: Two cross-corroborated mechanism narratives (iron-acquisition centered on E. coli AIEC; bile-acid 7α-dehydroxylation centered on F. plautii/E. lenta/E. bolteae)
- Methodological innovations: 24 numbered novel contributions spanning analytical methods and biological insights
Key findings:
- 4 reproducible IBD ecotypes with clear biological interpretation
- 6 actionable Tier-A pathobiont targets with quantitative scoring
- 5-phage E. coli cocktail design with 95% strain coverage
- Per-patient cocktail drafts for 61% of UC Davis patients
- State-dependent dosing framework with M. gnavus qPCR as clinical proxy
Appropriate limitations acknowledged:
- Pure phage cocktail infeasible for E1 ecotype—requires hybrid approach
- External database queries needed for gut anaerobe phage gaps (H. hathewayi, F. plautii)
- Single-cohort limitations for some analyses (serology, longitudinal stability)
- E. coli prevalence lower than expected in UC Davis cohort (35% vs. literature emphasis on AIEC)
Suggestions
-
Streamline for reproducibility: Consider consolidating some of the 31 notebooks into larger analytical units. While the current fine-grained structure aids understanding, it creates a significant maintenance burden.
-
Expand external validation: The HMP2 external replication for E1 Tier-A is excellent (88.2% concordance), but expanding external validation to additional cohorts would strengthen confidence in the ecotype framework.
-
Clinical translation roadmap: The 4-phase roadmap (Immediate/Near/Mid/Long-term) is well-structured. Prioritize the immediate-term INPHARED + IMG/VR database queries for the 3 phage-GAP species, as these would directly impact clinical feasibility.
-
Strain-resolution diagnostics: Develop the AIEC strain-resolution diagnostic mentioned in the roadmap, as this is critical for E. coli targeting (only 35% of patients carry detectable E. coli).
-
Metabolomics batch correction: Implement the batch correction methods flagged in NB09d for future cross-cohort metabolomics clustering attempts.
-
Longitudinal validation: Expand beyond patient 6967 to validate state-dependent dosing rules across multiple patients with longitudinal sampling.
-
Documentation consolidation: Consider creating an executive summary notebook that walks through the key findings across all 5 pillars, as the current scope may overwhelm some readers.
-
Clinical pilot design: Begin detailed planning for the hybrid 3-strategy cocktail clinical pilot identified in the long-term roadmap.
This review was generated by an AI system. It should be treated as advisory input, not a definitive assessment.
Visualizations
Nb00 Cohort Summary
Nb00 Missingness Heatmap
Nb00 Protective Species Paradox
Nb00 Ucdavis Cohort
Nb01 Ecotype By Diagnosis
Nb01 Ecotype Species Heatmap
Nb01 Gmm K Selection
Nb01 Lda K Selection
Nb01B K Selection
Nb01B Ecotype By Diagnosis
Nb01B Ecotype Species Heatmap
Nb02 Ucdavis Ecotype Assignment
Nb02 Ucdavis Ecotype X Clinical
Nb03 Feature Importance
Nb03 H1C Auc
Nb04 H2C Paradox Resolution
Nb04 Top Tier A Per Ecotype
Nb04B Jaccard Null
Nb04F Loso Stability
Nb04G Pathway Vs Taxon Ecotype
Nb04H Hmp2 External Replication
Nb05 Tier A Scored
Nb06 Cooccurrence Networks
Nb07 H3A V18 Class Enrichment
Nb07A H3A Falsifiability
Nb07B Stratified H3A B
Nb07C Anchor Pathobiont Coupling
Nb07D Mofa Pilot
Nb08A Bgc Pathobiont Enrichment
Nb09A Metabolomics Cd Vs Nonibd
Nb09B Cross Cohort Metabolomics
Nb09C Cross Feeding Disambiguation
Nb09D Metabolite Ecotype Stability
Nb10A Kumbhari Strain Adaptation
Nb11 Serology Pathobiont
Nb12 Phage Targetability
Nb13 Phagefoundry Cocktail
Nb14 Endogenous Phageome
Nb15 Patient Cocktail Draft
Nb16 Longitudinal Dosing
Nb17 Synthesis
Notebooks
NB00_data_audit.ipynb
Nb00 Data Audit
View notebook →
NB01_ecotype_training.ipynb
Nb01 Ecotype Training
View notebook →
NB01b_ecotype_refit.ipynb
Nb01B Ecotype Refit
View notebook →
NB02_ecotype_projection.ipynb
Nb02 Ecotype Projection
View notebook →
NB03_clinical_ecotype_classifier.ipynb
Nb03 Clinical Ecotype Classifier
View notebook →
NB04_within_ecotype_DA.ipynb
Nb04 Within Ecotype Da
View notebook →
NB04b_analytical_rigor_repair.ipynb
Nb04B Analytical Rigor Repair
View notebook →
NB04c_rigor_repair_completion.ipynb
Nb04C Rigor Repair Completion
View notebook →
NB04d_stopping_rule.ipynb
Nb04D Stopping Rule
View notebook →
NB04e_option_A_viability.ipynb
Nb04E Option A Viability
View notebook →
NB04f_loso_ecotype_stability.ipynb
Nb04F Loso Ecotype Stability
View notebook →
NB04g_pathway_ecotype_refit.ipynb
Nb04G Pathway Ecotype Refit
View notebook →
NB04h_hmp2_external_replication.ipynb
Nb04H Hmp2 External Replication
View notebook →
NB05_tier_a_scoring.ipynb
Nb05 Tier A Scoring
View notebook →
NB06_cooccurrence_networks.ipynb
Nb06 Cooccurrence Networks
View notebook →
NB07a_pathway_DA_H3a_falsifiability.ipynb
Nb07A Pathway Da H3A Falsifiability
View notebook →
NB07b_stratified_pathway_DA.ipynb
Nb07B Stratified Pathway Da
View notebook →
NB07c_anchor_pathobiont_coupling.ipynb
Nb07C Anchor Pathobiont Coupling
View notebook →
NB07d_mofa_pilot.ipynb
Nb07D Mofa Pilot
View notebook →
NB08a_bgc_pathobiont_enrichment.ipynb
Nb08A Bgc Pathobiont Enrichment
View notebook →
NB09a_metabolomics_cd_vs_nonibd.ipynb
Nb09A Metabolomics Cd Vs Nonibd
View notebook →
NB09b_cross_cohort_metabolomics.ipynb
Nb09B Cross Cohort Metabolomics
View notebook →
NB09c_cross_feeding_disambiguation.ipynb
Nb09C Cross Feeding Disambiguation
View notebook →
NB09d_metabolite_ecotype_stability.ipynb
Nb09D Metabolite Ecotype Stability
View notebook →
NB10a_kumbhari_strain_adaptation.ipynb
Nb10A Kumbhari Strain Adaptation
View notebook →
NB11_serology_pathobiont.ipynb
Nb11 Serology Pathobiont
View notebook →
NB12_phage_targetability.ipynb
Nb12 Phage Targetability
View notebook →
NB13_phagefoundry_cocktail.ipynb
Nb13 Phagefoundry Cocktail
View notebook →
NB14_endogenous_phageome.ipynb
Nb14 Endogenous Phageome
View notebook →
NB15_patient_cocktail_draft.ipynb
Nb15 Patient Cocktail Draft
View notebook →
NB16_longitudinal_dosing.ipynb
Nb16 Longitudinal Dosing
View notebook →
NB17_synthesis.ipynb
Nb17 Synthesis
View notebook →
Data Files
| Filename | Size |
|---|---|
nb04d_stopping_rule_verdict.json |
2.2 KB |
nb04e_option_A_viability.json |
0.8 KB |
nb04f_loso_verdict.json |
1.6 KB |
nb04g_pathway_ecotype_verdict.json |
0.7 KB |
nb04h_hmp2_replication_verdict.json |
0.9 KB |
nb05_tier_a_verdict.json |
1.7 KB |
nb06_verdict.json |
0.8 KB |
nb07_h3a_v18_verdict.json |
0.9 KB |
nb07a_h3a_verdict.json |
1.3 KB |
nb07b_h3a_b_species_verdict.json |
1.5 KB |
nb07c_h3a_new_verdict.json |
1.2 KB |
nb07d_mofa_pilot_verdict.json |
1.7 KB |
nb08a_h3c_verdict.json |
0.9 KB |
nb09a_h3d_da_verdict.json |
0.7 KB |
nb09b_cross_cohort_verdict.json |
2.4 KB |
nb09c_cross_feeding_verdict.json |
0.5 KB |
nb09d_h3d_clust_verdict.json |
0.8 KB |
nb10a_h3b_verdict.json |
0.8 KB |
nb11_h3e_verdict.json |
0.5 KB |
nb12_phage_targetability_verdict.json |
2.1 KB |
nb13_phagefoundry_cocktail_verdict.json |
2.1 KB |
nb14_endogenous_phageome_verdict.json |
0.7 KB |
nb15_pillar5_cocktail_verdict.json |
1.4 KB |
nb16_longitudinal_verdict.json |
1.2 KB |
nb17_final_verdict.json |
8.3 KB |