141
Atlas pages
9
Topics
73
Data pages
14
BERDL tenants
48/48
Collection coverage
41/41
Evidence-backed pages
6/8
Reused products
4
Tracked tensions
12
Opportunities

Start Here

Open overview page
How to read the Atlas: start with a topic for synthesis, data for reusable collections and joins, or claims/hypotheses when you need agent-sized reasoning units.

Generated Maps

Built from Atlas frontmatter and inventory signals

Topic Landscape

Science topics are the first interpretive map: each one connects projects to reusable claims, directions, hypotheses, and data.

Critical Minerals and Metal Biology

Progressive synthesis of metal fitness, tolerance, validation, and critical-mineral research opportunities.

Metal-specific genes remain core-enriched Lanthanide-dependent methylotrophy is widespread and soil-linked Metal type diversity predicts ecological niche breadth Gene targets for critical-mineral bioprocessing 10 sources 5 collections
AMR, Resistance Ecology, and Co-selection

Synthesis of AMR gene distribution, fitness cost, cofitness support networks, environment structure, and metal co-selection opportunities.

AMR mechanism composition is environment-structured Metal-specific genes remain core-enriched Prophage density predicts AMR repertoire breadth Metal type diversity predicts ecological niche breadth 8 sources 4 collections
Pangenome Architecture and Gene-Content Evolution

Cross-project synthesis of pangenome openness, core/accessory structure, functional composition, conservation, and gene-content tradeoffs.

Pangenome openness shapes functional opportunity Open pangenomes carry broader pathway diversity Genomes and Pangenomes Pangenome Openness Metrics 7 sources 3 collections
Fitness-Validated Gene Function

Synthesis of essential genes, metabolic dependency, ICA modules, dark genes, and functional annotation repair through fitness evidence.

Lab fitness can predict field ecology Metal-specific genes remain core-enriched Fitness Phenotypes Genome-Fitness-Pangenome Join 7 sources 4 collections

9 topics integrate 69 project references.

Data Landscape

Data pages separate physical BERDL collections from cross-collection analytical roles and reusable outputs.

Pangenome Collection

Pangenome data for 293,059 genomes across 27,690 microbial species derived from GTDB r214. Includes core/accessory gene classification, functional annotations, and ANI relationships.

KBase 72 sources 10 collections
Fitness Browser

Gene fitness data from transposon mutant experiments across 40+ bacterial organisms. Identify essential genes and condition-specific fitness effects.

KE Science 45 sources 4 collections
ENIGMA CORAL

Subsurface microbial ecology and geochemistry data from the ENIGMA SFA project at the Oak Ridge Reservation (ORR), Tennessee. Covers environmental sampling (groundwater from boreholes), 16S amplicon community profiling, isolated bacterial strains with genomes, and geochemical measurements. Includes the SSO (subsurface observatory) site with spatial and temporal community profiles.

ENIGMA 25 sources 10 collections
Nmdc Metadata

Discovered BERDL database. Add curated description as this collection is used.

NMDC 15 sources 47 collections

48/48 collections have Atlas pages.

Claim to Experiment Flow

The Atlas should let a reader move from synthesis to reusable claims, then into directions and concrete hypotheses.

Claims

Evidence-backed statements reused across topics.

Metal-specific genes remain core-enriched Lab fitness can predict field ecology AMR mechanism composition is environment-structured Ecotype analyses need rigor gates before translation 8 sources
Directions

High-value research opportunities grounded in existing work.

Gene targets for critical-mineral bioprocessing Metal-AMR co-selection at contaminated sites Rare-earth gene discovery via cross-metal inference Fitness-validated community design 5 sources
Hypotheses

Testable units that can become projects or experiments.

Bakta reannotation resolves novel metal families Structures support metal-binding or transport roles Metal contamination co-selects AMR mechanisms Metal tolerance scores predict field isolation context 8 sources
Opportunities

Concrete next analyses prioritized from Atlas evidence and reusable products.

Rare-Earth RB-TnSeq Design Metal-AMR Site Co-Selection Analysis Ecotype Label Validation Benchmark Lab-to-Field Fitness Transfer Audit 12 sources

8 claims, 5 directions, 8 hypotheses, and 12 opportunities are currently indexed.

Derived Product Reuse

Reusable scores, labels, mappings, and joins are the outputs most likely to compound across projects.

Metal Tolerance Scores

Reusable species and gene-level metal tolerance signals derived from metal fitness projects and environmental validation work.

4 sources 3 collections
Ecotype Assignments

Reusable within-species or community ecotype labels that support environmental validation, microbiome stratification, and downstream hypothesis tests.

4 sources 3 collections
Environment Harmonization Labels

Reusable environment category and coordinate-quality labels that make cross-collection ecology joins safer.

3 sources 4 collections
Dark Gene Prioritization Tables

Reusable ranked dark-gene candidates, covering sets, and experiment plans derived from fitness, pangenome, annotation, and ecology evidence.

3 sources 4 collections

8 derived products are currently tracked from 74 data pages.

Tension and Resolution

Apparent conflicts are preserved as reviewable objects with evidence on multiple sides and resolving work.

Lab fitness signals versus field ecology

Lab-derived fitness or tolerance scores can predict ecology in some settings, but metadata quality and field complexity limit broad generalization.

4 sources 3 collections
Ecotype labels versus translational leakage

Ecotype labels are reusable stratification products, but translational target lists can collapse when labels and outcomes share leaked or confounded features.

4 sources 3 collections
Metal specificity versus general stress

Metal fitness hits include both specific metal biology and broad stress response, so engineering targets need specificity filters and counter-ion controls.

5 sources 2 collections
Metal-AMR co-selection readiness

BERIL has the pieces to test metal-AMR co-selection, but the current evidence is a strong opportunity rather than a resolved result.

10 sources 4 collections

4 tension pages track where synthesis needs reconciliation.

Metrics To Watch

Open metrics method
48/48

Collection Coverage

Canonical BERDL databases with data_collection Atlas pages.

47

Cross-Collection Reuse

Projects that combine two or more BERDL collections.

26

Under-Explored Collections

Collections with no parsed project references yet.

37

Dark-Matter Metadata

Collections needing stronger curation or complete schema discovery.

42

Caveat Load

Low-confidence Atlas pages that need review before heavy reuse.

41/41

Evidence Coverage

Claims, directions, hypotheses, derived products, and opportunities with evidence metadata.

6/8

Derived Product Reuse

Promoted derived products with at least one declared downstream project.

4

Unresolved Tensions

Conflict pages still needing resolving analysis or experiments.

12

Opportunity Coverage

Concrete next analyses connected to Atlas evidence, products, and tensions.

0

Blocked Opportunities

Opportunity pages that cannot proceed until prerequisite data or review is available.

9/9

Topic Drill-Down Depth

Topic pages with enough sections and related links to support progressive disclosure.

topic medium confidence

Critical Minerals and Metal Biology

Progressive synthesis of metal fitness, tolerance, validation, and critical-mineral research opportunities.

10 projects 5 collections
topic medium confidence

AMR, Resistance Ecology, and Co-selection

Synthesis of AMR gene distribution, fitness cost, cofitness support networks, environment structure, and metal co-selection opportunities.

8 projects 4 collections
topic medium confidence

Pangenome Architecture and Gene-Content Evolution

Cross-project synthesis of pangenome openness, core/accessory structure, functional composition, conservation, and gene-content tradeoffs.

7 projects 3 collections
topic medium confidence

Fitness-Validated Gene Function

Synthesis of essential genes, metabolic dependency, ICA modules, dark genes, and functional annotation repair through fitness evidence.

7 projects 4 collections
topic medium confidence

Microbial Ecotypes, Environment, and Field Validation

Synthesis of species-level ecotypes, environmental embeddings, lab-field validation, ENIGMA ecology, and metadata limitations.

12 projects 8 collections
topic medium confidence

Host Microbiome Translation

Synthesis of IBD phage targeting, formulation design, metabolomics caveats, patient stratification, and intervention cost accounting.

5 projects 4 collections
topic medium confidence

Plant Microbiome Function and Agriculture

Synthesis of plant-associated microbial function, beneficial/pathogenic duality, compartment structure, PGP markers, and pangenome ecology.

5 projects 3 collections
topic low confidence

Mobile Elements, Phage, and Genome Plasticity

Synthesis of phage ecology, prophage signals, defense systems, mobile-element gene flow, and intervention relevance.

8 projects 4 collections
topic medium confidence

Metabolic Capability, Dependency, and Community Design

Synthesis of GapMind capability, fitness dependency, metabolic models, community ecology, and design-ready derived data.

7 projects 6 collections

Collections

48 pages

Data Types

10 pages

Joins

1 pages

Gaps

2 pages
candidate benchmark

Ecotype Label Validation Benchmark

Build a benchmark that tests whether ecotype labels survive stricter metadata, batch, and holdout validation.

impact high readiness high 1 tensions
candidate analysis

Metal-AMR Site Co-Selection Analysis

Test whether metal contamination at BER-relevant sites co-selects antibiotic resistance mechanisms using metal fitness, AMR profiles, and environmental metadata.

impact high readiness high 1 tensions
candidate analysis

Dark Gene Structure Prioritization

Prioritize dark gene families for mechanistic review by joining fitness, cofitness, annotation novelty, and AlphaFold structure signals.

impact high readiness medium 1 tensions
candidate validation

Lab-to-Field Fitness Transfer Audit

Audit where laboratory fitness effects predict field ecology and where geochemistry, taxonomy, or metadata completeness blocks transfer.

impact high readiness medium 1 tensions
candidate experiment

Rare-Earth RB-TnSeq Design

Design the first rare-earth fitness experiment by ranking candidate genes from cross-metal specificity, conservation, annotation, and structure evidence.

impact high readiness medium 1 tensions
candidate analysis

CF Formulation Score Reuse Test

Find a first downstream consumer for CF formulation scores by testing whether ranked carbon contexts improve strain or community design decisions.

impact medium readiness high 0 tensions
candidate productization

Derived Product Readiness Burn-Down

Review candidate and promoted derived products to close missing consumers, artifacts, caveats, and review routes before they become default inputs.

impact medium readiness high 0 tensions
candidate analysis

Functional Innovation KO Atlas Reuse Test

Test whether the Functional Innovation KO Atlas helps explain pangenome, pathway, or plant-microbiome signals beyond generic annotation summaries.

impact medium readiness high 0 tensions
candidate curation

Low-Confidence Collection Curation

Reduce Atlas caveat load by upgrading high-value low-confidence collection pages with schemas, reuse examples, and missing-data labels.

impact medium readiness high 0 tensions

Research Primitives

Start with claims
claim medium

Metal-specific genes remain core-enriched

Metal-specific genes are functionally distinct from general stress genes but remain enriched in the core genome.

claim medium

Lab fitness can predict field ecology

Several projects suggest that lab-measured fitness signals can align with environmental abundance or isolation context when validation data are available.

claim medium

AMR mechanism composition is environment-structured

AMR mechanisms differ by environment, making resistance ecology a field and context problem rather than only a clinical annotation problem.

claim high

Ecotype analyses need rigor gates before translation

Ecotype-derived target lists can collapse under leakage, confound, and independent-evidence checks, so translation requires explicit gates.

claim medium

Lanthanide-dependent methylotrophy is widespread and soil-linked

XoxF markers are far more common than canonical MxaF markers across the BERDL pangenome, with strong soil/sediment enrichment and important marker-calibration caveats.

claim medium

Pangenome openness shapes functional opportunity

Pangenome openness and gene-content class affect which functions are stable, variable, mobile, or available for niche adaptation.

claim medium

Prophage density predicts AMR repertoire breadth

Pangenome-scale prophage marker density is a strong species-level predictor of AMR breadth, while gene-level AMR-prophage proximity is weaker and threshold-sensitive.

claim medium

Metal type diversity predicts ecological niche breadth

Genus-level metal resistance type diversity predicts broader ecological niche breadth after phylogenetic control, while total AMR burden is less informative.

direction medium

Gene targets for critical-mineral bioprocessing

Use metal-specific fitness signals, annotations, modules, and structures to prioritize genes for bioleaching and biorecovery.

direction medium

Metal-AMR co-selection at contaminated sites

Test whether metal contamination selects for AMR genes, mechanisms, or support networks across DOE-relevant environments.

direction low

Rare-earth gene discovery via cross-metal inference

Use cross-metal response structure to rank candidates for first rare-earth-element fitness experiments.

direction medium

Fitness-validated community design

Design microbial communities using tolerance, metabolic capability, measured dependency, and risk annotations.