Opportunity Hooks

Environment, Geochemistry, and Ecology

Why This Lens Exists

Tenant and database boundaries do not match how scientists or agents plan analyses. This data-type lens groups collections by the kind of evidence they provide and the questions they make easier to ask.

Collections In This Lens

  • enigma_coral
  • planetmicrobe_planetmicrobe
  • planetmicrobe_planetmicrobe_raw
  • nmdc_ncbi_biosamples
  • kbase_ke_pangenome

Best Uses

Use this lens to find reusable source data before choosing a specific collection. It is especially useful for agents that need to identify complementary data, select join keys, or explain why a derived product is reusable across projects.

Recent project use: Harvard Forest warming uses NMDC metadata/results to expose omics-layer and design confounds; MicrobeAtlas metal ecology uses global 16S ecology to estimate niche breadth; MGnify and soil metal projects expose coordinate coverage, geochemistry joins, and spatial-validation requirements.

Metrics To Watch

  • Evidence traces: which projects already used these collections.
  • Reuse: whether derived products exist beyond a single project.
  • Under-explored combinations: collection pairs with obvious scientific value but few projects.
  • Caveat load: schema gaps, identifier mismatches, sampling bias, and staging status.
  • Field validation quality: coordinate coverage, spatial blocking, project effects, confound structure, and effect-size reporting.

Caveats

Do not treat collection co-membership as proof that a join is valid. A join recipe should name stable identifiers, filtering rules, and failure modes before it supports a claim.