Research Areas
Projects automatically grouped by shared research themes, data dependencies, and collections.
Metal & Tolerance
9 projects
BacDive Isolation Environment × Metal Tolerance Prediction
Completed: Do bacteria isolated from metal-contaminated environments have higher predicted metal tolerance scores than bacteria from uncontaminated environments?
BacDive Phenotype Signatures of Metal Tolerance
Completed: Can BacDive-measured bacterial phenotypes (Gram stain, oxygen tolerance, metabolite utilization, enzyme activities) predict metal tolerance as measure...
Counter Ion Effects on Metal Fitness Measurements
Completed: When bacteria are exposed to metal salts (CoCl₂, NiCl₂, CuCl₂), how much of the observed fitness effect is caused by the metal cation versus the count...
Gene-Resolution Metal Cross-Resistance Across Diverse Bacteria
Completed: Is the genetic architecture of metal cross-resistance conserved across phylogenetically diverse bacteria, or is it rewired species by species?
Pan-Bacterial Metal Fitness Atlas
Completed: Across diverse bacteria subjected to genome-wide fitness profiling under metal stress, what is the genetic architecture of metal tolerance — is it enc...
Global Biogeography of Environmental Bacterial Metal Resistance and Spatial Sampling Gaps
Completed: Where on Earth are metal-resistant environmental bacteria most abundant, and where are the gaps in public metagenomic sampling that prevent us from kn...
Metal-Specific vs General Stress Genes in the Metal Fitness Atlas
Completed: Among the 12,838 metal-important genes identified by the Metal Fitness Atlas, which are specifically required for metal tolerance vs general stress su...
Metal Resistance Ecology: Phylogenetic Conservation vs. Environmental Selection
Completed: Do metal-resistance functions in the global microbiome reflect phylogenetic constraint (conserved in lineages) or environmental selection (enriched in...
Soil Metal Concentrations Drive Functional Gene Shifts in the Environmental Microbiome
Completed: Which microbial functional gene categories (COGs) are significantly associated with soil metal concentrations at global scale, and do these associatio...
Essential & Core
6 projects
Gene Conservation, Fitness, and the Architecture of Bacterial Genomes
Completed: How does a gene's importance for bacterial survival relate to its evolutionary conservation, and what does the conserved genome actually look like?
Conservation vs Fitness -- Linking FB Genes to Pangenome Clusters
Completed: Are essential genes preferentially conserved in the core genome, and what functional categories distinguish essential-core from essential-auxiliary ge...
The 5,526 Costly + Dispensable Genes
Completed: What characterizes genes that are simultaneously burdensome (fitness improves when deleted) and not conserved in the pangenome? Are they mobile elemen...
The Pan-Bacterial Essential Genome
Completed: Which essential genes are conserved across bacteria, which are context-dependent, and can we predict function for uncharacterized essential genes usin...
The Pan-Bacterial Essential Metabolome
Completed: Which biochemical reactions are universally essential across bacteria, and what does the essential metabolome reveal about the minimal core metabolism...
Fitness Effects vs Conservation -- Quantitative Analysis
Completed: Is there a continuous gradient from essential genes (core) to dispensable genes (accessory) across the full fitness spectrum, and what does the fitnes...
Openness & Open
4 projects
Openness vs Functional Composition
In Progress: Do species with open pangenomes show different COG functional enrichment patterns than species with closed pangenomes?
Pangenome Openness Analysis
Completed: Do open pangenomes show different patterns of environmental vs phylogenetic effects compared to closed pangenomes?
Pangenome Openness, Metabolic Pathways, and Phylogenetic Distances
In Progress: How do pangenome characteristics (open vs. closed) correlate with metabolic pathway completeness, phylogenetic distances, and species ecology?
Pangenome Openness, Metabolic Pathways, and Biogeography
In Progress: Do pangenome characteristics (open vs. closed) correlate with metabolic pathway diversity and biogeographic distribution patterns?
Metabolic & Capability
3 projects
Annotation-Gap Discovery via Phenotype-Fitness-Pangenome-Gapfilling Integration
Completed: Can we systematically identify and resolve metabolic annotation gaps in bacterial genomes by integrating experimental growth phenotypes, gene fitness ...
Metabolic Capability vs Metabolic Dependency
Completed: Just because a bacterium's genome encodes a complete metabolic pathway (metabolic *capability*), does the organism actually depend on it? Can we disti...
Metabolic Capability vs Dependency
Completed: When a bacterium's genome encodes a complete biosynthetic or catabolic pathway, does the organism actually depend on it? Can we use fitness data to di...
Modules & Core
3 projects
Co-fitness Predicts Co-inheritance in Bacterial Pangenomes
Completed: Do genes with correlated fitness profiles (co-fit) tend to co-occur in the same genomes across a species' pangenome? Does functional coupling constrai...
Pan-bacterial Fitness Modules via Independent Component Analysis
Completed: Can we decompose RB-TnSeq fitness compendia into latent functional modules via robust ICA, align them across organisms using orthology, and use module...
Fitness Modules x Pangenome Conservation
Completed: Are ICA fitness modules enriched in core pangenome genes, and do cross-organism module families map to the core genome?
Environmental & Environment
3 projects
Ecotype Correlation Analysis
Completed: What drives gene content similarity between bacterial genomes: environmental similarity or phylogenetic relatedness?
Ecotype Reanalysis: Environmental-Only Samples
Completed: Does the environment effect on gene content become stronger when analysis is restricted to genuinely environmental samples, excluding human-associated...
AlphaEarth Embeddings, Geography & Environment Explorer
Completed: What do AlphaEarth environmental embeddings capture, and how do they relate to geographic coordinates and NCBI environment labels?
Resistance & Antibiotic
2 projects
Prophage-AMR Co-mobilization Atlas
Completed: At pangenome scale (293K genomes, 27K species), are antibiotic resistance genes preferentially located within or adjacent to prophage regions, and doe...
Antibiotic Resistance Hotspots in Microbial Pangenomes
In Progress: Which microbial species and ecological environments show the highest concentration of antibiotic resistance genes, and can we predict resistance accum...
Subsurface & Clay
2 projects
Subsurface Bacillota_B Specialization
Completed: Within the Bacillota_B phylum (Desulfosporosinus, BRH-c8a Peptococcaceae, BRH-c4a Desulfotomaculales, and other obligate-anaerobe deep-subsurface Firm...
Self-Sufficiency, Anaerobic Toolkit, and Cultivation Bias in Clay-Confined Cultured Bacterial Genomes
Completed: Do BERDL's cultured bacterial genomes from clay-confined deep-subsurface environments recapitulate the genomic signatures the recent literature has id...
Functional & Differentiation
2 projects
COG Functional Category Analysis
Completed: How do COG functional category distributions differ across core, auxiliary, and novel genes in bacterial pangenomes?
Ecotype Functional Differentiation
Completed: Do gene-content ecotypes within bacterial species differ in their COG functional profiles, showing differentiation in adaptive functions while sharing...
Distribution & BERDL
2 projects
PGP Gene Distribution Across Environments & Pangenomes
Completed: Does environmental selection shape the distribution of plant growth-promoting (PGP) bacterial genes across the BERDL pangenome (293K genomes, 27K spec...
SNIPE Defense System: Prevalence and Taxonomic Distribution in the BERDL Pangenome
Completed: How prevalent are SNIPE (Surface-associated Nuclease Inhibiting Phage Entry) homologues across the 293K-genome BERDL pangenome, and does their taxonom...
Environmental & Resistance
2 projects
Environmental Resistome at Pangenome Scale
Completed: Do antimicrobial resistance gene profiles differ between ecological niches across 27,000 bacterial species? Using 83K AMR gene clusters mapped to 293K...
Pan-Bacterial AMR Gene Landscape
Completed: What is the distribution, conservation, phylogenetic structure, functional context, and environmental association of antimicrobial resistance (AMR) ge...
Resistance & Antimicrobial
2 projects
AMR Co-Fitness Support Networks
Completed: What genes are co-regulated with antimicrobial resistance (AMR) genes across growth conditions, and do these "support networks" explain the uniform fi...
Fitness Cost of Antimicrobial Resistance Genes
Completed: Do antimicrobial resistance (AMR) genes impose a fitness cost in the absence of antibiotic selection pressure? Using genome-wide RB-TnSeq fitness data...
Field & Effects
2 projects
Field vs Lab Gene Importance in Desulfovibrio vulgaris Hildenborough
Completed: Which genes matter for survival under environmentally-realistic conditions but appear dispensable in the lab, and vice versa? Do field-relevant fitnes...
Lab Fitness Predicts Field Ecology at Oak Ridge
Completed: Do lab-measured fitness effects under contaminant stress predict the field abundance of Fitness Browser organisms across Oak Ridge groundwater sites w...
Core & Paradox
2 projects
Core Gene Paradox -- Why Are Core Genes More Burdensome?
Completed: Why are core genome genes MORE likely to show positive fitness effects when deleted, and what functions and conditions drive this burden paradox?
Temporal Core Genome Dynamics
In Progress: How does core genome composition change over sampling time, and do genes transition in and out of core status?
Explorer & Kescience
2 projects
PaperBLAST Data Explorer
Completed: What does the `kescience_paperblast` collection contain, how current is it, and what are its coverage patterns across organisms, domains of life, and ...
Web of Microbes Data Explorer
Completed: What does the `kescience_webofmicrobes` exometabolomics collection contain, which organisms overlap with the Fitness Browser, and how well do metaboli...
Dark & Them
2 projects
Functional Dark Matter — Experimentally Prioritized Novel Genetic Systems
Completed: Which genes of unknown function across 48 bacteria have strong fitness phenotypes, and can biogeographic patterns, pathway gap analysis, and cross-org...
Truly Dark Genes — What Remains Unknown After Modern Annotation?
Completed: Among the ~6,400 Fitness Browser genes that remain functionally unannotated even after bakta v1.12.0 reannotation, what distinguishes them from "annot...
Independent Studies
27 projects
Acinetobacter baylyi ADP1 Data Explorer
Completed: What is the scope and structure of a comprehensive ADP1 database, and how do its annotations, metabolic models, and phenotype data intersect with BERD...
ADP1 Deletion Collection Phenotype Analysis
Completed: What is the condition-dependent structure of gene essentiality in *Acinetobacter baylyi* ADP1, as revealed by the de Berardinis single-gene deletion c...
ADP1 Triple Essentiality Concordance
In Progress: Among genes that TnSeq says are dispensable in *Acinetobacter baylyi* ADP1, does FBA correctly predict which ones have growth defects? Can direct muta...
AlphaFold MSA Depth as a Lens on the Bacterial Annotation Gap
Completed: Does AlphaFold MSA depth predict functional annotation richness in the bacterial pangenome, and is structural novelty (low MSA depth) systematically e...
Within-Species AMR Strain Variation
Completed: Within a species, how does the AMR repertoire vary between strains, and what drives that variation?
Aromatic Catabolism Support Network in ADP1
Completed: Why does aromatic catabolism in *Acinetobacter baylyi* ADP1 require Complex I (NADH dehydrogenase), iron acquisition, and PQQ biosynthesis when growth...
BERDL Data Atlas — Inventory, Topic Map, and Cross-Reference Synergies
Completed: What data is available in BERDL (across tenants, agencies, and programs), what biological topics does it cover, and where do datasets amplify each oth...
Caulobacter Fur–Lipid A Loss
In Progress: Why does inactivation of *fur* (the ferric uptake regulator) permit the loss of lipid A in *Caulobacter crescentus*, when no equivalent connection to ...
CF Protective Microbiome Formulation Design
In Progress: Can we build a multi-criterion framework that explains measured *P. aeruginosa* PA14 inhibition from metabolic competition, growth kinetics, and patie...
ENIGMA Carbon Census 1
Completed: For 83 groundwater- and necromass-derived carbon compounds proposed for community enrichment and isolate phenotyping, what is known about (1) their en...
Contamination Gradient vs Functional Potential in ENIGMA Communities
Completed: Do high-contamination Oak Ridge groundwater communities show enrichment for taxa with higher inferred stress-related functional potential compared wit...
SSO Subsurface Community Ecology — Spatial Structure, Functional Gradients, and Hydrogeological Drivers
Completed: Does 16S community similarity across the 9 ENIGMA SSO wells (3x3 grid, ~4 m span at Oak Ridge) recapitulate the spatial arrangement in X, Y, and Z? Wh...
Metabolic Consistency of Pseudomonas FW300-N2E3
Completed: For *Pseudomonas fluorescens* FW300-N2E3 (ENIGMA groundwater isolate), how consistent are exometabolomic outputs (Web of Microbes), genome-wide gene f...
Gene Function Ecological Agora
Completed: Across the prokaryotic tree (GTDB r214; 293,059 genomes / 27,690 species), build a multi-resolution **innovation + acquisition-depth atlas of bacteria...
Genotype × Condition → Phenotype Prediction from ENIGMA Growth Curves
In Progress: Can we predict bacterial growth phenotype — at multiple resolutions from binary growth through continuous kinetics to complex dynamics — from genome c...
Harvard Forest Long-Term Warming — DNA vs RNA Functional Response
Completed: After ~25 years of +5°C experimental soil warming at the Harvard Forest Barre Woods plot, does the functional transcript pool (metatranscriptome KO/Pf...
Metagenome-Prioritized Phage Cocktails for Crohn's Disease and IBD
Completed: Which bacterial pathobionts are **enriched, ubiquitous, and non-protective** in IBD, UC, and Crohn's disease patients — considered both across indicat...
Lanthanide Methylotrophy Atlas
Completed: Across BERDL's 293K-genome pangenome, what is the phylogenomic distribution and cassette-completeness of the lanthanide-dependent methanol/ethanol oxi...
Lignin Enrichment and Ecological Memory in Microbial Communities
Completed: How does sequential enrichment on lignin, with or without labile carbon supplementation, shape bacterial and fungal community composition — and does e...
Community Metabolic Ecology via NMDC × Pangenome Integration
Completed: Do the GapMind-predicted pathway completeness profiles of community resident taxa predict or correlate with observed metabolomics profiles in NMDC env...
Polyhydroxybutyrate Granule Formation Pathways: Distribution Across Clades and Environmental Selection
Completed: How are polyhydroxybutyrate (PHB) granule-forming pathways distributed across bacterial clades and environments, and does this distribution support th...
Plant Microbiome Ecotypes
Completed: What is the genomic basis for plant-microbe associations across different plant compartments (rhizosphere, root, phyllosphere, endophyte)? Can we clas...
Prophage Gene Modules and Terminase-Defined Lineages Across Bacterial Phylogeny and Environmental Gradients
Completed: How are prophage gene modules and terminase-defined prophage lineages distributed across bacterial phylogeny and environmental gradients, and which mo...
Carbon Source Utilization Predicts Ecology and Lifestyle in Pseudomonas
Completed: Among free-living *Pseudomonas* clades, does the carbon source utilization profile predict the soil ecosystem type from which strains were isolated — ...
Condition-Specific Respiratory Chain Wiring in ADP1
Completed: How is *Acinetobacter baylyi* ADP1's branched respiratory chain wired across carbon sources — which NADH dehydrogenases and terminal oxidases are requ...
Soil Microbial Dark Matter: Genomic Frontiers and the Clay Shield Null Result
Completed
T4SS-CAZy Environmental HGT: Cross-Phylum Transfer of Carbohydrate-Active Enzyme Cassettes
Completed: Do Type IV secretion systems (T4SS) in environmental bacteria physically co-localise with carbohydrate-active enzyme (CAZy) genes in a manner consiste...