EMP Trading Cards show the distribution of unique 16S tag sequences across the EMP. Prevalence and abundance were calculated for a 2000-sample subset of the EMP (rarefied to 5000 observations per sample) having even distribution by habitat (EMPO level 3) and study. Colored points in scatter plots indicate tag sequence relative abundance as a function of metadata values (no points shown indicates that metadata were not provided for that category). For comparison, grey lines in scatter plots indicate kernel density estimates of metadata values across all samples in the subset (not just samples where the tag sequence was found). This analysis highlights the power of the EMP dataset to reveal sequence-specific niche ranges across the planet’s microbial communities.
The names given are the most granular taxon names (genus, family, order, or class) and are not unique to these sequences; see trading cards for actual sequences.

Top 10 most prevalent tag sequences

The top 10 most prevalent tag sequences, which also include five of the top 10 most abundant tag sequences (abundance rank in parentheses).

  1. Bacillus
  2. Bradyrhizobium
  3. Arthrobacter (1)
  4. Acidobacteria-6
  5. Enterobacteriaceae (4)
  6. Streptococcus (2)
  7. Pseudomonas (7)
  8. Rhodoplanes
  9. Pseudomonas
  10. Staphylococcaceae (5)

Top 10 most abundant tag sequences

The top 10 most abundant tag sequences not also among the top 10 most prevalent or habitat-specific tag sequences (abundance rank in parentheses).

Top sample type-specific tag sequences

The most abundant habitat-specific tag sequence (in order of prevalence) for each of 15 habitats. Samples of that habitat comprise at least 25% of that tag sequence, and Shannon entropy is less than 1. One sequence is the 8th most abundant.