Fig. 7 | BMC Bioinformatics

Fig. 7

From: Multi-proteins similarity-based sampling to select representative genomes from large databases

Comparison of taxonomic diversity and taxonomic redundancy for Bacteria. Samples of the bacterial dataset obtained with MPS-Sampling, Treemmer and TaxSampler are compared on two criteria: taxonomic diversity, measured as the proportion of taxa retained at each taxonomic level (left column), and taxonomic redundancy, measured as the average number of genomes per taxon at each level (right column). The solid lines represent the possible values of the parameter Δ with MPS-Sampling or Treemmer, which can yield any sample size desired by the user.
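
As an illustration of the two quantities named in the caption, the following minimal Python sketch computes, for each taxonomic level, the proportion of retained taxa and the average number of genomes per taxon in a sample. It is not the authors' code: the function name `diversity_and_redundancy`, the input layout (a dictionary mapping genome identifiers to lineages) and the toy data are all hypothetical, and the redundancy is assumed here to be averaged over the taxa that are actually present in the sample.

```python
from collections import Counter


def diversity_and_redundancy(taxonomy, sampled_ids, levels):
    """Return {level: (diversity, redundancy)} for one sample.

    taxonomy    -- hypothetical layout: genome id -> {level: taxon name}
    sampled_ids -- genome ids kept by the sampling method
    levels      -- taxonomic levels to evaluate, e.g. ["phylum", "genus"]
    """
    sampled = set(sampled_ids)  # ignore accidental duplicates
    results = {}
    for level in levels:
        # All taxa present in the full dataset at this level.
        all_taxa = {lineage[level] for lineage in taxonomy.values()}
        # Number of sampled genomes falling in each taxon.
        counts = Counter(taxonomy[gid][level] for gid in sampled)
        retained = len(counts)
        # Diversity: proportion of taxa with at least one sampled genome.
        diversity = retained / len(all_taxa) if all_taxa else 0.0
        # Redundancy (assumption): sampled genomes per retained taxon.
        redundancy = len(sampled) / retained if retained else 0.0
        results[level] = (diversity, redundancy)
    return results


# Toy data, invented for illustration only.
toy_taxonomy = {
    "g1": {"phylum": "Proteobacteria", "genus": "Escherichia"},
    "g2": {"phylum": "Proteobacteria", "genus": "Escherichia"},
    "g3": {"phylum": "Proteobacteria", "genus": "Salmonella"},
    "g4": {"phylum": "Firmicutes", "genus": "Bacillus"},
}
toy_sample = ["g1", "g2", "g4"]
toy_result = diversity_and_redundancy(toy_taxonomy, toy_sample, ["phylum", "genus"])
```

In this toy case the sample keeps both phyla but only two of the three genera, so diversity drops at the genus level while the two Escherichia genomes raise the average number of genomes per taxon above one. A good sampling would push diversity toward 1 and redundancy toward 1 at every level.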
