| Key | Value |
|---|---|
| Created on | 2021-06-01 13:53:43 |
| Version | 14 |
| Number of genes | 488,146 |
| Number of gene clusters | 17,421 |
| Partial genes excluded | No |
| Minbit parameter | 0.8 |
| Gene cluster min occurrence parameter | 1 |
| MCL inflation parameter | 10.0 |
| NCBI blastp or DIAMOND? | NCBI blastp |
| Number of genomes used | 100 |
| Items aditional data keys | num_genomes_gene_cluster_has_hits, num_genes_in_gene_cluster, max_num_paralogs, SCG, functional_homogeneity_index, geometric_homogeneity_index, combined_homogeneity_index |
| Key | Value |
|---|---|
| Created on | Storage DB knows nothing :( |
| Version | 7 |
| Number of genomes described | 100 |
| Functional annotation | Available |
| Functional annotation sources | COG20_PATHWAY, COG20_FUNCTION, COG20_CATEGORY |
This was a full summary (i.e., the `--quick` flag has not been used), hence the gene clusters summary file is not succint by any means.
The summary file: ECtest_gene_clusters_summary.txt.gz
For layers
The directory misc data layers contains TAB-delimited files for additional data stored under the following data group names for each sample/layer found in the merged database: default.
The default data group, which often is added by anvi'o automatically and contains important information, contained these keys: total_length, gc_content, percent_completion, percent_redundancy, num_genes, avg_gene_length, num_genes_per_kb, singleton_gene_clusters, num_gene_clusters.
For items
The directory misc data items contains TAB-delimited files for additional data stored under the following data group names for each item found in the merged database: default.