logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003500_00419

You are here: Home > Sequence: MGYG000003500_00419

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Butyrivibrio_A sp900768755
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Butyrivibrio_A; Butyrivibrio_A sp900768755
CAZyme ID MGYG000003500_00419
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
699 MGYG000003500_13|CGC2 76512.45 4.393
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003500 2005892 MAG Fiji Oceania
Gene Location Start: 19728;  End: 21827  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 68 348 2e-89 0.9927536231884058

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 6.46e-55 69 351 16 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.06e-26 38 380 44 393
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam09479 Flg_new 1.95e-05 526 581 3 65
Listeria-Bacteroides repeat domain (List_Bact_rpt). This model describes a conserved core region of about 43 residues, which occurs in at least two families of tandem repeats. These include 78-residue repeats which occur from 2 to 15 times in some proteins of Bacteroides forsythus ATCC 43037, and 70-residue repeats found in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few other bacteria.
pfam18259 CBM65_1 1.14e-04 414 517 12 113
Carbohydrate binding module 65 domain 1. This domain is found in the non-catalytic carbohydrate binding module 65B (CMB65B) present in Eubacterium cellulosolvens. CBMs are present in plant cell wall degrading enzymes and are responsible for targeting, which enhances catalysis. CBM65s display higher affinity for oligosaccharides, such as cellohexaose, and particularly polysaccharides than cellotetraose, which fully occupies the core component of the substrate binding cleft. The concave surface presented by beta-sheet 2 comprises the beta-glucan binding site in CBM65s. C6 of all the backbone glucose moieties makes extensive hydrophobic interactions with the surface tryptophans of CBM65s. Three out of the four surface Trp are highly conserved. The conserved metal ion site typical of CBMs is absent in this CBM65 family.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QPB75694.1 7.93e-87 4 526 8 523
CBL17440.1 4.33e-84 37 383 42 378
ADU21423.1 6.51e-84 31 552 34 570
BAK27255.1 2.77e-83 3 494 8 514
SQG78665.1 2.77e-83 3 494 8 514

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6XSU_A 4.15e-90 32 379 7 351
GH5-4broad specificity endoglucanase from Ruminococcus flavefaciens [Ruminococcus flavefaciens],6XSU_B GH5-4 broad specificity endoglucanase from Ruminococcus flavefaciens [Ruminococcus flavefaciens]
6WQP_A 7.51e-74 37 379 15 354
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
3AYR_A 1.43e-73 37 380 19 359
GH5endoglucanase EglA from a ruminal fungus [Piromyces rhizinflatus]
3AYS_A 1.07e-72 37 380 19 359
GH5endoglucanase from a ruminal fungus in complex with cellotriose [Piromyces rhizinflatus]
6Q1I_A 1.25e-71 32 383 8 355
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q12647 7.38e-75 32 381 19 363
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1
P54937 1.29e-70 32 397 33 394
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P20847 7.14e-69 4 381 8 395
Endoglucanase 1 OS=Butyrivibrio fibrisolvens OX=831 GN=end1 PE=3 SV=1
P16216 7.74e-69 21 375 54 392
Endoglucanase 1 OS=Ruminococcus albus OX=1264 GN=Eg I PE=1 SV=1
P23661 4.44e-68 20 385 55 400
Endoglucanase B OS=Ruminococcus albus OX=1264 GN=celB PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000226 0.999109 0.000181 0.000165 0.000165 0.000144

TMHMM  Annotations      download full data without filtering help

start end
667 689