logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003172_00423

You are here: Home > Sequence: MGYG000003172_00423

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species CAG-485 sp900555915
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Muribaculaceae; CAG-485; CAG-485 sp900555915
CAZyme ID MGYG000003172_00423
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
476 53576.77 4.2957
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003172 1933519 MAG United States North America
Gene Location Start: 3579;  End: 5009  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.- 3.2.1.4 3.2.1.73 3.2.1.78 3.2.1.8

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 171 449 3e-95 0.9818840579710145

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.27e-55 155 448 1 266
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 6.28e-23 148 448 54 357
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14948 BACON 2.06e-13 43 126 7 82
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and accociated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
pfam13004 BACON 2.20e-06 62 126 1 60
Putative binding domain, N-terminal. The BACON (Bacteroidetes-Associated Carbohydrate-binding Often N-terminal) domain is an all-beta domain found in diverse architectures, principally in combination with carbohydrate-active enzymes and proteases. These architectures suggest a carbohydrate-binding function which is also supported by the nature of BACON's few conserved amino-acids. The phyletic distribution of BACON and other data tentatively suggest that it may frequently function to bind mucin. Further work with the characterized structure of a member of glycoside hydrolase family 5 enzyme, Structure 3ZMR, has found no evidence for carbohydrate-binding for this domain.
pfam19190 BACON_2 0.001 50 126 20 88
Viral BACON domain. This family represents a distinct class of BACON domains found in crAss-like phages, the most common viral family in the human gut, in which they are found in tail fiber genes. This suggests they may play a role in phage-host interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QRX63738.1 1.14e-117 29 472 44 479
AEX97596.1 8.17e-88 136 471 33 379
QPB75690.1 9.31e-87 10 469 3 498
AHF24720.1 2.60e-86 136 448 42 359
ACA61144.1 9.54e-83 138 471 147 506

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6MQ4_A 2.91e-71 136 472 10 349
ChainA, cellulase [Acetivibrio cellulolyticus]
6WQP_A 3.00e-71 136 469 14 353
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6PZ7_A 2.25e-68 136 470 7 335
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
6Q1I_A 1.18e-67 129 475 5 356
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
4IM4_A 1.76e-63 136 472 5 335
ChainA, Endoglucanase E [Acetivibrio thermocellus],4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus],4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus],4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus],4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus],4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54937 5.10e-66 129 475 30 381
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P28621 6.14e-60 136 472 40 374
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P10477 3.78e-59 136 472 55 385
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P17901 1.04e-58 142 476 51 404
Endoglucanase A OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCA PE=1 SV=1
P23661 2.28e-58 127 466 57 394
Endoglucanase B OS=Ruminococcus albus OX=1264 GN=celB PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000511 0.336871 0.662111 0.000216 0.000146 0.000129

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003172_00423.