Environment: HUMAN GUT

CAZyme Information: MGYG000002082_00673

Basic Information

Species Alistipes sp900544265
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Rikenellaceae; Alistipes; Alistipes sp900544265
CAZyme ID MGYG000002082_00673
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 490
CGC MGYG000002082_3|CGC2
Molecular Weight 53756.51
Isoelectric Point 4.6315
Genome Property
Genome Assembly ID MGYG000002082
Genome Size 2573115
Genome Type MAG
Country Netherlands
Continent Europe
Gene Location Start: 107150; End: 108622; Strand: +
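
As a quick consistency check, the gene coordinates can be compared with the reported protein length. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; neither convention is stated in this record, and the function name is hypothetical.

```python
def protein_length_from_span(start: int, end: int, includes_stop: bool = True) -> int:
    """Amino-acid count implied by a gene span.

    Assumes 1-based, inclusive nucleotide coordinates (an assumption,
    not something this record states).
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError(f"span of {nucleotides} nt is not a whole number of codons")
    codons = nucleotides // 3
    # Drop the stop codon if the span is assumed to contain it.
    return codons - 1 if includes_stop else codons


implied_length = protein_length_from_span(107150, 108622)
print(implied_length)
```

Under these assumptions the span covers 1473 nt, or 491 codons, which agrees with the listed protein length of 490 once the stop codon is excluded.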

Enzyme Prediction

EC 3.2.1.4

CAZyme Signature Domains

Family Start End E-value Family Coverage
GH5 142 438 4.5e-91 0.9927536231884058

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 8.76e-52 134 441 7 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.25e-24 112 439 40 361
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14948 BACON 2.47e-09 27 102 7 82
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and associated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
pfam13004 BACON 0.009 47 103 2 61
Putative binding domain, N-terminal. The BACON (Bacteroidetes-Associated Carbohydrate-binding Often N-terminal) domain is an all-beta domain found in diverse architectures, principally in combination with carbohydrate-active enzymes and proteases. These architectures suggest a carbohydrate-binding function, which is also supported by the nature of BACON's few conserved amino acids. The phyletic distribution of BACON and other data tentatively suggest that it may frequently function to bind mucin. Further work with the characterized structure of a glycoside hydrolase family 5 enzyme, Structure 3ZMR, has found no evidence of carbohydrate binding for this domain.
pfam19190 BACON_2 0.009 27 103 7 89
Viral BACON domain. This family represents a distinct class of BACON domains found in crAss-like phages, the most common viral family in the human gut, in which they are found in tail fiber genes. This suggests they may play a role in phage-host interactions.
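
The BACON description above suggests the domain may act as a spacer in front of the catalytic module. A rough sketch of how the spacing could be read off this record follows, using the cd14948 BACON hit (27 to 102) and the GH5 signature domain (142 to 438). The `Domain` type and `linker_length` helper are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # query start, assumed 1-based inclusive
    end: int    # query end, assumed inclusive


def linker_length(upstream: Domain, downstream: Domain) -> int:
    """Number of residues strictly between two non-overlapping domains."""
    if downstream.start <= upstream.end:
        raise ValueError("domains overlap or are out of order")
    return downstream.start - upstream.end - 1


bacon = Domain("BACON (cd14948)", 27, 102)
gh5 = Domain("GH5 signature domain", 142, 438)

gap = linker_length(bacon, gh5)
print(f"{bacon.name} -> {gh5.name}: {gap} residues between domains")
```

The result depends on which domain boundaries are used; the pfam00150 hit starts earlier (residue 134) and would give a shorter gap.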

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QUT35080.1 4.59e-190 8 490 22 517
QRX62664.1 3.74e-188 8 490 23 513
ADD61911.1 4.03e-188 21 490 4 485
AVM57519.1 1.03e-187 8 489 68 562
ADY35478.1 9.90e-175 10 490 109 592

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6WQY_A 5.75e-162 110 474 18 384
Chain A, Cellulase [Phocaeicola salanitronis DSM 18170]
4YHE_A 1.13e-142 114 490 5 386
Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a], 4YHE_B Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a]
4YHG_A 9.10e-142 114 490 5 386
Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a], 4YHG_B Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a]
2JEP_A 3.52e-79 115 471 34 393
Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQP_A 1.31e-60 116 454 16 340
GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 2.91e-71 116 471 40 398
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
Q12647 3.81e-58 119 471 28 362
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1
P28623 3.58e-57 116 471 43 371
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P28621 3.76e-56 118 451 44 355
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P10477 4.05e-55 116 471 57 384
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2

SignalP and LipoP Annotations

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000061 0.000004 0.000000 0.000000 0.000000 0.000000
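
The OTHER call corresponds to the highest-scoring column in the table. A minimal sketch of that selection is given below, assuming the reported prediction is simply the class with the largest score; the record does not state the rule, and the variable names are illustrative.

```python
# Column labels and scores copied from the table in this record.
labels = "Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII".split()
scores = [float(x) for x in "1.000061 0.000004 0.000000 0.000000 0.000000 0.000000".split()]

if len(labels) != len(scores):
    raise ValueError("label and score counts differ")

# Pick the class with the highest score (assumed decision rule).
best_label, best_score = max(zip(labels, scores), key=lambda pair: pair[1])
print(best_label, best_score)
```

Here the top class is `Other`, meaning no signal peptide of any listed type is predicted.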

TMHMM Annotations

There are no transmembrane helices in MGYG000002082_00673.