logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001789_02216

You are here: Home > Sequence: MGYG000001789_02216

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Phocaeicola sp002161565
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Phocaeicola; Phocaeicola sp002161565
CAZyme ID MGYG000001789_02216
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
592 MGYG000001789_44|CGC1 64118.51 3.9549
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001789 3544186 MAG Denmark Europe
Gene Location Start: 2343;  End: 4121  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 243 540 5.7e-98 0.9891304347826086

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.33e-55 234 543 7 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 4.55e-24 191 503 22 322
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14948 BACON 1.87e-17 121 204 2 83
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and accociated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
cd14948 BACON 3.40e-15 35 115 4 82
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and accociated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
pfam19190 BACON_2 9.40e-08 34 117 3 90
Viral BACON domain. This family represents a distinct class of BACON domains found in crAss-like phages, the most common viral family in the human gut, in which they are found in tail fiber genes. This suggests they may play a role in phage-host interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
ADY35478.1 0.0 1 592 1 592
QUT88430.1 5.95e-247 9 592 7 598
ALJ60576.1 3.36e-205 133 592 51 512
QRX62664.1 5.55e-171 130 592 47 513
AIF26005.1 1.55e-170 210 592 24 402

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6WQY_A 2.26e-273 211 576 19 384
ChainA, Cellulase [Phocaeicola salanitronis DSM 18170]
4YHE_A 5.84e-134 222 592 13 386
NativeBacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a],4YHE_B Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a]
4YHG_A 4.68e-133 222 592 13 386
NativeBacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a],4YHG_B Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a]
2JEP_A 6.32e-84 206 574 25 394
Nativefamily 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
3NDY_A 2.56e-59 218 574 14 341
Thestructure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDZ_A The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 3.91e-79 215 574 39 399
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P28623 1.67e-57 218 574 45 372
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P28621 2.29e-55 215 553 41 355
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
A7LXT7 3.21e-52 9 571 21 499
Xyloglucan-specific endo-beta-1,4-glucanase BoGH5A OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / BCRC 10623 / CCUG 4943 / NCTC 11153) OX=411476 GN=BACOVA_02653 PE=1 SV=1
Q12647 3.39e-52 218 556 27 347
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000048 1.000008 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000001789_02216.