logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002834_00426

You are here: Home > Sequence: MGYG000002834_00426

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Prevotella sp900551985
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Prevotella; Prevotella sp900551985
CAZyme ID MGYG000002834_00426
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
603 MGYG000002834_18|CGC1 65213.89 4.7022
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002834 3427886 MAG United Republic of Tanzania Africa
Gene Location Start: 16376;  End: 18187  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002834_00426.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 258 566 3.2e-95 0.9927536231884058

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 9.55e-51 253 567 10 270
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 8.31e-24 192 527 11 324
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14948 BACON 1.59e-15 38 120 2 82
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and accociated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
pfam13004 BACON 1.12e-09 65 120 4 60
Putative binding domain, N-terminal. The BACON (Bacteroidetes-Associated Carbohydrate-binding Often N-terminal) domain is an all-beta domain found in diverse architectures, principally in combination with carbohydrate-active enzymes and proteases. These architectures suggest a carbohydrate-binding function which is also supported by the nature of BACON's few conserved amino-acids. The phyletic distribution of BACON and other data tentatively suggest that it may frequently function to bind mucin. Further work with the characterized structure of a member of glycoside hydrolase family 5 enzyme, Structure 3ZMR, has found no evidence for carbohydrate-binding for this domain.
pfam19190 BACON_2 1.77e-06 40 120 4 88
Viral BACON domain. This family represents a distinct class of BACON domains found in crAss-like phages, the most common viral family in the human gut, in which they are found in tail fiber genes. This suggests they may play a role in phage-host interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
VEH16026.1 2.14e-162 1 598 31 605
QUT37093.1 1.29e-121 20 601 25 592
QUT65987.1 3.63e-121 20 601 25 592
QUU01445.1 3.63e-121 20 601 25 592
BBK86684.1 3.63e-121 20 601 25 592

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4W8A_A 4.98e-98 223 601 1 377
Crystalstructure of XEG5B, a GH5 xyloglucan-specific beta-1,4-glucanase from ruminal metagenomic library, in the native form [uncultured bacterium],4W8B_A Crystal structure of XEG5B, a GH5 xyloglucan-specific beta-1,4-glucanase from ruminal metagenomic library, in complex with XXLG [uncultured bacterium]
5OYC_A 6.39e-79 221 593 41 391
GH5endo-xyloglucanase from Cellvibrio japonicus [Cellvibrio japonicus Ueda107],5OYC_B GH5 endo-xyloglucanase from Cellvibrio japonicus [Cellvibrio japonicus Ueda107],5OYD_A GH5 endo-xyloglucanase from Cellvibrio japonicus [Cellvibrio japonicus Ueda107],5OYD_B GH5 endo-xyloglucanase from Cellvibrio japonicus [Cellvibrio japonicus Ueda107],5OYE_A GH5 endo-xyloglucanase from Cellvibrio japonicus [Cellvibrio japonicus Ueda107],5OYE_B GH5 endo-xyloglucanase from Cellvibrio japonicus [Cellvibrio japonicus Ueda107]
6HA9_A 4.89e-78 221 593 41 391
Structureof an endo-Xyloglucanase from Cellvibrio japonicus complexed with XXXG(2F)-beta-DNP [Cellvibrio japonicus Ueda107],6HA9_B Structure of an endo-Xyloglucanase from Cellvibrio japonicus complexed with XXXG(2F)-beta-DNP [Cellvibrio japonicus Ueda107],6HAA_A Structure of a covalent complex of endo-Xyloglucanase from Cellvibrio japonicus after reacting with XXXG(2F)-beta-DNP [Cellvibrio japonicus Ueda107],6HAA_B Structure of a covalent complex of endo-Xyloglucanase from Cellvibrio japonicus after reacting with XXXG(2F)-beta-DNP [Cellvibrio japonicus Ueda107]
2JEP_A 7.39e-64 228 596 38 395
Nativefamily 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQY_A 2.55e-57 223 595 22 375
ChainA, Cellulase [Phocaeicola salanitronis DSM 18170]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 1.38e-59 222 596 37 400
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
A7LXT7 1.10e-54 132 595 50 501
Xyloglucan-specific endo-beta-1,4-glucanase BoGH5A OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / BCRC 10623 / CCUG 4943 / NCTC 11153) OX=411476 GN=BACOVA_02653 PE=1 SV=1
P28621 5.16e-50 218 581 35 360
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P23660 2.30e-49 224 596 26 364
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P28623 2.12e-47 215 595 31 372
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000003 1.000043 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000002834_00426.