logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003355_00349

You are here: Home > Sequence: MGYG000003355_00349

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Lachnoclostridium phytofermentans_A
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Lachnoclostridium; Lachnoclostridium phytofermentans_A
CAZyme ID MGYG000003355_00349
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
648 73061.89 4.5567
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003355 5194070 MAG China Asia
Gene Location Start: 6450;  End: 8396  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003355_00349.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 66 345 5.3e-82 0.9927536231884058
CBM46 374 458 4.5e-17 0.9770114942528736

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.53e-43 56 346 4 271
Cellulase (glycosyl hydrolase family 5).
pfam18448 CBM46 4.36e-29 464 564 2 103
Carbohydrate binding domain. Carbohydrate active enzymes (CAZYmes) that target recalcitrant polysaccharides are modular enzymes containing noncatalytic carbohydrate-binding modules (CBMs) that direct enzymes to their cognate substrate, thus potentiating catalysis. The structure of Bacillus halodurans endo-beta-1,4-glucanase B (Cel5B) reveals that CBM46 is tightly associated with the catalytic module and, dependent on the glucan presented to the enzyme, can contribute directly to substrate binding or play a targeting role in directing the enzyme to regions of the plant cell wall rich in the polysaccharide hydrolyzed by the enzyme. The CBM46 domain displays a classic beta-sandwich jelly roll fold. Against beta-1,3-1,4-glucans CBM46 domain participates in productive substrate binding and thus plays a direct role in the hydrolytic activity of the enzyme.
COG2730 BglC 1.26e-28 35 355 41 371
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
PRK11198 PRK11198 2.56e-15 547 644 38 143
LysM domain/BON superfamily protein; Provisional
cd00118 LysM 5.23e-11 599 646 3 45
Lysin Motif is a small domain involved in binding peptidoglycan. LysM, a small globular domain with approximately 40 amino acids, is a widespread protein module involved in binding peptidoglycan in bacteria and chitin in eukaryotes. The domain was originally identified in enzymes that degrade bacterial cell walls, but proteins involved in many other biological functions also contain this domain. It has been reported that the LysM domain functions as a signal for specific plant-bacteria recognition in bacterial pathogenesis. Many of these enzymes are modular and are composed of catalytic units linked to one or several repeats of LysM domains. LysM domains are found in bacteria and eukaryotes.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AUS96699.1 4.44e-256 4 563 4 563
BCN31364.1 1.23e-249 1 636 1 616
ADL52313.1 3.86e-228 1 564 1 567
BAV13065.1 3.86e-228 1 564 1 567
ACZ98603.1 1.33e-218 1 563 1 560

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4V2X_A 5.45e-157 28 564 46 582
ChainA, Endo-beta-1,4-glucanase (cellulase B) [Halalkalibacterium halodurans]
5XRC_A 3.40e-149 22 549 26 552
ATrimodular GH5_4 Subfamily Endoglucanase Structure with Large Unit Cell [Bacillus sp. BG-CS10],5XRC_B A Trimodular GH5_4 Subfamily Endoglucanase Structure with Large Unit Cell [Bacillus sp. BG-CS10],5XRC_C A Trimodular GH5_4 Subfamily Endoglucanase Structure with Large Unit Cell [Bacillus sp. BG-CS10]
5E0C_A 4.13e-149 39 549 11 519
StructuralInsight of a Trimodular Halophilic Cellulase with a Family 46 Carbohydrate-Binding Module [Bacillus sp. BG-CS10]
5E09_A 1.05e-143 39 549 11 519
StructuralInsight of a Trimodular Halophilic Cellulase with a Family 46 Carbohydrate-Binding Module [Bacillus sp. BG-CS10]
4YZP_A 5.29e-67 23 516 9 494
Crystalstructure of a tri-modular GH5 (subfamily 4) endo-beta-1, 4-glucanase from Bacillus licheniformis [Bacillus licheniformis],4YZT_A Crystal structure of a tri-modular GH5 (subfamily 4) endo-beta-1, 4-glucanase from Bacillus licheniformis complexed with cellotetraose [Bacillus licheniformis]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P23550 1.79e-152 36 549 35 547
Endoglucanase B OS=Paenibacillus lautus OX=1401 GN=celB PE=3 SV=1
P28623 8.30e-48 38 355 43 350
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P23660 2.11e-46 42 351 31 338
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P28621 3.20e-46 37 355 41 352
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P10477 3.22e-45 42 346 61 354
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000222 0.999070 0.000224 0.000171 0.000155 0.000138

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003355_00349.