logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004275_00642

You are here: Home > Sequence: MGYG000004275_00642

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Ruminococcus_C sp000437175
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_C; Ruminococcus_C sp000437175
CAZyme ID MGYG000004275_00642
CAZy Family CBM79
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
710 MGYG000004275_26|CGC1 76951.65 4.2531
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004275 2279760 MAG China Asia
Gene Location Start: 5659;  End: 7791  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000004275_00642.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 409 681 1.2e-103 0.9927536231884058
CBM79 226 335 3.7e-36 0.9636363636363636
CBM79 88 200 6.1e-30 0.9636363636363636

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.45e-63 409 683 16 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.60e-35 383 704 46 389
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam18522 DUF5620 7.87e-35 227 338 1 119
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.
pfam18522 DUF5620 2.35e-31 88 202 1 119
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.
COG1874 GanA 0.002 422 473 36 86
Beta-galactosidase GanA [Carbohydrate transport and metabolism].

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CDE11886.1 0.0 1 710 1 710
ERJ92483.1 0.0 1 710 24 734
CCZ83818.1 8.87e-312 5 710 7 717
CDE32105.1 2.33e-310 1 709 1 701
EWM54675.1 6.55e-188 63 697 58 671

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6MQ4_A 2.77e-101 373 705 4 345
ChainA, cellulase [Acetivibrio cellulolyticus]
6PZ7_A 3.34e-100 379 705 7 333
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
4IM4_A 2.08e-98 379 709 5 335
ChainA, Endoglucanase E [Acetivibrio thermocellus],4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus],4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus],4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus],4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus],4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]
6WQP_A 4.79e-95 369 705 6 352
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6Q1I_A 1.13e-93 371 705 3 349
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P10477 1.43e-92 379 709 55 385
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P54937 9.55e-92 371 705 28 374
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P28623 5.20e-90 376 704 38 367
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P23660 2.21e-87 375 710 21 364
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P28621 4.28e-86 373 704 34 369
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000000 1.000047 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000004275_00642.