logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001375_00117

You are here: Home > Sequence: MGYG000001375_00117

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Ruminococcus_F champanellensis
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_F; Ruminococcus_F champanellensis
CAZyme ID MGYG000001375_00117
CAZy Family GH9
CAZyme Description Endoglucanase 1
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
845 93340.35 4.0892
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001375 2513436 Isolate not provided not provided
Gene Location Start: 133226;  End: 135763  Strand: +

Full Sequence      Download help

MNTKRTLKRR  LKALGLSGAM  LAGALCLPGA  APQGGDLLAN  AAGDAASGFD  ANFAKLLQYS60
IYFYDANMCG  TDVSENNRLN  WRGDCHTYDA  QVPMDTEHTN  LSSAFLTANK  DYLDPDGDGF120
IDVSGGFHDA  GDHVKFGMPE  NYSAATVGWG  YYEFRDAYAA  TGQDAHVETI  LRYFNDYLMR180
CTFLDDSGDV  VAFCYQVGDG  DIDHAYWQAP  EIDTMDRPAF  FLTGDKPQTD  YVASAAASLA240
INYLNFKDTD  EAYAAKSLKY  ANALYDFARD  HEKELSDNGD  GPKQYYSSSK  WQDDYCWASA300
WMYKITGDHA  YLEEIYPNYD  YYAAPCYVYC  WNDMWGGVQC  VLGEIVSEMY  PNFIDEYKEA360
AGKSPYEEMD  CWASVKEALD  TYMSGGIGEI  SPQGYFWLNT  WGSARYNTAA  QLIAMVYDKY420
TNNNQPSKYS  DWAKGQMEYL  MGNNDITYQE  RIDANTEAEN  SGNPAPYSAD  ELHGPRCFIV480
GFNDVAAAYP  HHRASSGLSK  CEDTKPQKHV  LVGALVGGPD  NKDLHNDVTK  DWIYNEVTID540
YNAAFVGASA  GLYHFYGTDA  MQPDPDIDLG  TSEEEGGGQD  YWVEAYAVDD  KQTSGAGVTK600
LAMLVCTDSN  KPRTDISVRY  YFSVKELSNP  SNVSLVKGDE  LYDQTSVETD  FDGVLSGPYQ660
YDASFDPDIY  YIEVKWDGYN  IANANKKYQL  AVGFYYGDTW  DPTNDWSYQG  ITKCKDTYQD720
GSETRTDYIC  VYSGDTLVGG  IEPNGSKPVV  TTAATTEGSG  TTTTTTTTTT  DTTVLGDVDG780
NGKVEVNDLV  RLARYVAQDQ  ELTPALTAQQ  VTNADVNCDG  TVDASDITMI  ARALARLTSL840
EDFGK845

Enzyme Prediction      help

EC 3.2.1.4 3.2.1.-

CAZyme Signature Domains help

Created with Snap428412616921125329533838042246450754959163367671876080252446GH9
Family Start End Evalue family coverage
GH9 52 446 8.3e-95 0.8205741626794258

CDD Domains      download full data without filtering help

Created with Snap428412616921125329533838042246450754959163367671876080255548Glyco_hydro_957553PLN0234541552PLN0261352552PLN0011948552PLN02420
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 1.78e-116 55 548 1 373
Glycosyl hydrolase family 9.
PLN02345 PLN02345 1.39e-57 57 553 2 459
endoglucanase
PLN02613 PLN02613 1.56e-56 41 552 14 478
endoglucanase
PLN00119 PLN00119 2.99e-51 52 552 31 488
endoglucanase
PLN02420 PLN02420 2.51e-50 48 552 39 506
endoglucanase

CAZyme Hits      help

Created with Snap42841261692112532953383804224645075495916336767187608021845CBL16391.1|CBM3|GH9|3.2.1.447815ADU21883.1|CBM3|GH945801CAP78918.2|CBM3|GH945801ANV76549.1|CBM3|GH945801ADU74844.1|CBM3|GH9
Hit ID E-Value Query Start Query End Hit Start Hit End
CBL16391.1 0.0 1 845 1 845
ADU21883.1 0.0 47 815 42 810
CAP78918.2 3.94e-289 45 801 34 750
ANV76549.1 1.12e-288 45 801 34 750
ADU74844.1 1.12e-288 45 801 34 750

PDB Hits      download full data without filtering help

Created with Snap4284126169211253295338380422464507549591633676718760802525672YIK_A525631IA6_A527431JS4_A527431G87_A527431K72_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
2YIK_A 1.41e-223 52 567 39 527
ChainA, Endoglucanase [Acetivibrio thermocellus]
1IA6_A 3.75e-82 52 563 5 437
CrystalStructure Of The Cellulase Cel9m Of C. Cellulolyticum [Ruminiclostridium cellulolyticum],1IA7_A Crystal Structure Of The Cellulase Cel9m Of C. Cellulolyticium In Complex With Cellobiose [Ruminiclostridium cellulolyticum]
1JS4_A 5.28e-80 52 743 5 605
EndoEXOCELLULASE:CELLOBIOSEFROM THERMOMONOSPORA [Thermobifida fusca],1JS4_B EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA [Thermobifida fusca],1TF4_A EndoEXOCELLULASE FROM THERMOMONOSPORA [Thermobifida fusca],1TF4_B EndoEXOCELLULASE FROM THERMOMONOSPORA [Thermobifida fusca],3TF4_A EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA [Thermobifida fusca],3TF4_B EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA [Thermobifida fusca],4TF4_A EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA [Thermobifida fusca],4TF4_B EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA [Thermobifida fusca]
1G87_A 1.24e-78 52 743 5 614
TheCrystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum [Ruminiclostridium cellulolyticum],1G87_B The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum [Ruminiclostridium cellulolyticum],1GA2_A The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum Complexed With Cellobiose [Ruminiclostridium cellulolyticum],1GA2_B The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum Complexed With Cellobiose [Ruminiclostridium cellulolyticum],1KFG_A The X-ray Crystal Structure of Cel9G from Clostridium cellulolyticum complexed with a Thio-Oligosaccharide Inhibitor [Ruminiclostridium cellulolyticum],1KFG_B The X-ray Crystal Structure of Cel9G from Clostridium cellulolyticum complexed with a Thio-Oligosaccharide Inhibitor [Ruminiclostridium cellulolyticum]
1K72_A 2.36e-77 52 743 5 614
TheX-ray Crystal Structure Of Cel9G Complexed With cellotriose [Ruminiclostridium cellulolyticum],1K72_B The X-ray Crystal Structure Of Cel9G Complexed With cellotriose [Ruminiclostridium cellulolyticum]

Swiss-Prot Hits      download full data without filtering help

Created with Snap428412616921125329533838042246450754959163367671876080250803sp|P26224|GUNF_ACET252747sp|Q02934|GUNI_ACET28743sp|P26221|GUN4_THEFU52796sp|P37700|GUNG_RUMCH52751sp|P22534|GUNA_CALSA
Hit ID E-Value Query Start Query End Hit Start Hit End Description
P26224 6.09e-85 50 803 29 696
Endoglucanase F OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celF PE=3 SV=1
Q02934 1.99e-84 52 747 77 687
Endoglucanase 1 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celI PE=1 SV=2
P26221 1.31e-78 8 743 18 651
Endoglucanase E-4 OS=Thermobifida fusca OX=2021 GN=celD PE=1 SV=2
P37700 5.48e-78 52 796 40 683
Endoglucanase G OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCG PE=1 SV=2
P22534 7.52e-77 52 751 27 646
Endoglucanase A OS=Caldicellulosiruptor saccharolyticus OX=44001 GN=celA PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.004359 0.950570 0.001146 0.042665 0.000872 0.000372

TMHMM  Annotations      download full data without filtering help

start end
13 32