You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003769_00657



Basic Information

Species:
Lineage: Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Lachnoclostridium;
CAZyme ID: MGYG000003769_00657
CAZy Family: GH9
CAZyme Description: Endoglucanase Z

CAZyme Property
Protein Length: 984
CGC: MGYG000003769_4|CGC3
Molecular Weight: 107253.26
Isoelectric Point: 4.7354

Genome Property
Genome Assembly ID: MGYG000003769
Genome Size: 4856301
Genome Type: MAG
Country: Canada
Continent: North America
Gene Location: Start: 79854; End: 82808; Strand: -
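
As a quick consistency check, the gene span implied by the Start and End values can be compared with the reported protein length. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the span includes one stop codon, neither of which is stated on this page.

```python
# Values copied from the Basic Information section of this entry.
gene_start, gene_end = 79854, 82808
protein_length = 984

# Assumption: coordinates are 1-based and inclusive.
span_nt = gene_end - gene_start + 1

# Assumption: the span is the coding sequence plus a single stop codon.
codons, remainder = divmod(span_nt, 3)
consistent = remainder == 0 and codons - 1 == protein_length

print(span_nt, codons, consistent)
```

Under these assumptions the span is a whole number of codons, one more than the number of residues, which agrees with the listed length of 984.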

Full Sequence

MKKILSVFLV  LSVIISLFPS  KITKAETNYN  YGEALQKAIM  FYEFQRSGEL  PENQRDNWRG  60
DSGLNDGSDV  GLDLTGGWYD  AGDHVKFNLP  MAYSAAMLAW  AVYEEEKAFV  QSGQIDYILD  120
AIKWVSDYLI  KCHPEANVFY  YQVGDGNLDH  SWWGAAEVMQ  MKRPSYKVDL  ANPGSSVVGE  180
AAAALAATGL  VFKDKDPAYA  ATCIQHAKEL  YAFAETTKSD  KGYTAANGFY  TSYSGFYDEL  240
SWAGAWLYLA  TSDNAYLTKA  ESYVGNWGTE  PQSTTLAYKW  GQSWDDVHYG  AAVLLARITN  300
KEIYKTNVEM  HLDYWTTGYN  GNRISYTPKG  LAWLDSWGAL  RYATTTAFLA  SVYADWSGCS  360
TAKATTYRTF  AKQQVDYALG  STGRSFVVGY  GTNSPEHPHH  RTAHGSWTDS  QSQPVDHRHT  420
IYGALVGGPG  RDDSYTDEIG  NYVNNEIACD  YNAGFVGALA  KVYKQYGGTP  IANFTAIEEK  480
TNDEFFVEAG  INASGSNFVE  IKALLNNRSG  WPARVGDKLS  FNYFIDITEA  VKLGYTKDSF  540
TVTTNYNSGA  VVSKLLPWDE  ANNIYYVNVD  FTGTKIYPGG  LSAYRKEIQF  RIAGPQNTNF  600
WDSTNDYSFA  DLTGVTSGAT  VKTSYIPVYD  AGVLVYGLEP  GNAATNSKIT  PTTANFDKYT  660
SNQADVIVTT  TLNGNVFHGI  KNGTTALVAG  TDYTVSGDVV  TILKSNLAKQ  SIGTTRLTFD  720
FSQGVDPVLT  ITVADTTPSA  SISPTTAEFD  KVLSNQNDIE  VALALNGHSL  LGIKNGTNAL  780
IADVDYTVVG  NTVTILKSYL  AKQAVSKVNL  VFDFSAGNDA  ILTITIKDSS  VVVSGDIKVQ  840
MFNGNASAST  NGIAPKIKLM  NTGTSDIALS  DVVLRYYYTI  DGEKAQNFWC  DWSSAGSANV  900
TGKFVKMESA  KAEADYYLEI  GFTLGAGSLA  AGQSIEVQVR  FSKTDWTNYT  QTGDYSFNSS  960
NSSYVDWNQM  TGYLGGSFVW  GVEP  984

Enzyme Prediction

EC 3.2.1.4

CAZyme Signature Domains

Family  Start  End  E-value   Family coverage
GH9     30     460  1.6e-140  0.9976076555023924
CBM3    840    921  4.8e-29   0.9886363636363636
CBM3    485    570  1.9e-19   0.9886363636363636
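
The rows above are listed by E-value rather than by position. A minimal sketch of how they could be reordered into an N-to-C-terminal domain architecture, with the unannotated stretches between domains, follows. The variable names are illustrative and not part of any database tooling.

```python
# (family, start, end) copied from the signature-domain table above.
domains = [("GH9", 30, 460), ("CBM3", 840, 921), ("CBM3", 485, 570)]

# Order the domains by start coordinate along the protein.
ordered = sorted(domains, key=lambda d: d[1])

# Compact architecture string, e.g. "FAMILY(start-end)-FAMILY(start-end)".
architecture = "-".join(f"{fam}({start}-{end})" for fam, start, end in ordered)

# Residue ranges between consecutive domains (inclusive coordinates).
gaps = [(a[2] + 1, b[1] - 1) for a, b in zip(ordered, ordered[1:])]

print(architecture)
print(gaps)
```

Read this way, the entry has an N-terminal GH9 catalytic module followed by two CBM3 modules.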

CDD Domains

Cdd ID     Domain         E-Value    qStart  qEnd  sStart  sEnd  Domain Description
pfam00759  Glyco_hydro_9  7.81e-164  33      459   1       374   Glycosyl hydrolase family 9.
PLN00119   PLN00119       7.69e-116  8       462   9       488   endoglucanase
PLN02345   PLN02345       1.31e-112  34      460   1       456   endoglucanase
PLN02340   PLN02340       1.36e-111  1       463   1       494   endoglucanase
PLN02420   PLN02420       1.43e-107  4       462   17      506   endoglucanase

CAZyme Hits

Hit ID      E-Value  Query Start  Query End  Hit Start  Hit End  Hit Annotation
BCN30513.1  0.0      1            984        1          987      CBM3|GH9
BCJ93767.1  0.0      1            984        1          983      CBM3|GH9
BCJ98973.1  0.0      1            984        1          982      CBM3|GH9
ABX43720.1  0.0      1            984        1          985      CBM3|GH9|3.2.1.4
CUH92781.1  0.0      1            984        1          991      CBM3|GH9

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1G87_A 2.52e-306 29 640 4 614
The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum [Ruminiclostridium cellulolyticum],1G87_B The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum [Ruminiclostridium cellulolyticum],1GA2_A The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum Complexed With Cellobiose [Ruminiclostridium cellulolyticum],1GA2_B The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum Complexed With Cellobiose [Ruminiclostridium cellulolyticum],1KFG_A The X-ray Crystal Structure of Cel9G from Clostridium cellulolyticum complexed with a Thio-Oligosaccharide Inhibitor [Ruminiclostridium cellulolyticum],1KFG_B The X-ray Crystal Structure of Cel9G from Clostridium cellulolyticum complexed with a Thio-Oligosaccharide Inhibitor [Ruminiclostridium cellulolyticum]
1K72_A 2.52e-306 29 640 4 614
The X-ray Crystal Structure Of Cel9G Complexed With cellotriose [Ruminiclostridium cellulolyticum],1K72_B The X-ray Crystal Structure Of Cel9G Complexed With cellotriose [Ruminiclostridium cellulolyticum]
4DOD_A 2.72e-242 17 478 14 475
The structure of Cbescii CelA GH9 module [Caldicellulosiruptor bescii],4DOE_A The liganded structure of Cbescii CelA GH9 module [Caldicellulosiruptor bescii]
2XFG_A 4.40e-225 29 467 24 464
Chain A, ENDOGLUCANASE 1 [Acetivibrio thermocellus]
1JS4_A 4.00e-221 26 640 1 605
EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA [Thermobifida fusca],1JS4_B EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA [Thermobifida fusca],1TF4_A EndoEXOCELLULASE FROM THERMOMONOSPORA [Thermobifida fusca],1TF4_B EndoEXOCELLULASE FROM THERMOMONOSPORA [Thermobifida fusca],3TF4_A EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA [Thermobifida fusca],3TF4_B EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA [Thermobifida fusca],4TF4_A EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA [Thermobifida fusca],4TF4_B EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA [Thermobifida fusca]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P23659 0.0 1 984 1 986
Endoglucanase Z OS=Thermoclostridium stercorarium OX=1510 GN=celZ PE=1 SV=1
Q02934 0.0 29 984 76 887
Endoglucanase 1 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celI PE=1 SV=2
P22534 0.0 28 984 25 1058
Endoglucanase A OS=Caldicellulosiruptor saccharolyticus OX=44001 GN=celA PE=3 SV=2
P37700 1.29e-307 10 646 20 655
Endoglucanase G OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCG PE=1 SV=2
P26224 2.36e-296 1 640 1 638
Endoglucanase F OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celF PE=3 SV=1

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000246 0.999109 0.000192 0.000150 0.000145 0.000137
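
The call can be read directly off the class scores above by taking the highest-scoring class. The sketch below is a simple illustration of that reading, not the predictor's own decision procedure.

```python
# Class scores copied from the table above.
scores = {
    "Other": 0.000246,
    "SP_Sec_SPI": 0.999109,
    "LIPO_Sec_SPII": 0.000192,
    "TAT_Tat_SPI": 0.000150,
    "TATLIP_Sec_SPII": 0.000145,
    "PILIN_Sec_SPIII": 0.000137,
}

# Pick the class with the highest score.
best_class = max(scores, key=scores.get)
print(best_class, scores[best_class])
```

Here the top class is SP_Sec_SPI, consistent with the signal-peptide call reported for this protein.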

TMHMM Annotations

There are no transmembrane helices in MGYG000003769_00657.