logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000001_00152

You are here: Home > Sequence: MGYG000000001_00152

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species GCA-900066495 sp902362365
Lineage Bacteria; Firmicutes_A; Clostridia; Peptostreptococcales; Peptostreptococcaceae; GCA-900066495; GCA-900066495 sp902362365
CAZyme ID MGYG000000001_00152
CAZy Family CBM51
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2140 243933.5 4.5867
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000001 3219617 Isolate United Kingdom Europe
Gene Location Start: 163342;  End: 169764  Strand: +

Full Sequence      Download help

MKKKLASIVA  ISIIATNTMP  AMNVFADEII  RNKATSIEKE  VSKNMTVETF  KIKNYENFTR60
YNEDYRVNIK  SISNNGGKYN  QSIITNAIDG  KLETHWETGR  ENEEDFKNEV  EFEFENVEKI120
NRLAYATRQD  SARPKGYPKV  AKILVADSDE  SEYKLVGEVA  SDKVTGDMVE  FKFDTVEAKK180
VKFIFSDVHN  GWASASEFWF  YKEDKTLDKM  ESLFTDDNMN  EISPEFADLE  ALNALEKEAK240
DHPFYTNFKE  DIDNAITILE  EEELESTLAK  VSKLTGYGTE  YQDAYNEKFM  IPTNKITKVE300
ANGGIYPGTK  YEYMFDGDPN  THWETNKANS  SDFTNEITLT  LDEAQTIDRL  AYKARSLNNK360
GFPTKFEIYV  SETSKGQTFQ  KIAYGQANVT  SDMVEFKFTP  TKAKRVKLVF  NECNMNRAFA420
SEMRLYKHDE  LSEKYNTLFT  NDKKNEISSE  FNSIDKLEKF  SKELEQHPLY  NLYKEGINDA480
RLILENKKVT  YIDAKVSKFK  SFDSPELVEY  DKAYKIDNKH  IVNITTNGSH  YSNEVIARAI540
DGDVTTAWHS  NKTNSENHTN  EVIMTLDKLE  TIDKVVYTSP  RARGFAEEFD  IYVSKTLEGD600
TFEKVTSGRA  SRTNDSVSIK  FNPTEARRVK  FVYTKAYENF  ALAYEFGLYK  QDSTIDKMDN660
LFTDENMNTV  SEEFANMEAL  NKLEDEVKEH  PFYSDFKEDI  ENAKALVEQG  QVEATVSKTK720
KFNHYNNQEY  MEQYAINRDN  IKKISNNAGQ  WSTMKIDNAI  DGKLDTYWET  NTSNTESFKN780
EVTVEFINPV  TIDRLAYGAR  ESDRKGFLQE  FEIYVSATTK  GDNFKLVSTA  KADKTTGLVE840
AKFEPTKFQR  LKIKVTKGDQ  NWATLNELVF  LKQDSIANQV  ESMFTNDLMN  ELTDDYNTIQ900
EIEALEQEVK  KHPLERELME  YIERAKYIIS  NPEQANNYIY  ELESRGDSIK  ESQKRQMWNF960
QDWQPTGLAV  KSGQQITVYV  DAEPGTPLPR  LVYKQMDSQH  NGNHNIALTN  GKNVITIPET1020
NPDQLRPGTP  QSGVLYTTNP  YTPEEQIREP  KVKIEGAFEY  PCFVLGEDTD  EEIFKDLEEY1080
TEKLKKDKQL  PDVFEIFSEK  NLINVRATYA  LDWYSKNDIT  PSENAKKIDA  ILKESMDFWG1140
FDNSSEVNSD  YNYRYMTILK  WLDNGGFMNA  ANGITGYNQA  EQGAVLNYGT  GWGLMHELGH1200
NMDTNSMSIV  EVTNNMLPLH  FEMIEGKESR  FVQQNQYQKN  IFPKVTKEDY  SQNKWYPDND1260
YSNLQHIAPL  WQLQLYDETF  WPRLQQEFRE  NKLGGGDWDN  KHEAWAIAAS  NVLEMDLTEH1320
FARHGFRVSE  ETKNHMKQYP  APTQKLWYLN  DNKYLKDGEA  FNGNLNVKVS  PKINKDNITL1380
TMDIDKENDK  SLLGYEIYRD  GKLIGFTDKG  TYVDQEVDLS  DNHIYEVVPF  SNDLNTAKKI1440
SVNSQQPVIE  TTGGVTVKLN  EEFNPLDYVK  STDYQGNPIE  NIEVKHNVDN  TKQGKYTVTY1500
KVKDNNNLFA  DETLEVEVVS  DYTFLSDFEW  TSATSGYQNA  RRNESIKGRH  LGEPKTFDKG1560
LATHANGVIT  YNLGEHDYDY  LEMRVGVDMS  IPEQGNSSIT  FKLTGDGKTL  ATTSVMKHAD1620
NLKYIKVPVK  DVDEIKIEVN  NGGNGITSDH  GIIVEPKLAT  NNGKPTLKVD  KSISVKVGEE1680
LEDVIGKIEA  YDAEDGDLTK  AVQISGQDKV  NTNRVGTYPI  TYTVTDKSGN  TATATRNIKV1740
VNMQDFTYLS  DYDWKSQSNN  YTQPKKDVAI  NGGNLTLTGE  NNEAVEYEKG  LGMHSTSTVI1800
YDLTDKNADI  FSTYIGVDRA  MYHSVGQVQF  EIYVDGKLSY  DSGIMKAKDK  QKYVELNLAD1860
AKELKLVVKD  GGNGNGSDHA  TFGDTKLHFV  NESRIDKTEL  TNLVNEAKKL  NEDDYTEESF1920
NAMQILLDKA  EQVLEDSNAS  QDTIDSVTAE  LQESVKSLTV  INLDEVVNIP  DKYLARSLSS1980
VLGKSEGFTI  GDMRSLTELS  LSGVTDLEGL  QYAKNLEVLN  IEYNEISDLS  PLKNLKNLTD2040
LRANPQIITA  GSIAKKDNRV  TINYDILNRK  GEKLSPKSVT  IRNNQTLEDT  TLDLDECVDE2100
NGVISFDTTN  LDSVVYSVYL  EYEDTNDNYL  SQVLFMFSNK  2140

Enzyme Prediction      help

No EC number prediction in MGYG000000001_00152.

CAZyme Signature Domains help

Created with Snap107214321428535642749856963107011771284139114981605171218191926203317471887CBM5115241658CBM51
Family Start End Evalue family coverage
CBM51 1747 1887 1.1e-39 0.9925373134328358
CBM51 1524 1658 1.5e-32 0.9850746268656716

CDD Domains      download full data without filtering help

Created with Snap107214321428535642749856963107011771284139114981605171218191926203317471888NPCBM17471888NPCBM15211658NPCBM15231659NPCBM16661740DUF5011
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam08305 NPCBM 2.54e-47 1747 1888 3 136
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
smart00776 NPCBM 7.84e-39 1747 1888 5 145
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
pfam08305 NPCBM 8.94e-36 1521 1658 1 135
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
smart00776 NPCBM 1.92e-28 1523 1659 5 145
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
pfam16403 DUF5011 2.86e-17 1666 1740 2 71
Domain of unknown function (DUF5011). This small family of proteins is functionally uncharacterized. This family is found in Bacteroides, Prevotella, and Parabateroides. Proteins in this family are around 230 amino acids in length.

CAZyme Hits      help

Created with Snap107214321428535642749856963107011771284139114981605171218191926203312124CEI71816.1|CBM32|CBM512632137AIY84110.1|CBM32|CBM512502140CED93629.1|CBM32|CBM512502131AQW26342.1|CBM32|CBM512632137AQM59432.1|CBM32|CBM51
Hit ID E-Value Query Start Query End Hit Start Hit End
CEI71816.1 0.0 1 2124 1 2117
AIY84110.1 0.0 263 2137 39 1922
CED93629.1 0.0 250 2140 28 1924
AQW26342.1 0.0 250 2131 28 2134
AQM59432.1 0.0 263 2137 39 2442

PDB Hits      download full data without filtering help

Created with Snap107214321428535642749856963107011771284139114981605171218191926203349414437JFS_A93918927JS4_A93914386XSX_A87115205KDJ_A93914435KDN_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
7JFS_A 1.19e-279 494 1443 27 981
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
7JS4_A 3.49e-240 939 1892 24 980
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
6XSX_A 3.19e-216 939 1438 24 524
ChainA, ZmpA Glycopeptidase [Clostridium perfringens ATCC 13124]
5KDJ_A 6.93e-177 871 1520 21 672
ZmpBmetallopeptidase from Clostridium perfringens [Clostridium perfringens ATCC 13124],5KDJ_B ZmpB metallopeptidase from Clostridium perfringens [Clostridium perfringens ATCC 13124]
5KDN_A 8.53e-149 939 1443 24 530
ZmpBmetallopeptidase from Clostridium perfringens [Clostridium perfringens ATCC 13124],5KDS_A ZmpB metallopeptidase in complex with an O-glycopeptide (a2,6-sialylated core-3 pentapeptide). [Clostridium perfringens ATCC 13124],5KDU_A ZmpB metallopeptidase in complex with a2,6-Sialyl T-antigen [Clostridium perfringens ATCC 13124]

Swiss-Prot Hits      help

has no Swissprot hit.

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.001000 0.997758 0.000346 0.000326 0.000281 0.000249

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000001_00152.