logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000222_00051

You are here: Home > Sequence: MGYG000000222_00051

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Parabacteroides sp003473295
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Tannerellaceae; Parabacteroides; Parabacteroides sp003473295
CAZyme ID MGYG000000222_00051
CAZy Family GH9
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
842 94149.27 6.1855
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000222 4189286 Isolate China Asia
Gene Location Start: 62800;  End: 65328  Strand: +

Full Sequence      Download help

MRCKLLFSLA  LGIVGVFSQA  QEFKLSPSGY  FQNKGVDVMA  FDDIYPEGHQ  GGVCLIMHGH60
RVATNGDIRL  EATPGQWQPV  PIQRDRKADV  SSNQITAWLS  YPDSSRHMTG  FNPMIYPDLQ120
FNYTVHVKGE  GGSIIVTVDL  DRPVPEDFIG  KVGFNLELFP  GALFGKPWIM  DNESGIFPQQ180
PNGPTIEVQA  KHKHQGNYNQ  YDSPSGRVAD  VKVLAGTGGY  NPIIADDIIS  APYAVGRRFT240
VRPDDSYNRF  TIESKTSELK  LYDGRMNHNN  GWFVVRSEVP  AGATKGAIQW  VITPNVVNDW300
LYQPVIQTSQ  IGYHPAQPKT  AVIELDKRDA  QRLSADLYKI  TPDGAKLVLT  SKPADWGQFL360
RYNYLKFDFS  DVKEEGLYEV  RYGESVSSVF  RIAKNIYDRG  VWQPVLEYFL  PVQMCHMRVN420
EKYRVWHDFC  HMDDARMAPT  LNHIDGYAQG  SSTMTKYKPG  DVVPGLNIGG  WHDAGDFDLR480
VESQSGEAYI  LALAYEAFNE  NYDATSIDQT  KRITEIHQPD  GKPDMLQQVE  NGALTVVGGY540
RALGRLYRGI  ICNDLRQYVL  LGDAGAMTDN  IIGNKDDRWV  FTEDNPSREL  STAAHLAAIS600
RVLKNFNDTL  SVQSLEAARA  LFDVTREESY  SKGAKIQAAV  ELFLTTGEAR  YKDYLLKEQK660
YIVENIGRFG  WFLGRADAKM  NNAEFSKAVR  GALASFTTEL  DKQGGETPYG  IPYRPHIWGA720
GWDIQSFGYR  HYFLYKSYPD  LFGPEYIYNA  LNFVLGCHPG  SNTASFASGV  GAVSATVGYG780
LNRADWSYIP  GGVISGTALI  RPDFPELLTF  PFLWQQTEYV  LGGGSSHYMF  LVLAARQILE840
GK842

Enzyme Prediction      help

No EC number prediction in MGYG000000222_00051.

CAZyme Signature Domains help

Created with Snap4284126168210252294336378421463505547589631673715757799409775GH9
Family Start End Evalue family coverage
GH9 409 775 1.3e-29 0.8253588516746412

CDD Domains      download full data without filtering help

Created with Snap4284126168210252294336378421463505547589631673715757799305392E_set_Cellulase_N426671Glyco_hydro_9304387CelD_N582701PLN02613
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd02850 E_set_Cellulase_N 4.71e-19 305 392 1 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam00759 Glyco_hydro_9 6.22e-10 426 671 26 251
Glycosyl hydrolase family 9.
pfam02927 CelD_N 1.17e-08 304 387 1 83
Cellulase N-terminal ig-like domain.
PLN02613 PLN02613 3.10e-04 582 701 165 299
endoglucanase

CAZyme Hits      help

Created with Snap4284126168210252294336378421463505547589631673715757799284796QUY42379.1|GH9297826AOX03543.1|GH9289796ABW27815.1|GH9297826AOY84056.1|GH9306796AFZ21278.1|GH9
Hit ID E-Value Query Start Query End Hit Start Hit End
QUY42379.1 1.03e-18 284 796 21 531
AOX03543.1 8.98e-17 297 826 23 565
ABW27815.1 1.46e-16 289 796 3 506
AOY84056.1 3.63e-16 297 826 23 565
AFZ21278.1 8.11e-14 306 796 91 580

PDB Hits      download full data without filtering help

Created with Snap42841261682102522943363784214635055475896316737157577992988023X17_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
3X17_A 1.99e-12 298 802 10 522
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]

Swiss-Prot Hits      help

has no Swissprot hit.

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000387 0.998728 0.000286 0.000194 0.000187 0.000168

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000222_00051.