You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001995_01201

Basic Information

Species Parabacteroides sp900541965
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Tannerellaceae; Parabacteroides; Parabacteroides sp900541965
CAZyme ID MGYG000001995_01201
CAZy Family CBM62
CAZyme Description hypothetical protein
CAZyme Property
Protein Length (aa)    CGC                      Molecular Weight (Da)    Isoelectric Point
650                    MGYG000001995_12|CGC4    75068.28                 6.9153

Genome Property
Genome Assembly ID    Genome Size (bp)    Genome Type    Country    Continent
MGYG000001995         4648473             MAG            Spain      Europe

Gene Location Start: 84269; End: 86221; Strand: -
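
The gene coordinates and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon; neither assumption is stated on this page. The function name `protein_length_from_gene` is hypothetical.

```python
def protein_length_from_gene(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Expected protein length (aa) for a gene with 1-based inclusive coordinates.

    Assumption: the span covers the full coding sequence, optionally
    including one terminal stop codon.
    """
    gene_length = end - start + 1  # inclusive span in nucleotides
    if gene_length % 3 != 0:
        raise ValueError(f"gene length {gene_length} is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is not translated, so it does not add a residue.
    return codons - 1 if has_stop_codon else codons


# Coordinates from the Gene Location line: 1953 nt, 651 codons, 650 residues.
print(protein_length_from_gene(84269, 86221))
```

Under these assumptions the result matches the listed protein length of 650; strand orientation does not affect the length calculation.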

Full Sequence

MKLYLIIFFA  FCLFSCSWQG  NKNTYLEQAL  SLAGNNRKEL  EKVLNHYQAD  TLKLKAAQFL  60
IENMPYYYYY  TGKTLDRQME  QYKLFATTDQ  HPASIRDSLA  LKYGPFSYSA  LNMEYDLLNV  120
DSAYLVENID  EAFRVWEEQP  WGKHVSFENF  CEYILPYRTG  DEPLTYWRKQ  IYEKYNPLLD  180
SIRKLPEAED  PAFAAQALLD  TFRREGNIKY  TEQLAITPHT  GPQVCVEWKS  GTCREHADAV  240
IYVMRALGIP  CGIDEVPLRG  DNNSPHMWNF  VLDKEQNTYM  IEILTYSEIR  KAPEVYLSAG  300
KIYRHTFSTN  NSLKEQFNAS  GLPVPSYFRT  GQIKDITKIY  AGSGCFPVEI  SGTKLYKRIK  360
KSEPIWLCLS  ARQQWKPVDY  GYLLENKVCF  KDVKGGVVCR  LAHSTKEGLE  MLSDPFHVDA  420
EHGTIRFFTP  GQEMEQVTVL  FKFHFFYEYF  LNRMVNGIFE  GSNAPDFSHA  DTLFQIKDEP  480
RRLITVVSPA  TDKKYRYVRY  KGPKGSHCNI  SEVAFYENNV  DTVALKGKVI  GTPGSFDGIH  540
DYRNVFDGNP  YTSFDYRFPD  GGWSGLDLGK  ACSIARIAFT  PRNSDNFIRK  GDSYEMFYLD  600
HEWVSAGIQT  AVSDSLNFMA  PKGALLYLKN  HTRGKDERIF  EYKEGKQIFW  650
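
The sequence is displayed in blocks of ten residues with a running position counter at the end of each line, so it needs light cleanup before it can be passed to other tools. The following is a minimal sketch of that cleanup; the helper name `parse_numbered_sequence` is hypothetical, and the example uses only the first displayed line.

```python
import re


def parse_numbered_sequence(lines):
    """Join blocked sequence lines into one string, dropping position counters."""
    parts = []
    for line in lines:
        # Strip the trailing running counter (e.g. "60"), whether or not
        # it is separated from the last residue by whitespace.
        line = re.sub(r"\d+\s*$", "", line)
        # Remove the whitespace between the ten-residue blocks.
        parts.append("".join(line.split()))
    return "".join(parts)


first_line = "MKLYLIIFFA  FCLFSCSWQG  NKNTYLEQAL  SLAGNNRKEL  EKVLNHYQAD  TLKLKAAQFL60"
seq = parse_numbered_sequence([first_line])
print(seq)
print(len(seq))
```

Passing all eleven displayed lines to the same function should give a sequence whose length can be compared against the protein length listed under CAZyme Property.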

Enzyme Prediction

No EC number was predicted for MGYG000001995_01201.

CAZyme Signature Domains

Family    Start    End    E-value    Family Coverage
CBM62     449      517    4.1e-23    0.5267175572519084

CDD Domains

Cdd ID        Domain            E-Value     qStart    qEnd    sStart    sEnd
pfam01841     Transglut_core    2.99e-07    188       269     11        94
smart00460    TGc               1.45e-04    229       269     5         53

Domain Description
pfam01841 (Transglut_core): Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterized transglutaminase, the human blood clotting factor XIIIa'. On the basis of the experimentally demonstrated activity of the Methanobacterium phage pseudomurein endoisopeptidase, it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease.
smart00460 (TGc): Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.

CAZyme Hits

Hit ID        E-Value      Query Start    Query End    Hit Start    Hit End
QKH87132.1    9.52e-215    3              650          11           653
QDO67716.1    3.83e-214    22             650          23           653
QDO68297.1    5.02e-212    1              650          10           653
QCQ50079.1    1.11e-211    13             650          25           656
QUU03539.1    9.29e-210    15             650          24           653

PDB Hits

Hit ID    E-Value     Query Start    Query End    Hit Start    Hit End
2YB7_A    8.37e-09    453            516          73           137
5G56_A    7.28e-07    453            516          774          838

Description
2YB7_A: Chain A, Carbohydrate Binding Family 6 [Acetivibrio thermocellus], 2YB7_B Chain B, Carbohydrate Binding Family 6 [Acetivibrio thermocellus], 2YFU_A Chain A, Carbohydrate Binding Family 6 [Acetivibrio thermocellus], 2YFZ_A Chain A, Carbohydrate Binding Family 6 [Acetivibrio thermocellus], 2YG0_A Chain A, Carbohydrate Binding Family 6 [Acetivibrio thermocellus]
5G56_A: Chain A, Carbohydrate Binding Family 6 [Acetivibrio thermocellus]

Swiss-Prot Hits

MGYG000001995_01201 has no Swiss-Prot hits.

SignalP and Lipop Annotations

This protein is predicted to carry a lipoprotein signal peptide (LIPO).

Other       SP_Sec_SPI    LIPO_Sec_SPII    TAT_Tat_SPI    TATLIP_Sec_SPII    PILIN_Sec_SPIII
0.000000    0.000045      0.999996         0.000000       0.000000           0.000000
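
The predicted class follows from the highest of the six probabilities. A minimal sketch of that selection, using the labels and values from the table on this page (the variable names are illustrative and not part of any tool's API):

```python
# Class labels and probabilities copied from the table above.
labels = [
    "Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
    "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII",
]
probabilities = [0.000000, 0.000045, 0.999996, 0.000000, 0.000000, 0.000000]

scores = dict(zip(labels, probabilities))

# Pick the label with the largest probability.
best_label = max(scores, key=scores.get)
print(best_label, scores[best_label])
```

The listed values do not sum to exactly 1, presumably because of rounding in the displayed output, so the sketch compares them directly rather than normalising.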

TMHMM Annotations

No transmembrane helices are predicted in MGYG000001995_01201.