You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000002622_01309



Basic Information

Species Phocaeicola sp900546095
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Phocaeicola; Phocaeicola sp900546095
CAZyme ID MGYG000002622_01309
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 282
CGC MGYG000002622_10|CGC1
Molecular Weight 31477.98
Isoelectric Point 4.1181
Genome Property
Genome Assembly ID MGYG000002622
Genome Size 3429958
Genome Type MAG
Country China
Continent Asia
Gene Location Start: 21898; End: 22746; Strand: -
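
The gene coordinates and the reported protein length can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the annotated span includes the stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Coordinates as listed under "Gene Location".
start, end = 21898, 22746

# Assumption: 1-based, inclusive coordinates.
gene_length_nt = end - start + 1

# Assumption: the span ends with a stop codon, which encodes no residue.
codons = gene_length_nt // 3
protein_length = codons - 1

print(gene_length_nt, codons, protein_length)
```

Under these assumptions the span is 849 nt, or 283 codons, which agrees with the listed protein length of 282 residues.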

Full Sequence

MKKLSYLMLF  FLSAFLLGGC  ADTEQPYVGY  ITIKRAVVEA  AANSTATITA  DTDIDSPIEL  60
KSIDFGEGVE  EWCTVAFNGR  DITVTATQAN  TGDNYRTATV  NVKCGYWQTS  FTVLQKYEGQ  120
EYLQYDWTGW  TATGSDVQEN  DGGGYSSLFT  EDRTEFWHSY  YGEPTPCPHW  LLIDMQKELE  180
CSRFAIGRRE  AGGNNYPSVR  HMNIYVSTDG  ENFEQVGEFT  FELPWTAPDG  TVVEGNSPLV  240
PAEEVITLDA  PVTARYVRLE  ITATNNDTGV  CQVAYCKVYE  KL  282
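
The sequence is displayed in blocks of ten residues with a running position count at the end of each line. A minimal sketch of turning that display into a plain residue string is shown below; the helper name `clean_sequence` is hypothetical and not part of any tool referenced on this page.

```python
import re

# Sequence block as displayed above, including the position counters.
raw_block = """
MKKLSYLMLF  FLSAFLLGGC  ADTEQPYVGY  ITIKRAVVEA  AANSTATITA  DTDIDSPIEL  60
KSIDFGEGVE  EWCTVAFNGR  DITVTATQAN  TGDNYRTATV  NVKCGYWQTS  FTVLQKYEGQ  120
EYLQYDWTGW  TATGSDVQEN  DGGGYSSLFT  EDRTEFWHSY  YGEPTPCPHW  LLIDMQKELE  180
CSRFAIGRRE  AGGNNYPSVR  HMNIYVSTDG  ENFEQVGEFT  FELPWTAPDG  TVVEGNSPLV  240
PAEEVITLDA  PVTARYVRLE  ITATNNDTGV  CQVAYCKVYE  KL  282
"""

def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()

sequence = clean_sequence(raw_block)
print(len(sequence))
```

Because the helper discards every non-letter character, it works whether or not the counters are separated from the residues by a space.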

Enzyme Prediction

No EC number is predicted for MGYG000002622_01309.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
CBM32 139 268 2.5e-16 0.8306451612903226

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00754 F5_F8_type_C 1.19e-14 138 274 7 125
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
cd00057 FA58C 1.50e-09 152 266 32 133
Substituted updates: Jan 31, 2002
pfam13004 BACON 6.13e-05 62 116 1 61
Putative binding domain, N-terminal. The BACON (Bacteroidetes-Associated Carbohydrate-binding Often N-terminal) domain is an all-beta domain found in diverse architectures, principally in combination with carbohydrate-active enzymes and proteases. These architectures suggest a carbohydrate-binding function, which is also supported by the nature of BACON's few conserved amino acids. The phyletic distribution of BACON and other data tentatively suggest that it may frequently function to bind mucin. Further work with the characterized structure of a glycoside hydrolase family 5 enzyme, Structure 3ZMR, has found no evidence of carbohydrate binding for this domain.
cd14948 BACON 1.06e-04 70 116 34 83
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and associated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
smart00231 FA58C 1.73e-04 156 267 35 129
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.

CAZyme Hits

Hit ID Family E-Value Query Start Query End Hit Start Hit End
CCO21431.1 CBM32 4.01e-108 1 282 5 286
AIM35717.1 CBM32 2.02e-15 33 267 233 441
QEH41611.1 CBM32 4.63e-08 122 270 181 317
QIU94663.1 CBM32 6.33e-08 130 259 329 445
CRK61066.1 AA5_2|CBM32 6.33e-08 129 259 499 614

PDB Hits

MGYG000002622_01309 has no PDB hit.

Swiss-Prot Hits

MGYG000002622_01309 has no Swiss-Prot hit.

SignalP and Lipop Annotations

This protein is predicted to carry a lipoprotein signal peptide (LIPO).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000000 1.000087 0.000000 0.000000 0.000000
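
The LIPO call corresponds to the column with the highest score in the table above. A minimal sketch of that selection follows, assuming the predicted class is simply the best-scoring column; the variable names are illustrative. The LIPO_Sec_SPII value of 1.000087 is slightly above 1 in the source and is kept as reported.

```python
# Column labels and scores copied from the table above.
labels = "Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII".split()
values = "0.000000 0.000000 1.000087 0.000000 0.000000 0.000000".split()

# Pair each label with its numeric score.
scores = dict(zip(labels, (float(v) for v in values)))

# Assumption: the reported class is the label with the maximum score.
best_label = max(scores, key=scores.get)

print(best_label, scores[best_label])
```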

TMHMM Annotations

There are no transmembrane helices in MGYG000002622_01309.