logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000878_01677

You are here: Home > Sequence: MGYG000000878_01677

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Collinsella sp900541145
Lineage Bacteria; Actinobacteriota; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella; Collinsella sp900541145
CAZyme ID MGYG000000878_01677
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1323 140676.34 4.2995
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000878 2047179 MAG Netherlands Europe
Gene Location Start: 88;  End: 4059  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000878_01677.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM32 254 371 5.4e-16 0.8870967741935484

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd08759 Type_III_cohesin_like 4.37e-36 458 625 1 167
Cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. Two specific calcium-dependent interactions between cohesin and dockerin appear to be essential for cellulosome assembly, type I and type II. This subfamily represents type III cohesins and closely related domains.
pfam00754 F5_F8_type_C 9.90e-18 248 364 1 118
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam00754 F5_F8_type_C 1.13e-09 953 1090 5 127
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
cd00057 FA58C 1.21e-09 250 364 18 134
Substituted updates: Jan 31, 2002
pfam07554 FIVAR 2.98e-08 1106 1171 3 69
FIVAR domain. This domain is found in a wide variety of contexts, but mostly occurring in cell wall associated proteins. A lack of conserved catalytic residues suggests that it is a binding domain. From context, possible substrates are hyaluronate or fibronectin (personal obs: C Yeats). This is further evidenced by. Possibly the exact substrate is N-acetyl glucosamine. Finding it in the same protein as pfam05089 further supports this proposal. It is found in the C-terminal part of Bacillus sp. Gellan lyase, which is removed during maturation. Some of the proteins it is found in are involved in methicillin resistance. The name FIVAR derives from Found In Various Architectures.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QWT17625.1 0.0 1 1298 828 2140
QNM10857.1 4.95e-301 3 1251 796 2018
BCT46261.1 1.98e-251 1 1246 798 2041
QOY60737.1 2.79e-217 1 630 223 848
BBK61154.1 4.85e-199 1 1252 816 2075

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4LPL_A 5.22e-21 220 375 23 182
Structureof CBM32-1 from a family 31 glycoside hydrolase from Clostridium perfringens [Clostridium perfringens ATCC 13124]
6M76_A 6.65e-20 12 241 750 963
GH31alpha-N-acetylgalactosaminidase from Enterococcus faecalis [Enterococcus faecalis ATCC 10100],6M77_A GH31 alpha-N-acetylgalactosaminidase from Enterococcus faecalis in complex with N-acetylgalactosamine [Enterococcus faecalis ATCC 10100]
7F7Q_A 6.65e-20 12 241 750 963
ChainA, GH31 alpha-N-acetylgalactosaminidase [Enterococcus faecalis ATCC 10100]
7F7R_A 6.65e-20 12 241 750 963
ChainA, GH31 alpha-N-acetylgalactosaminidase [Enterococcus faecalis ATCC 10100]
4LKS_A 5.99e-14 953 1094 32 166
Structureof CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens in complex with galactose [Clostridium perfringens ATCC 13124],4LKS_C Structure of CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens in complex with galactose [Clostridium perfringens ATCC 13124],4LQR_A Structure of CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens [Clostridium perfringens ATCC 13124],4P5Y_A Structure of CBM32-3 from a family 31 glycoside hydrolase from Clostridium perfringens in complex with N-acetylgalactosamine [Clostridium perfringens ATCC 13124]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
E8MGH9 1.40e-08 1048 1246 1610 1805
Beta-L-arabinobiosidase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=hypBA2 PE=1 SV=1
Q0TR53 2.07e-06 287 374 672 766
O-GlcNAcase NagJ OS=Clostridium perfringens (strain ATCC 13124 / DSM 756 / JCM 1290 / NCIMB 6125 / NCTC 8237 / Type A) OX=195103 GN=nagJ PE=1 SV=1
Q8XL08 2.07e-06 287 374 672 766
O-GlcNAcase NagJ OS=Clostridium perfringens (strain 13 / Type A) OX=195102 GN=nagJ PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000064 0.000002 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      download full data without filtering help

start end
1295 1317