logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003664_00894

You are here: Home > Sequence: MGYG000003664_00894

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species CAG-170 sp900751035
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Oscillospiraceae; CAG-170; CAG-170 sp900751035
CAZyme ID MGYG000003664_00894
CAZy Family CE4
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
609 MGYG000003664_128|CGC1 68192.52 4.2375
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003664 2586855 MAG Peru South America
Gene Location Start: 1144;  End: 2973  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003664_00894.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CE4 412 568 2.5e-25 0.9230769230769231

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd10969 CE4_Ecf1_like_5s 2.32e-42 381 578 1 189
Putative catalytic NodB homology domain of a hypothetical protein Ecf1 from Escherichia coli and similar proteins. This family contains a hypothetical protein Ecf1 from Escherichia coli and its prokaryotic homologs. Although their biochemical properties remain to be determined, members in this family contain a conserved domain with a 5-stranded beta/alpha barrel, which is similar to the catalytic NodB homology domain of rhizobial NodB-like proteins, belonging to the larger carbohydrate esterase 4 (CE4) superfamily.
cd10918 CE4_NodB_like_5s_6s 2.13e-37 418 598 1 157
Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands. This family belongs to the large and functionally diverse carbohydrate esterase 4 (CE4) superfamily, whose members show strong sequence similarity with some variability due to their distinct carbohydrate substrates. It includes bacterial poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase PgaB, hemin storage system HmsF protein in gram-negative species, intercellular adhesion proteins IcaB, and many uncharacterized prokaryotic polysaccharide deacetylases. It also includes a putative polysaccharide deacetylase YxkH encoded by the Bacillus subtilis yxkH gene, which is one of six polysaccharide deacetylase gene homologs present in the Bacillus subtilis genome. Sequence comparison shows all family members contain a conserved domain similar to the catalytic NodB homology domain of rhizobial NodB-like proteins, which consists of a deformed (beta/alpha)8 barrel fold with 6 or 7 strands. However, in this family, most proteins have 5 strands and some have 6 strands. Moreover, long insertions are found in many family members, whose function remains unknown.
cd10966 CE4_yadE_5s 1.97e-35 415 600 1 160
Putative catalytic polysaccharide deacetylase domain of uncharacterized protein yadE and similar proteins. This family contains an uncharacterized protein yadE from Escherichia coli and its bacterial homologs. Although its molecular function remains unknown, yadE shows high sequence similarity with the catalytic NodB homology domain of outer membrane lipoprotein PgaB and the surface-attached protein intercellular adhesion protein IcaB. Both PgaB and IcaB are essential in bacterial biofilm formation.
TIGR03938 deacetyl_PgaB 3.16e-32 363 609 5 262
poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase PgaB. Two well-characterized systems produce polysaccharide based on N-acetyl-D-glucosamine in straight chains with beta-1,6 linkages. These are encoded by the icaADBC operon in Staphylococcus species, where the system is designated polysaccharide intercellular adhesin (PIA), and the pgaABCD operon in Gram-negative bacteria such as E. coli. Both systems include a putative polysaccharide deacetylase. The PgaB protein, described here, contains an additional domain lacking from its Gram-positive counterpart IcaB (TIGR03933). Deacetylation by this protein appears necessary to allow export through the porin PgaA [Cell envelope, Biosynthesis and degradation of surface polysaccharides and lipopolysaccharides]
cd10964 CE4_PgaB_5s 4.45e-26 414 584 1 183
N-terminal putative catalytic polysaccharide deacetylase domain of bacterial poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase PgaB, and similar proteins. This family is represented by an outer membrane lipoprotein, poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase (PgaB, EC 3.5.1.-), encoded by Escherichia coli pgaB gene from the pgaABCD (formerly ycdSRQP) operon, which affects biofilm development by promoting abiotic surface binding and intercellular adhesion. PgaB catalyzes the N-deacetylation of poly-beta-1,6-N-acetyl-D-glucosamine (PGA), a biofilm adhesin polysaccharide that stabilizes biofilms of E. coli and other bacteria. PgaB contains an N-terminal NodB homology domain with a 5-stranded beta/alpha barrel, and a C-terminal carbohydrate binding domain required for PGA N-deacetylation, which may be involved in binding to unmodified poly-beta-1,6-GlcNAc and assisting catalysis by the deacetylase domain. This family also includes several orthologs of PgaB, such as the hemin storage system HmsF protein, encoded by Yersinia pestis hmsF gene from the hmsHFRS operon, which is essential for Y. pestis biofilm formation. Like PgaB, HmsF is an outer membrane protein with an N-terminal NodB homology domain, which is likely involved in the modification of the exopolysaccharide (EPS) component of the biofilm. HmsF also has a conserved but uncharacterized C-terminal domain that is present in other HmsF-like proteins in Gram-negative bacteria. This alignment model corresponds to the N-terminal NodB homology domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AFS77609.1 3.99e-41 360 608 21 269
AAM25723.1 1.42e-38 356 577 30 265
ARI76505.1 1.64e-38 359 585 225 450
AIS53446.1 1.67e-36 361 577 35 260
AFS79569.1 1.87e-35 342 572 12 239

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
5BU6_A 1.67e-17 363 604 15 276
Structureof BpsB deaceylase domain from Bordetella bronchiseptica [Bordetella bronchiseptica RB50],5BU6_B Structure of BpsB deaceylase domain from Bordetella bronchiseptica [Bordetella bronchiseptica RB50]
4WCJ_A 2.24e-15 363 577 39 235
Structureof IcaB from Ammonifex degensii [Ammonifex degensii KC4]
6DQ3_A 2.31e-15 356 606 4 226
ChainA, Polysaccharide deacetylase [Streptococcus pyogenes],6DQ3_B Chain B, Polysaccharide deacetylase [Streptococcus pyogenes]
4HD5_A 2.29e-13 349 609 132 360
CrystalStructure of BC0361, a polysaccharide deacetylase from Bacillus cereus [Bacillus cereus ATCC 14579]
4V33_A 2.29e-13 349 609 132 360
Crystalstructure of the putative polysaccharide deacetylase BA0330 from bacillus anthracis [Bacillus anthracis],4V33_B Crystal structure of the putative polysaccharide deacetylase BA0330 from bacillus anthracis [Bacillus anthracis]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P94361 1.86e-23 356 603 60 270
Putative polysaccharide deacetylase YxkH OS=Bacillus subtilis (strain 168) OX=224308 GN=yxkH PE=3 SV=1
P31666 3.08e-13 363 575 172 374
Uncharacterized protein YadE OS=Escherichia coli (strain K12) OX=83333 GN=yadE PE=3 SV=2
P75906 3.96e-11 364 589 52 293
Poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase OS=Escherichia coli (strain K12) OX=83333 GN=pgaB PE=1 SV=1
Q8XAR3 5.23e-11 364 494 52 189
Poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase OS=Escherichia coli O157:H7 OX=83334 GN=pgaB PE=3 SV=1
Q6GDD6 1.28e-10 365 551 63 231
Poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase OS=Staphylococcus aureus (strain MRSA252) OX=282458 GN=icaB PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.786909 0.202388 0.008747 0.000589 0.000349 0.001042

TMHMM  Annotations      download full data without filtering help

start end
7 29