You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003658_00464


Basic Information

Species CAG-110 sp900546915
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Oscillospiraceae; CAG-110; CAG-110 sp900546915
CAZyme ID MGYG000003658_00464
CAZy Family GH85
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 2202
CGC MGYG000003658_14|CGC1
Molecular Weight 237604.34
Isoelectric Point 3.951
Genome Property
Genome Assembly ID MGYG000003658
Genome Size 2415548
Genome Type MAG
Country United States
Continent North America
Gene Location Start: 34518; End: 41126; Strand: +
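
The gene coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene span that includes one terminal stop codon; neither is stated on this page, and the variable names are illustrative.

```python
# Values copied from the record above.
gene_start = 34518
gene_end = 41126
protein_length = 2202
molecular_weight = 237604.34  # Da

# Assumption: coordinates are 1-based and inclusive.
gene_length_nt = gene_end - gene_start + 1

# Assumption: the span ends with a single stop codon that is not translated.
codons = gene_length_nt // 3
expected_protein_length = codons - 1

print(gene_length_nt, codons, expected_protein_length)

# Mean mass per residue in Da, as a rough sanity check on the molecular weight.
mean_residue_mass = molecular_weight / protein_length
print(round(mean_residue_mass, 1))
```

Under these assumptions the coordinate span agrees with the reported protein length.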

Enzyme Prediction

No EC number was predicted for MGYG000003658_00464.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH85 109 433 1.8e-51 0.9746031746031746
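
The coverage value is reported as a long decimal, which suggests it was computed as a ratio of two integers. As a rough sketch, the fraction can be recovered and the size of the GH85 region compared with the full protein. Reading the denominator as the length of the family profile is an assumption, and the denominator bound of 1000 is an arbitrary choice.

```python
from fractions import Fraction

# Values copied from the record above.
coverage = 0.9746031746031746
domain_start, domain_end = 109, 433
protein_length = 2202

# Simplest fraction (denominator <= 1000, an arbitrary bound) matching the coverage.
ratio = Fraction(coverage).limit_denominator(1000)
print(ratio)

# Length of the GH85 signature region and its share of the whole protein.
domain_span = domain_end - domain_start + 1
print(domain_span, round(domain_span / protein_length, 3))
```

The GH85 region accounts for only a small part of this 2202-residue protein, which fits the additional non-catalytic domains reported in the CDD section.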

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
COG4724 COG4724 8.63e-70 44 625 27 547
Endo-beta-N-acetylglucosaminidase D [Carbohydrate transport and metabolism].
pfam03644 Glyco_hydro_85 2.12e-55 116 430 2 291
Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates.
cd06547 GH85_ENGase 2.18e-48 100 465 1 339
Endo-beta-N-acetylglucosaminidase (ENGase) hydrolyzes the N-N'-diacetylchitobiosyl core of N-glycosylproteins. The beta-1,4-glycosyl bond located between two N-acetylglucosamine residues is hydrolyzed such that N-acetylglucosamine 1 remains with the protein and N-acetylglucosamine 2 forms the reducing end of the released glycan. ENGase is a key enzyme in the processing of free oligosaccharides in the cytosol of eukaryotes. Oligosaccharides formed in the lumen of the endoplasmic reticulum are transported into the cytosol where they are catabolized by cytosolic ENGases and other enzymes, possibly to maximize the reutilization of the component sugars. ENGases have an eight-stranded alpha/beta barrel topology and are classified as a family 85 glycosyl hydrolase (GH85) domain. The GH85 ENGases are sequence-similar to the family 18 glycosyl hydrolases, also known as GH18 chitinases. An ENGase-like protein is also found in bacteria and is included in this alignment model.
NF033190 inl_like_NEAT_1 8.86e-16 1986 2176 560 732
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
sd00033 LRR_RI 1.61e-10 947 1119 36 201
leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
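
The CDD hits above overlap on the query sequence, so it can help to collapse them into distinct regions ordered along the protein. The following is a minimal sketch using the qStart and qEnd columns; the `Domain` type and `merge_regions` helper are hypothetical names introduced here for illustration.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    q_start: int
    q_end: int


# Query coordinates copied from the CDD table above.
domains = [
    Domain("COG4724", 44, 625),
    Domain("Glyco_hydro_85", 116, 430),
    Domain("GH85_ENGase", 100, 465),
    Domain("inl_like_NEAT_1", 1986, 2176),
    Domain("LRR_RI", 947, 1119),
]


def merge_regions(hits):
    """Merge overlapping query intervals into non-overlapping regions."""
    regions = []
    for d in sorted(hits, key=lambda h: h.q_start):
        if regions and d.q_start <= regions[-1][1]:
            start, end, names = regions[-1]
            regions[-1] = (start, max(end, d.q_end), names + [d.name])
        else:
            regions.append((d.q_start, d.q_end, [d.name]))
    return regions


regions = merge_regions(domains)
for start, end, names in regions:
    print(f"{start}-{end}: {', '.join(names)}")
```

With these coordinates the three GH85-related hits collapse into one N-terminal region, followed by the leucine-rich repeat hit and the C-terminal NEAT-like hit.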

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QNO18499.1 8.84e-237 6 897 13 917
ALA43266.1 1.55e-187 27 817 31 829
APB74945.1 1.55e-187 27 817 31 829
AET57459.1 5.71e-187 27 838 31 854
AWB44945.1 1.02e-185 29 1000 36 1011

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2W91_A 1.24e-44 54 732 18 643
Structure of a Streptococcus pneumoniae family 85 glycoside hydrolase, Endo-D. [Streptococcus pneumoniae TIGR4], 2W92_A Structure of a Streptococcus pneumoniae family 85 glycoside hydrolase, Endo-D, in complex with NAG-thiazoline. [Streptococcus pneumoniae TIGR4]
3GDB_A 1.42e-43 54 732 169 794
Crystal structure of Spr0440 glycoside hydrolase domain, Endo-D from Streptococcus pneumoniae R6 [Streptococcus pneumoniae R6]
2VTF_A 1.29e-41 44 670 9 568
X-ray crystal structure of the Endo-beta-N-acetylglucosaminidase from Arthrobacter protophormiae E173Q mutant reveals a TIM barrel catalytic domain and two ancillary domains [Glutamicibacter protophormiae], 2VTF_B X-ray crystal structure of the Endo-beta-N-acetylglucosaminidase from Arthrobacter protophormiae E173Q mutant reveals a TIM barrel catalytic domain and two ancillary domains [Glutamicibacter protophormiae]
3FHA_A 5.18e-41 44 704 4 597
Chain A, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae], 3FHA_B Chain B, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae], 3FHA_C Chain C, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae], 3FHA_D Chain D, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae]
3FHQ_A 6.94e-41 44 704 4 597
Chain A, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae], 3FHQ_B Chain B, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae], 3FHQ_D Chain D, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae], 3FHQ_F Chain F, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P38537 6.11e-13 2001 2199 12 206
Surface-layer 125 kDa protein OS=Lysinibacillus sphaericus OX=1421 PE=3 SV=1
C6CRV0 8.62e-13 2023 2197 1283 1457
Endo-1,4-beta-xylanase A OS=Paenibacillus sp. (strain JDR-2) OX=324057 GN=xynA1 PE=1 SV=1
A1L251 1.20e-11 160 313 132 301
Cytosolic endo-beta-N-acetylglucosaminidase OS=Danio rerio OX=7955 GN=engase PE=2 SV=1
Q9SRL4 1.85e-08 160 562 99 478
Cytosolic endo-beta-N-acetylglucosaminidase 2 OS=Arabidopsis thaliana OX=3702 GN=ENGASE2 PE=1 SV=1
Q8BX80 1.90e-08 54 292 83 298
Cytosolic endo-beta-N-acetylglucosaminidase OS=Mus musculus OX=10090 GN=Engase PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000471 0.998632 0.000332 0.000208 0.000178 0.000156
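
The SP call follows from the class with the highest score in the table above. A minimal sketch of that selection, assuming the six values are class probabilities that should sum to roughly 1:

```python
# Column labels and scores copied from the table above.
labels = [
    "Other",
    "SP_Sec_SPI",
    "LIPO_Sec_SPII",
    "TAT_Tat_SPI",
    "TATLIP_Sec_SPII",
    "PILIN_Sec_SPIII",
]
scores = [0.000471, 0.998632, 0.000332, 0.000208, 0.000178, 0.000156]

by_label = dict(zip(labels, scores))

# The predicted class is the label with the highest score.
best = max(by_label, key=by_label.get)
total = sum(scores)

print(best, by_label[best])
print(round(total, 6))
```

Here the top-scoring class is SP_Sec_SPI, consistent with the SP prediction stated above.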

TMHMM Annotations

start end
9 28
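
The single predicted transmembrane segment lies at the extreme N-terminus, where the hydrophobic core of a signal peptide is also expected, so the two predictions may describe the same stretch of sequence. The sketch below flags that situation; the 60-residue window is an illustrative cutoff chosen here, not a value taken from this page.

```python
# Values copied from the record above.
protein_length = 2202
tm_start, tm_end = 9, 28
signal_peptide_predicted = True  # SP_Sec_SPI scored 0.998632 on this page

# Assumption: treat the first 60 residues as the N-terminal window.
n_terminal_window = 60

tm_length = tm_end - tm_start + 1
in_n_terminal_window = tm_end <= n_terminal_window

# Flag a TM segment that may actually be the signal peptide core.
likely_signal_peptide_core = signal_peptide_predicted and in_n_terminal_window

print(tm_length, in_n_terminal_window, likely_signal_peptide_core)
```

If the flag is set, the segment may be better read as part of a cleaved signal peptide than as a membrane anchor, though only experimental evidence could confirm this.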