logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001602_00787

You are here: Home > Sequence: MGYG000001602_00787

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Blautia sp900547685
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Blautia; Blautia sp900547685
CAZyme ID MGYG000001602_00787
CAZy Family GH85
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1234 MGYG000001602_12|CGC2 139104.44 4.2959
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001602 2582611 MAG China Asia
Gene Location Start: 40100;  End: 43804  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001602_00787.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH85 106 463 1.5e-63 0.9936507936507937

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033930 pneumo_PspA 1.15e-80 1039 1234 444 643
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033930 pneumo_PspA 3.06e-79 1018 1231 442 657
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033838 PspC_subgroup_1 2.06e-74 1039 1232 487 681
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 8.99e-73 1058 1233 443 621
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033838 PspC_subgroup_1 1.75e-68 1057 1234 485 666
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AQP47875.1 1.44e-238 18 984 10 951
AVM41738.1 5.67e-223 49 952 72 947
SMF77083.1 3.91e-220 29 940 21 916
VEG55861.1 2.72e-213 32 949 28 941
AMN36044.1 7.84e-210 36 941 31 908

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2W91_A 2.12e-62 46 746 5 643
Structureof a Streptococcus pneumoniae family 85 glycoside hydrolase, Endo-D. [Streptococcus pneumoniae TIGR4],2W92_A Structure of a Streptococcus pneumoniae family 85 glycoside hydrolase, Endo-D, in complex with NAG-thiazoline. [Streptococcus pneumoniae TIGR4]
3GDB_A 5.79e-62 37 795 150 847
Crystalstructure of Spr0440 glycoside hydrolase domain, Endo-D from Streptococcus pneumoniae R6 [Streptococcus pneumoniae R6]
3FHA_A 2.27e-47 48 730 4 611
ChainA, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae],3FHA_B Chain B, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae],3FHA_C Chain C, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae],3FHA_D Chain D, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae]
3FHQ_A 3.05e-47 48 730 4 611
ChainA, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae],3FHQ_B Chain B, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae],3FHQ_D Chain D, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae],3FHQ_F Chain F, Endo-beta-N-acetylglucosaminidase [Glutamicibacter protophormiae]
2VTF_A 1.43e-46 48 730 9 616
X-raycrystal structure of the Endo-beta-N-acetylglucosaminidase from Arthrobacter protophormiae E173Q mutant reveals a TIM barrel catalytic domain and two ancillary domains [Glutamicibacter protophormiae],2VTF_B X-ray crystal structure of the Endo-beta-N-acetylglucosaminidase from Arthrobacter protophormiae E173Q mutant reveals a TIM barrel catalytic domain and two ancillary domains [Glutamicibacter protophormiae]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q9SRL4 1.73e-16 170 581 108 477
Cytosolic endo-beta-N-acetylglucosaminidase 2 OS=Arabidopsis thaliana OX=3702 GN=ENGASE2 PE=1 SV=1
A1L251 6.59e-13 59 484 70 414
Cytosolic endo-beta-N-acetylglucosaminidase OS=Danio rerio OX=7955 GN=engase PE=2 SV=1
Q8BX80 1.60e-10 35 398 60 380
Cytosolic endo-beta-N-acetylglucosaminidase OS=Mus musculus OX=10090 GN=Engase PE=1 SV=1
P0C7A1 9.54e-09 59 484 90 432
Cytosolic endo-beta-N-acetylglucosaminidase OS=Gallus gallus OX=9031 GN=ENGASE PE=1 SV=1
Q8NFI3 4.39e-07 35 597 68 531
Cytosolic endo-beta-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=ENGASE PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000599 0.998367 0.000495 0.000200 0.000158 0.000148

TMHMM  Annotations      download full data without filtering help

start end
12 34