logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002289_01878

You are here: Home > Sequence: MGYG000002289_01878

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Coprococcus sp000433075
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Coprococcus; Coprococcus sp000433075
CAZyme ID MGYG000002289_01878
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
567 62718.78 9.5957
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002289 2644240 Isolate China Asia
Gene Location Start: 23814;  End: 25517  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 82 343 6.9e-75 0.9873417721518988

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 7.94e-53 80 351 2 272
Cellulase (glycosyl hydrolase family 5).
NF033930 pneumo_PspA 6.07e-15 390 558 442 593
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033838 PspC_subgroup_1 6.93e-14 388 547 483 628
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
COG5263 COG5263 2.54e-10 350 559 108 307
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033840 PspC_relate_1 3.50e-10 392 527 511 637
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
SIP56455.1 4.36e-88 47 383 72 379
CBK83458.1 2.03e-87 61 386 40 352
CBK89462.1 7.62e-87 47 383 76 383
CBK94241.1 9.51e-87 47 383 72 379
ACR74405.1 2.00e-86 47 383 72 379

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6GJF_A 2.35e-71 60 380 4 299
Ancestralendocellulase Cel5A [synthetic construct],6GJF_B Ancestral endocellulase Cel5A [synthetic construct],6GJF_C Ancestral endocellulase Cel5A [synthetic construct],6GJF_D Ancestral endocellulase Cel5A [synthetic construct],6GJF_E Ancestral endocellulase Cel5A [synthetic construct],6GJF_F Ancestral endocellulase Cel5A [synthetic construct]
4XZW_A 4.51e-66 61 360 4 279
Endo-glucanasechimera C10 [uncultured bacterium]
4XZB_A 4.20e-62 59 360 2 280
endo-glucanaseGsCelA P1 [Geobacillus sp. 70PC53]
5WH8_A 1.57e-60 59 377 4 303
CellulaseCel5C_n [uncultured organism]
3PZT_A 1.76e-60 61 362 29 303
Structureof the endo-1,4-beta-glucanase from Bacillus subtilis 168 with manganese(II) ion [Bacillus subtilis subsp. subtilis str. 168],3PZT_B Structure of the endo-1,4-beta-glucanase from Bacillus subtilis 168 with manganese(II) ion [Bacillus subtilis subsp. subtilis str. 168],3PZU_A P212121 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168],3PZU_B P212121 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168],3PZV_A C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168],3PZV_B C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168],3PZV_C C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168],3PZV_D C2 crystal form of the endo-1,4-beta-glucanase from Bacillus subtilis 168 [Bacillus subtilis subsp. subtilis str. 168]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P22541 1.31e-68 63 393 110 419
Endoglucanase A OS=Butyrivibrio fibrisolvens OX=831 GN=celA PE=1 SV=1
P15704 6.19e-67 57 383 35 326
Endoglucanase OS=Clostridium saccharobutylicum OX=169679 GN=eglA PE=3 SV=1
P07983 7.09e-59 61 362 34 308
Endoglucanase OS=Bacillus subtilis OX=1423 GN=bglC PE=3 SV=2
P10475 7.17e-58 61 362 34 308
Endoglucanase OS=Bacillus subtilis (strain 168) OX=224308 GN=eglS PE=1 SV=1
Q59394 1.12e-55 54 386 26 322
Endoglucanase N OS=Pectobacterium atrosepticum OX=29471 GN=celN PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000379 0.998776 0.000315 0.000187 0.000164 0.000142

TMHMM  Annotations      download full data without filtering help

start end
12 34