logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000014_02987

You are here: Home > Sequence: MGYG000000014_02987

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Clostridium butyricum
Lineage Bacteria; Firmicutes_A; Clostridia; Clostridiales; Clostridiaceae; Clostridium; Clostridium butyricum
CAZyme ID MGYG000000014_02987
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
435 MGYG000000014_14|CGC1 50772.62 6.8873
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000014 4564499 Isolate United Kingdom Europe
Gene Location Start: 14208;  End: 15515  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000014_02987.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 62 296 5.3e-25 0.9351145038167938

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033840 PspC_relate_1 1.51e-23 300 434 509 647
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.
NF033930 pneumo_PspA 1.54e-23 336 434 543 660
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
COG5263 COG5263 7.77e-23 333 434 191 313
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033838 PspC_subgroup_1 2.27e-20 348 432 477 563
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 4.41e-20 336 432 463 560
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QMW91197.1 1.75e-294 1 435 1 435
BBK76626.1 1.75e-294 1 435 1 435
QUF84801.1 1.01e-293 1 435 1 435
AOR94316.1 4.80e-292 1 435 1 435
AXB85103.1 4.80e-292 1 434 1 434

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3AZR_A 5.99e-15 29 318 3 310
DiverseSubstrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellobiose [Thermotoga maritima MSB8],3AZR_B Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellobiose [Thermotoga maritima MSB8],3AZS_A Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Mannotriose [Thermotoga maritima MSB8],3AZS_B Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Mannotriose [Thermotoga maritima MSB8],3AZT_A Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8],3AZT_B Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8],3AZT_C Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8],3AZT_D Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8]
3AMC_A 2.65e-14 29 318 3 310
Crystalstructures of Thermotoga maritima Cel5A, apo form and dimer/au [Thermotoga maritima MSB8],3AMC_B Crystal structures of Thermotoga maritima Cel5A, apo form and dimer/au [Thermotoga maritima MSB8],3AMD_A Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3AMD_B Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3AMD_C Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3AMD_D Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3MMU_A Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_B Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_C Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_D Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_E Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_F Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_G Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_H Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_A Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_B Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_C Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_D Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima]
6UJE_A 2.72e-14 63 318 39 308
ChainA, Endoglucanase [Clostridioides difficile],6UJF_A Chain A, Endoglucanase [Clostridioides difficile]
4U5I_A 4.90e-13 29 318 76 377
ChainA, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U5I_B Chain B, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U5K_A Chain A, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U5K_B Chain B, Endoglucanase H [Acetivibrio thermocellus ATCC 27405]
3NCO_A 5.23e-13 96 314 92 313
Crystalstructure of FnCel5A from F. nodosum Rt17-B1 [Fervidobacterium nodosum Rt17-B1],3NCO_B Crystal structure of FnCel5A from F. nodosum Rt17-B1 [Fervidobacterium nodosum Rt17-B1]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P16218 2.17e-11 29 318 327 628
Endoglucanase H OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celH PE=1 SV=1
P06653 3.23e-06 254 371 66 211
Autolysin OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=lytA PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000264 0.999025 0.000190 0.000180 0.000167 0.000156

TMHMM  Annotations      download full data without filtering help

start end
7 29