logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004582_02228

You are here: Home > Sequence: MGYG000004582_02228

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-81;
CAZyme ID MGYG000004582_02228
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
388 MGYG000004582_39|CGC1 44251.88 5.0089
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004582 2816599 MAG France Europe
Gene Location Start: 5186;  End: 6352  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000004582_02228.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 122 361 5.4e-28 0.9541984732824428

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.37e-13 127 335 31 246
Cellulase (glycosyl hydrolase family 5).
COG5263 COG5263 1.26e-06 26 83 257 313
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033840 PspC_relate_1 3.27e-05 26 83 592 647
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.
pfam19085 Choline_bind_2 7.91e-05 56 82 11 38
Choline-binding repeat. this entry contains a pair of presumed choline-binding repeats that are often found adjacent to pfam01473.
NF033838 PspC_subgroup_1 1.98e-04 21 83 623 683
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QIB28168.1 1.38e-134 91 387 2 298
QCI58964.2 2.70e-124 90 388 27 325
QMW91197.1 5.52e-124 94 386 29 321
BBK76626.1 5.52e-124 94 386 29 321
QCJ07784.1 5.52e-124 96 386 31 321

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4U5I_A 7.76e-16 128 375 114 369
ChainA, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U5I_B Chain B, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U5K_A Chain A, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U5K_B Chain B, Endoglucanase H [Acetivibrio thermocellus ATCC 27405]
4U3A_A 3.39e-15 128 323 114 313
ChainA, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],4U3A_B Chain B, Endoglucanase H [Acetivibrio thermocellus ATCC 27405]
5BYW_A 6.79e-13 128 323 114 324
ChainA, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],5BYW_B Chain B, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],5BYW_C Chain C, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],5BYW_D Chain D, Endoglucanase H [Acetivibrio thermocellus ATCC 27405],5BYW_E Chain E, Endoglucanase H [Acetivibrio thermocellus ATCC 27405]
3NCO_A 1.87e-11 135 337 66 271
Crystalstructure of FnCel5A from F. nodosum Rt17-B1 [Fervidobacterium nodosum Rt17-B1],3NCO_B Crystal structure of FnCel5A from F. nodosum Rt17-B1 [Fervidobacterium nodosum Rt17-B1]
3RJX_A 1.87e-11 135 337 66 271
CrystalStructure of Hyperthermophilic Endo-Beta-1,4-glucanase [Fervidobacterium nodosum Rt17-B1]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P25472 1.61e-16 128 369 64 313
Endoglucanase D OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCD PE=3 SV=1
P16218 3.77e-14 128 323 365 564
Endoglucanase H OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celH PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000289 0.998995 0.000169 0.000194 0.000171 0.000146

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000004582_02228.