You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003442_01723

Basic Information

Species:
Lineage: Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Oribacterium
CAZyme ID: MGYG000003442_01723
CAZy Family: GH5
CAZyme Description: hypothetical protein

CAZyme Property
Protein Length: 389
CGC: MGYG000003442_206|CGC1
Molecular Weight: 44340.06
Isoelectric Point: 4.7693

Genome Property
Genome Assembly ID: MGYG000003442
Genome Size: 2188754
Genome Type: MAG
Country: Fiji
Continent: Oceania
Gene Location: Start: 2747; End: 3916; Strand: +
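
The gene coordinates and the protein length reported above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record: the function name `expected_protein_length` is hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the annotated span includes the stop codon.

```python
def expected_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the protein length (in residues) implied by gene coordinates.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    span = end - start + 1  # number of nucleotides in the gene
    if span % 3 != 0:
        raise ValueError(f"span of {span} nt is not a whole number of codons")
    codons = span // 3
    # If the span includes the stop codon, that codon encodes no residue.
    return codons - 1 if includes_stop else codons


# Start: 2747, End: 3916 gives 1170 nt = 390 codons = 389 residues plus a stop codon.
print(expected_protein_length(2747, 3916))  # 389
```

Under these assumptions the result matches the reported Protein Length of 389.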

Enzyme Prediction

No EC number is predicted for MGYG000003442_01723.

CAZyme Signature Domains

Family  Start  End  E-Value  Family Coverage
GH5     105    360  5.4e-25  0.7781818181818182

CDD Domains

Cdd ID    Domain           E-Value   qStart  qEnd  sStart  sEnd
NF033838  PspC_subgroup_1  1.16e-07  18      83    620     683
pfam00150 Cellulase        3.24e-07  153     345   67      256
COG5263   COG5263          8.71e-07  26      83    257     313
NF033930  pneumo_PspA      1.05e-06  23      113   462     547
NF033840  PspC_relate_1    3.08e-06  14      111   499     592

Domain Description
NF033838: pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
pfam00150: Cellulase (glycosyl hydrolase family 5).
COG5263: Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033930: pneumococcal surface protein A. The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033840: PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.

CAZyme Hits

Hit ID      E-Value    Query Start  Query End  Hit Start  Hit End
QIB28168.1  1.57e-126  91           389        2          300
QUF84801.1  1.15e-123  94           382        29         317
QMW91197.1  1.62e-123  94           382        29         317
BBK76626.1  1.62e-123  94           382        29         317
QCI58964.2  6.39e-123  89           382        26         319

PDB Hits

MGYG000003442_01723 has no PDB hit.

Swiss-Prot Hits

Hit ID  E-Value   Query Start  Query End  Hit Start  Hit End
P25472  1.68e-12  111          371        53         315

Description
P25472: Endoglucanase D OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCD PE=3 SV=1

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other     SP_Sec_SPI  LIPO_Sec_SPII  TAT_Tat_SPI  TATLIP_Sec_SPII  PILIN_Sec_SPIII
0.000320  0.998840    0.000220       0.000229     0.000191         0.000160

TMHMM Annotations

Start  End
5      27