logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003858_01884

You are here: Home > Sequence: MGYG000003858_01884

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UMGS363;
CAZyme ID MGYG000003858_01884
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
572 63029.77 8.1803
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003858 2536747 MAG United States North America
Gene Location Start: 5794;  End: 7512  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003858_01884.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 64 334 2.5e-95 0.9891304347826086

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 6.60e-66 59 334 11 269
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 3.25e-36 37 353 45 378
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
NF033189 internalin_A 1.06e-15 383 534 258 404
class 1 internalin InlA. Internalins, as found in the intracellular human pathogen Listeria monocytogenes, are paralogous surface or secreted proteins with an N-terminal signal peptide, leucine-rich repeats, and usually a C-terminal LPXTG processing and cell surface anchoring site. See PMID:17764999 for a general discussion of internalins. Members of this family are internalin A (InlA), a class 1 (LPXTG-type) internalin.
sd00033 LRR_RI 1.51e-15 402 533 99 238
leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00031 LRR_1 9.08e-15 402 504 3 110
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CCO06082.1 6.15e-99 1 351 1 365
ADU86903.1 7.55e-97 34 366 77 412
ADL33047.1 1.50e-94 34 366 116 451
ADU21113.1 2.94e-92 25 353 26 360
BAA92146.1 2.94e-92 25 353 26 360

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6Q1I_A 1.71e-91 27 353 5 338
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
6XSU_A 1.74e-87 32 345 9 330
GH5-4broad specificity endoglucanase from Ruminococcus flavefaciens [Ruminococcus flavefaciens],6XSU_B GH5-4 broad specificity endoglucanase from Ruminococcus flavefaciens [Ruminococcus flavefaciens]
6PZ7_A 1.79e-87 34 365 7 334
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
6MQ4_A 1.49e-84 25 353 1 334
ChainA, cellulase [Acetivibrio cellulolyticus]
6WQP_A 9.26e-83 34 353 14 341
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54937 1.33e-89 27 353 30 363
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P23660 4.79e-86 34 369 25 364
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
Q12647 1.78e-83 30 366 19 361
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1
P23661 3.11e-83 35 350 70 382
Endoglucanase B OS=Ruminococcus albus OX=1264 GN=celB PE=3 SV=1
P16216 5.64e-83 35 354 68 384
Endoglucanase 1 OS=Ruminococcus albus OX=1264 GN=Eg I PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000231 0.999067 0.000191 0.000178 0.000166 0.000149

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003858_01884.