You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000000250_01845

Basic Information

Species TF01-11 sp001414325
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; TF01-11; TF01-11 sp001414325
CAZyme ID MGYG000000250_01845
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 969
CGC MGYG000000250_5|CGC4
Molecular Weight 105224.76
Isoelectric Point 7.4799
Genome Property
Genome Assembly ID MGYG000000250
Genome Size 3613289
Genome Type Isolate
Country China
Continent Asia
Gene Location Start: 158509; End: 161418; Strand: -
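
As a consistency check, the gene coordinates above can be reconciled with the reported protein length. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; neither is stated on this page, and the function names are illustrative only.

```python
def coding_length(start: int, end: int) -> int:
    """Nucleotides spanned by an inclusive, 1-based coordinate pair (assumed convention)."""
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Residue count implied by a coding span; optionally drops one codon for the stop."""
    nt = coding_length(start, end)
    if nt % 3 != 0:
        raise ValueError(f"span of {nt} nt is not a whole number of codons")
    codons = nt // 3
    return codons - 1 if has_stop else codons


# Coordinates taken from the Gene Location line of this record.
gene_nt = coding_length(158509, 161418)
gene_aa = protein_length(158509, 161418)
print(gene_nt, gene_aa)
```

Under those assumptions the span is 2910 nt, or 970 codons, which leaves 969 residues once the stop codon is excluded. That agrees with the Protein Length reported for this record.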

Enzyme Prediction

No EC number was predicted for MGYG000000250_01845.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH5 89 379 5.4e-77 0.9891304347826086

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 4.42e-45 86 382 13 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.38e-16 45 345 27 322
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
sd00036 LRR_3 1.60e-08 859 958 15 104
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 2.13e-08 859 946 38 115
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306 LRR_5 4.87e-07 859 936 35 101
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
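
The five CDD hits overlap heavily on the query. A minimal sketch of collapsing them into non-overlapping regions follows; the merging rule (two hits are joined when they share at least one residue) and the function name are assumptions made for illustration, not part of the CDD output.

```python
def merge_intervals(intervals):
    """Merge inclusive (start, end) intervals that overlap into sorted, disjoint regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (qStart, qEnd) pairs copied from the CDD table above.
cdd_hits = [
    (86, 382),   # pfam00150 Cellulase
    (45, 345),   # COG2730 BglC
    (859, 958),  # sd00036 LRR_3
    (859, 946),  # sd00036 LRR_3
    (859, 936),  # pfam13306 LRR_5
]
regions = merge_intervals(cdd_hits)
print(regions)
```

With the qStart and qEnd values in the table this gives two regions on the 969-residue query: 45-382, covered by the Cellulase and BglC hits, and 859-958, covered by the leucine-rich repeat hits.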

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
AUO18792.1 1.02e-116 53 801 246 959
AUO19859.1 6.12e-87 49 607 101 642
BCN29385.1 6.16e-67 57 518 349 807
BAE44526.1 2.09e-65 55 409 38 403
QAA35398.1 2.15e-65 51 409 33 402
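
The hits above differ mainly in how much of the query they align to. The sketch below computes query coverage for each hit as aligned query span divided by the protein length of 969; treating Query Start and Query End as inclusive and ignoring alignment gaps are simplifying assumptions, and the variable names are illustrative.

```python
PROTEIN_LENGTH = 969  # Protein Length reported for MGYG000000250_01845

# (hit ID, query start, query end) copied from the CAZyme Hits table.
cazyme_hits = [
    ("AUO18792.1", 53, 801),
    ("AUO19859.1", 49, 607),
    ("BCN29385.1", 57, 518),
    ("BAE44526.1", 55, 409),
    ("QAA35398.1", 51, 409),
]

# Fraction of the query covered by each alignment (inclusive coordinates assumed).
query_coverage = {
    hit_id: (q_end - q_start + 1) / PROTEIN_LENGTH
    for hit_id, q_start, q_end in cazyme_hits
}

for hit_id, cov in query_coverage.items():
    print(f"{hit_id}\t{cov:.3f}")
```

On these numbers the top hit, AUO18792.1, spans roughly 77% of the query, while the remaining hits span between about 37% and 58%.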

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2JEP_A 1.10e-66 62 409 35 393
Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQY_A 5.52e-58 64 391 26 361
Chain A, Cellulase [Phocaeicola salanitronis DSM 18170]
6WQP_A 4.49e-55 48 392 2 338
GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6Q1I_A 9.79e-53 61 410 13 353
GH5-4 broad specificity endoglucanase from Clostridium longisporum [Clostridium longisporum], 6Q1I_B GH5-4 broad specificity endoglucanase from Clostridium longisporum [Clostridium longisporum]
4X0V_A 5.84e-51 61 408 39 393
Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_B Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_C Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_D Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_E Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_F Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_G Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32], 4X0V_H Structure of a GH5 family lichenase from Caldicellulosiruptor sp. F32 [Caldicellulosiruptor sp. F32]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 3.77e-62 50 409 28 398
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P54937 2.49e-51 37 410 14 378
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P23661 3.69e-51 56 392 65 382
Endoglucanase B OS=Ruminococcus albus OX=1264 GN=celB PE=3 SV=1
P16216 6.38e-51 35 392 35 380
Endoglucanase 1 OS=Ruminococcus albus OX=1264 GN=Eg I PE=1 SV=1
P23660 3.64e-50 59 392 24 345
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1

SignalP and LipoP Annotations

This protein is predicted to carry a Sec/SPI signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000351 0.998857 0.000240 0.000210 0.000169 0.000141
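
The SP call follows from the class with the highest score in the table above. A minimal sketch of that selection is given below; picking the maximum is an assumption about how the label is derived, since the page does not state the decision rule, and the dictionary name is illustrative.

```python
# Class scores copied from the SignalP table for MGYG000000250_01845.
signalp_scores = {
    "Other": 0.000351,
    "SP_Sec_SPI": 0.998857,
    "LIPO_Sec_SPII": 0.000240,
    "TAT_Tat_SPI": 0.000210,
    "TATLIP_Sec_SPII": 0.000169,
    "PILIN_Sec_SPIII": 0.000141,
}

# Assumed rule: report the class with the largest score.
predicted_class = max(signalp_scores, key=signalp_scores.get)
score_total = sum(signalp_scores.values())

print(predicted_class, f"{signalp_scores[predicted_class]:.6f}")
print(f"sum of scores: {score_total:.6f}")
```

The six scores sum to approximately 1, so they can be read as a probability distribution over the classes, with SP_Sec_SPI taking almost all of the mass.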

TMHMM Annotations

There are no predicted transmembrane helices in MGYG000000250_01845.