logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000586_00010

You are here: Home > Sequence: MGYG000000586_00010

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; TF01-11;
CAZyme ID MGYG000000586_00010
CAZy Family GH59
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1093 MGYG000000586_1|CGC1 119791.13 5.1625
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000586 2012106 MAG Madagascar Africa
Gene Location Start: 12164;  End: 15445  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.23

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH59 54 756 7.4e-175 0.993660855784469

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02057 Glyco_hydro_59 1.39e-89 61 397 1 293
Glycosyl hydrolase family 59.
sd00036 LRR_3 4.37e-10 995 1065 20 89
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 5.57e-10 999 1074 1 77
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 9.55e-10 996 1069 44 116
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 1.79e-09 995 1057 66 127
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
ACL75594.1 1.02e-273 46 902 36 921
ABG76970.1 1.02e-273 46 902 36 921
QUL53185.1 1.68e-273 34 909 307 1206
AUX40340.1 5.85e-271 44 892 57 932
QNK59157.1 8.61e-270 34 900 305 1197

PDB Hits      help

has no PDB hit.

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
B5X3C1 4.28e-23 58 756 34 662
Galactocerebrosidase OS=Salmo salar OX=8030 GN=galc PE=2 SV=1
Q0VA39 5.88e-23 54 756 38 672
Galactocerebrosidase OS=Xenopus tropicalis OX=8364 GN=galc PE=2 SV=1
Q498K0 1.52e-20 58 756 44 671
Galactocerebrosidase OS=Xenopus laevis OX=8355 GN=galc PE=2 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000353 0.998853 0.000228 0.000234 0.000171 0.000145

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000586_00010.