logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001748_01336

You are here: Home > Sequence: MGYG000001748_01336

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species CAG-56 sp900762665
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-56; CAG-56 sp900762665
CAZyme ID MGYG000001748_01336
CAZy Family GH59
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1683 MGYG000001748_13|CGC1 182294.88 4.7209
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001748 4218051 MAG Sweden Europe
Gene Location Start: 9597;  End: 14648  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.23

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH59 301 988 2.4e-158 0.993660855784469

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02057 Glyco_hydro_59 2.17e-91 308 645 2 292
Glycosyl hydrolase family 59.
sd00036 LRR_3 1.03e-15 1572 1658 63 139
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
sd00036 LRR_3 3.48e-15 1572 1660 17 95
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306 LRR_5 3.43e-14 1572 1660 37 113
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
pfam13306 LRR_5 6.76e-14 1568 1644 10 75
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QLH25411.1 3.06e-269 271 1141 19 930
AMW11854.1 4.44e-269 271 1116 19 909
QUW95523.1 5.63e-266 271 1116 22 910
AZM74018.1 3.08e-264 287 1116 38 907
QKW59509.1 1.21e-263 287 1116 38 907

PDB Hits      help

has no PDB hit.

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q0VA39 5.14e-15 288 652 26 347
Galactocerebrosidase OS=Xenopus tropicalis OX=8364 GN=galc PE=2 SV=1
Q498K0 3.68e-12 288 652 26 346
Galactocerebrosidase OS=Xenopus laevis OX=8355 GN=galc PE=2 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000358 0.998852 0.000222 0.000212 0.000176 0.000151

TMHMM  Annotations      download full data without filtering help

start end
11 33