logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002152_00480

You are here: Home > Sequence: MGYG000002152_00480

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UBA5446 sp900544295
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Oscillospiraceae; UBA5446; UBA5446 sp900544295
CAZyme ID MGYG000002152_00480
CAZy Family SLH
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1157 MGYG000002152_3|CGC1 126154.99 4.2547
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002152 2803193 MAG Germany Europe
Gene Location Start: 22210;  End: 25683  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002152_00480.

CAZyme Signature Domains help

Family Start End Evalue family coverage
SLH 1030 1071 1.1e-17 0.9761904761904762
SLH 970 1011 8.7e-16 0.9761904761904762

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
sd00036 LRR_3 2.60e-17 125 287 1 142
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
NF033190 inl_like_NEAT_1 2.59e-16 916 1070 586 743
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
sd00036 LRR_3 4.18e-16 150 304 1 136
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306 LRR_5 3.39e-15 114 251 12 123
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
pfam00395 SLH 5.94e-15 1030 1071 1 42
S-layer homology domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QUL56471.1 9.04e-31 907 1120 1101 1302
QKS46422.1 1.38e-29 874 1118 1085 1305
QLG41554.1 1.23e-28 908 1114 1094 1294
QHW31849.1 3.16e-28 851 1119 1340 1580
QOS78172.1 4.83e-28 908 1114 1106 1306

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6BT4_A 9.43e-13 923 1078 37 195
Crystalstructure of the SLH domain of Sap from Bacillus anthracis in complex with a pyruvylated SCWP unit [Bacillus anthracis]
3PYW_A 9.62e-13 923 1078 16 174
Thestructure of the SLH domain from B. anthracis surface array protein at 1.8A [Bacillus anthracis]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q06852 3.24e-31 910 1119 2087 2298
Cell surface glycoprotein 1 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=olpB PE=3 SV=2
Q06848 9.16e-27 913 1120 228 433
Cellulosome-anchoring protein OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=ancA PE=1 SV=1
Q06853 1.55e-26 924 1120 480 673
Cell surface glycoprotein 2 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=Cthe_3079 PE=1 SV=1
P38535 5.03e-16 930 1075 931 1077
Exoglucanase XynX OS=Acetivibrio thermocellus OX=1515 GN=xynX PE=3 SV=1
P38536 8.02e-16 930 1075 1705 1851
Amylopullulanase OS=Thermoanaerobacterium thermosulfurigenes OX=33950 GN=amyB PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000272 0.999073 0.000177 0.000170 0.000158 0.000141

TMHMM  Annotations      download full data without filtering help

start end
5 27