dbCAN-seq: a database of CAZyme sequence and annotation

You are browsing environment: HUMAN GUT

help

CAZyme Information: MGYG000002152_00480

You are here: Home > Sequence: MGYG000002152_00480

Basic Information help

Species

UBA5446 sp900544295

Lineage

Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Oscillospiraceae; UBA5446; UBA5446 sp900544295

CAZyme ID

MGYG000002152_00480

CAZy Family

SLH

CAZyme Description

hypothetical protein

CAZyme Property

Protein Length	CGC	Molecular Weight	Isoelectric Point
1157	MGYG000002152_3\|CGC1	126154.99	4.2547

Genome Property

Genome Assembly ID	Genome Size	Genome Type	Country	Continent
MGYG000002152	2803193	MAG	Germany	Europe

Gene Location

Start: 22210; End: 25683 Strand: -

Full Sequence Download help

Enzyme Prediction help

No EC number prediction in MGYG000002152_00480.

CAZyme Signature Domains help

Family	Start	End	Evalue	family coverage
SLH	1030	1071	1.1e-17	0.9761904761904762
SLH	970	1011	8.7e-16	0.9761904761904762

CDD Domains download full data without filtering help

Cdd ID	Domain	E-Value	qStart	qEnd	sStart	sEnd	Domain Description
sd00036	LRR_3	2.60e-17	125	287	1	142	leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
NF033190	inl_like_NEAT_1	2.59e-16	916	1070	586	743	NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
sd00036	LRR_3	4.18e-16	150	304	1	136	leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306	LRR_5	3.39e-15	114	251	12	123	Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
pfam00395	SLH	5.94e-15	1030	1071	1	42	S-layer homology domain.

CAZyme Hits help

Hit ID	E-Value	Query Start	Query End	Hit Start	Hit End
QUL56471.1	9.04e-31	907	1120	1101	1302
QKS46422.1	1.38e-29	874	1118	1085	1305
QLG41554.1	1.23e-28	908	1114	1094	1294
QHW31849.1	3.16e-28	851	1119	1340	1580
QOS78172.1	4.83e-28	908	1114	1106	1306

PDB Hits download full data without filtering help

Hit ID	E-Value	Query Start	Query End	Hit Start	Hit End	Description
6BT4_A	9.43e-13	923	1078	37	195	Crystalstructure of the SLH domain of Sap from Bacillus anthracis in complex with a pyruvylated SCWP unit [Bacillus anthracis]
3PYW_A	9.62e-13	923	1078	16	174	Thestructure of the SLH domain from B. anthracis surface array protein at 1.8A [Bacillus anthracis]

Swiss-Prot Hits download full data without filtering help

Hit ID	E-Value	Query Start	Query End	Hit Start	Hit End	Description
Q06852	3.24e-31	910	1119	2087	2298	Cell surface glycoprotein 1 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=olpB PE=3 SV=2
Q06848	9.16e-27	913	1120	228	433	Cellulosome-anchoring protein OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=ancA PE=1 SV=1
Q06853	1.55e-26	924	1120	480	673	Cell surface glycoprotein 2 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=Cthe_3079 PE=1 SV=1
P38535	5.03e-16	930	1075	931	1077	Exoglucanase XynX OS=Acetivibrio thermocellus OX=1515 GN=xynX PE=3 SV=1
P38536	8.02e-16	930	1075	1705	1851	Amylopullulanase OS=Thermoanaerobacterium thermosulfurigenes OX=33950 GN=amyB PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other	SP_Sec_SPI	LIPO_Sec_SPII	TAT_Tat_SPI	TATLIP_Sec_SPII	PILIN_Sec_SPIII
0.000272	0.999073	0.000177	0.000170	0.000158	0.000141

TMHMM Annotations download full data without filtering help

start	end
5	27