You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001660_01341

Basic Information

Species HGM11788 sp900760465
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; CAG-274; HGM11788; HGM11788 sp900760465
CAZyme ID MGYG000001660_01341
CAZy Family PL1
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 1252
CGC (no value listed)
Molecular Weight 139792.62
Isoelectric Point 4.3063
Genome Property
Genome Assembly ID MGYG000001660
Genome Size 2161039
Genome Type MAG
Country United States
Continent North America
Gene Location Start: 172; End: 3930; Strand: -
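
As a quick consistency check, the gene coordinates can be related to the reported protein length of 1252. The sketch below assumes 1-based inclusive coordinates and assumes the gene span ends with a stop codon; the function name `coding_length` is illustrative and not part of the database.

```python
def coding_length(start: int, end: int) -> int:
    """Nucleotide span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Gene Location entry (Start: 172; End: 3930).
nt_length = coding_length(172, 3930)

# A complete open reading frame should be a whole number of codons.
codons, remainder = divmod(nt_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(nt_length, codons, remainder, protein_length)
```

Under these assumptions the span is 3759 nt, or 1253 codons, which agrees with the listed protein length of 1252 residues plus one stop codon.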

Enzyme Prediction

No EC number prediction in MGYG000001660_01341.

CAZyme Signature Domains

Family  Start  End  E-Value   Family Coverage
PL1     84     277  4.7e-77   0.994535519125683

CDD Domains

Cdd ID     Domain  E-Value   qStart  qEnd  sStart  sEnd
sd00036    LRR_3   7.87e-31  1065    1193  14      142
pfam13306  LRR_5   9.95e-28  1077    1222  1       121
sd00036    LRR_3   1.33e-26  1099    1222  2       126
pfam13306  LRR_5   1.41e-25  1065    1183  11      127
pfam13306  LRR_5   2.88e-21  1038    1160  13      127

Domain Description
LRR_3 (sd00036): leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
LRR_5 (pfam13306): Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
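
The eleven-residue LRR consensus quoted in the domain description (LxxLxLxxN/CxL) can be turned into a simple pattern search. The following is a minimal sketch, not how CDD assigns domains: it assumes a strict regular expression in which `x` is any residue, and it runs on a made-up toy sequence because the full protein sequence is not reproduced in this record. Real LRRs are degenerate (other hydrophobic residues often replace leucine), so a strict pattern like this is likely to under-report repeats.

```python
import re

# Strict rendering of the consensus LxxLxLxxN/CxL (x = any residue).
LRR_CONSENSUS = re.compile(r"L..L.L..[NC].L")


def find_lrr_motifs(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping consensus hits."""
    return [(m.start() + 1, m.group()) for m in LRR_CONSENSUS.finditer(sequence)]


# Hypothetical toy sequence containing two consensus-like stretches.
toy_sequence = "MALKSLDLSNNKLGGLTELNLSGCPLK"

for start, motif in find_lrr_motifs(toy_sequence):
    print(start, motif)
```

Applied to a real sequence, the reported positions could be compared with the qStart and qEnd ranges of the LRR_3 and LRR_5 hits listed above.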

CAZyme Hits

Hit ID      E-Value    Query Start  Query End  Hit Start  Hit End
AUO19476.1  5.29e-116  2            470        5          484
AKD03746.1  3.82e-108  26           463        33         452
AWK06249.1  1.51e-106  28           466        42         485
AXP82553.1  1.62e-105  21           498        113        582
AUO19400.1  1.92e-105  6            462        8          500

PDB Hits

No PDB hit for MGYG000001660_01341.

Swiss-Prot Hits

Hit ID  E-Value   Query Start  Query End  Hit Start  Hit End
    Description
B8NQQ7  8.16e-60  17           471        8          419
    Probable pectate lyase C OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / IAM 13836 / NRRL 3357 / JCM 12722 / SRRC 167) OX=332952 GN=plyC PE=3 SV=1
Q2UB83  9.46e-59  17           471        8          419
    Probable pectate lyase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) OX=510516 GN=plyC PE=3 SV=1
B0XMA2  3.61e-53  13           470        7          419
    Probable pectate lyase C OS=Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) OX=451804 GN=plyC PE=3 SV=1
Q4WL88  3.61e-53  13           470        7          419
    Probable pectate lyase C OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) OX=330879 GN=plyC PE=3 SV=1
Q0CLG7  4.77e-53  9            470        3          418
    Probable pectate lyase C OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) OX=341663 GN=plyC PE=3 SV=1

SignalP and Lipop Annotations

This protein is predicted as LIPO (lipoprotein signal peptide, Sec/SPII).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000497 0.420121 0.578752 0.000231 0.000208 0.000170
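
The LIPO call corresponds to the highest-scoring class in the row above. As a minimal sketch, assuming the prediction is simply the class with the largest probability (the exact decision rule used by the annotation pipeline is not stated in this record), the scores can be checked like this:

```python
# Class probabilities copied from the SignalP and Lipop table.
scores = {
    "Other": 0.000497,
    "SP_Sec_SPI": 0.420121,
    "LIPO_Sec_SPII": 0.578752,
    "TAT_Tat_SPI": 0.000231,
    "TATLIP_Sec_SPII": 0.000208,
    "PILIN_Sec_SPIII": 0.000170,
}

# Rank classes from most to least probable.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
best_class, best_score = ranked[0]
runner_up, runner_up_score = ranked[1]

# Margin between the top two classes; a small margin signals an uncertain call.
margin = best_score - runner_up_score
total = sum(scores.values())

print(best_class, runner_up, round(margin, 6), round(total, 6))
```

The top class is LIPO_Sec_SPII, but SP_Sec_SPI is a fairly close second, so the lipoprotein assignment may be worth treating with some caution.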

TMHMM Annotations

start end
7 26