You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001748_00130

Basic Information

Species: CAG-56 sp900762665
Lineage: Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-56; CAG-56 sp900762665
CAZyme ID: MGYG000001748_00130
CAZy Family: GH141
CAZyme Description: hypothetical protein

CAZyme Property
Protein Length: 1953 aa
CGC: MGYG000001748_1|CGC1
Molecular Weight: 214683.91 Da
Isoelectric Point: 4.7443

Genome Property
Genome Assembly ID: MGYG000001748
Genome Size: 4218051 bp
Genome Type: MAG
Country: Sweden
Continent: Europe
Gene Location: Start: 157476; End: 163337; Strand: +
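
The gene coordinates and the protein length can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon; neither assumption is stated on this page, and the function name is illustrative only.

```python
def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by inclusive, 1-based gene coordinates.

    Assumption: the span is a whole number of codons and ends in one stop codon.
    """
    n_nt = end - start + 1              # inclusive span in nucleotides
    if n_nt % 3 != 0:
        raise ValueError("gene span is not a multiple of 3 nucleotides")
    return n_nt // 3 - 1                # codon count minus the stop codon


gene_start, gene_end = 157476, 163337   # values from Gene Location
print(expected_protein_length(gene_start, gene_end))
```

Under those assumptions the span works out to 5862 nucleotides, or 1954 codons including the stop, which agrees with the reported protein length of 1953.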

Full Sequence

MKRLCSVLLA  VVLLCSGIPT  VTFAQGTGTS  GAQRKVAHTL  YVSTDGRDNG  DGTEQNPFQT  60
IEQARDAVRT  LDKTKGDIVV  KIAGGTYYLD  NTIAFTEADS  GNENCTIYYE  AVDGERPVIS  120
GGEKVTGDWR  DEGDGTYSIP  YERDIKLRSL  YVNGERAYMT  QRDSQGRGDY  GSYTVDSSKD  180
WAWISGTRAA  GTQLDAGAIP  LDTRNQDDIE  LMTQTTWNTA  IVCVDKLQDI  GNGRISANYQ  240
MPYGAVAQQP  SWNNNYKSGG  WQMMYNVFEW  LPGAKGHFYF  DKTEKRLYYC  PRDGEDMNDL  300
EVIAPKLETL  IDLSGSSTTS  RIGYITFSGL  EFAHSDWNLY  ELEGSYGRVT  VQGAAGLIYF  360
ADGNWHPSIY  RAYDVGPGAV  MVNSAQHIAF  YGNTICHTGN  DGLSFVNDVV  DSTVSGNLIY  420
DTAGSAFLLG  HPQHVYIGDK  GSNYGAFSEK  EKYDVGVEGA  CKRIKLTNNF  ISDTSLMFWG  480
DAGVMVFLAE  EFEMKYNHLQ  NTPYSGLSLG  WGWWNMDGSN  GAVVPGVPME  TTKNNTIMYN  540
TFKNTITKLG  DAGAIYTLGD  MPGTKISENY  IWSIGTPGID  PYHIRGIHVD  EGTKHVYGEK  600
NVIEILPKLT  CVDCGNWGWK  GNNTWDNNYA  TTESYTTTGT  WEPGTVVTNA  HTSLEGIWGT  660
EVFDILKNVG  IQSDYYSIIP  ESMFGLQDRL  LPNKIYAARQ  ELDWGTAAQN  IKGEIWLAPE  720
GTEEFVESDA  VVQVKDGKVV  VPDVNGIYKL  YIVNGTEVSA  PSSGQIIVEA  GAPIRNAAEG  780
ERKKTSTQKP  FALELNTKYY  KDFVLRKAEA  PDIPGENVTD  GYKITEAGSY  ILQAKDLNNL  840
KAEVSFEVYE  NLVDQVFSKN  IQSKPGNSVR  LDTTGMDGET  AWFVPEGMEI  NKVSQLTESE  900
QMTKAESGAS  EIAAPRAVGN  YQMYLVMDDV  ISEPSDAVLT  VFMGGLPITD  GLLARFDAED  960
IENGDGKAVS  EWQDSTKQYS  LVQTEAGRQP  IIQNTENDMA  YLSFDGSDDY  LQLKEDQEID  1020
LNQKSNLTII  TLSAYKETDP  PTGTYGDEKT  TVFFPESGSW  GSLYMSNYAG  FMVSRFGSGQ  1080
SNNYNKYMRP  AATSRFTTAA  MVKDGKTEYM  YDDGEKVYTN  TDRYEQTNNL  QKSMMVGVTK  1140
ASNKDSYANI  EVSEILIYDR  SLSDDEIEKI  YNYTSRKQYL  KSLEAQMEAA  EEVFADPDAE  1200
TKYSEASRNN  LKHVYNGAME  FAANFTTAIE  NPEAAAAEWT  NKLTNAINAL  VPPVTTVPSE  1260
GLALWLKADE  GITLDEDGGV  SVWNDYSGLG  RNAVKAQNAQ  PNETVTSPKV  IEDLYNGKPA  1320
VRFNGSSDGM  QFPFAGLNNQ  SEATVVLVSA  NQVKTDVVGT  GDNRPLLCFD  ESGGWGKFII  1380
TPTQDEVNAR  IGSGQESDKG  GYKKYTYPES  IGNRLSASVV  WKNGSEETIY  VGDQEVMRVT  1440
DAQSTIKNVK  DDIGYLGRFP  TGGDSAYWYN  QSDVAEVLIY  NRALTLAEIQ  QINSYLEDKY  1500
QISAQVVLES  ITVTPPANTV  YTVGEELDLT  GMEVTARYTD  GSIKAITEGF  KVTGYDKDRP  1560
GEQTITISYT  EQGLEKTATF  TVTVRSAVEP  EVLESITVTP  PAKTAYIVGE  ELELTGMEVT  1620
ARYTDGSTKV  ITKGYTVTGY  DKDVPGEQTI  TISYTEQGVE  KTAAFTVTVR  STVDPEVTNV  1680
EGLISQIGAV  AYNNTTKAKI  EAAENAYKKL  TPQQQALVSN  YDALKSARAN  YDALKADAEK  1740
RAADQEAADR  VSSLISGIGT  VSAGSKAKID  AAEKAYNALT  ADQKKLVKNY  SVLTSAKEAY  1800
QKITALPRKG  AKFLVGNLWY  QVTRSDVKNG  TVTVVKAKNK  NYKSINIKST  VKIKGYTFKI  1860
TAIGKKAFYK  NRGLTSIKVG  KNIVKIDSYA  FYGCTKLKSV  RIYSTKLKTV  GKNAFGKTAK  1920
NIEVRVPKNP  KKLLKKYQNL  LKKGGSKKAK  YKR  1953
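
The sequence is displayed in blocks of ten residues with a running position counter at the end of each line, so it has to be cleaned before it is passed to other tools. The following is a minimal sketch using only the Python standard library; the function name is illustrative, and the example input is the first line of the block above.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Join a display-formatted protein block into one contiguous string.

    Strips the position counters (digits) and all whitespace, keeping only
    the residue letters.
    """
    return re.sub(r"[\d\s]+", "", block)


first_line = (
    "MKRLCSVLLA  VVLLCSGIPT  VTFAQGTGTS  GAQRKVAHTL  YVSTDGRDNG  DGTEQNPFQT60"
)
seq = clean_sequence_block(first_line)
print(len(seq), seq[:10])
```

Applied to the whole block, the cleaned string should have the same length as the final position counter (1953), which is a quick way to confirm that nothing was lost when copying.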

Enzyme Prediction

No EC number is predicted for MGYG000001748_00130.

CAZyme Signature Domains

[Figure: domain map along the 1953-residue sequence; GH141 at residues 39-604]

Family  Start  End  E-value   Family coverage
GH141   39     604  4.9e-124  0.9905123339658444

CDD Domains

[Figure: CDD domain map along the 1953-residue sequence; all five LRR_3 and LRR_5 hits fall within residues 1838-1915]

Cdd ID     Domain  E-value   qStart  qEnd  sStart  sEnd
sd00036    LRR_3   1.07e-13  1840    1915  14      102
pfam13306  LRR_5   2.24e-12  1861    1915  1       53
pfam13306  LRR_5   6.62e-12  1843    1915  37      120
pfam13306  LRR_5   7.08e-12  1838    1915  9       75
sd00036    LRR_3   9.37e-12  1843    1915  63      125

Domain Description
LRR_3 (sd00036): Leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
LRR_5 (pfam13306): Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.

CAZyme Hits

[Figure: hit alignment map along the 1953-residue sequence. Hit families: AUX40294.1 GH141; AEY67284.1 CBM6, GH141; ADI13073.1 GH141; AUX31799.1 GH141; ACL76007.1 CBM6, GH141]

Hit ID      E-value    Query Start  Query End  Hit Start  Hit End
AUX40294.1  5.53e-163  41           775        19         730
AEY67284.1  5.08e-162  2            788        8          772
ADI13073.1  5.44e-162  10           767        14         761
AUX31799.1  1.10e-160  41           775        97         808
ACL76007.1  4.70e-160  2            788        8          772
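
To see how much of the query each hit actually aligns to, the query span can be divided by the protein length. The following is a minimal sketch, assuming the query coordinates are 1-based and inclusive (not stated on this page); the variable and function names are illustrative.

```python
PROTEIN_LENGTH = 1953  # from Basic Information

# (hit ID, query start, query end), copied from the CAZyme Hits table
hits = [
    ("AUX40294.1", 41, 775),
    ("AEY67284.1", 2, 788),
    ("ADI13073.1", 10, 767),
    ("AUX31799.1", 41, 775),
    ("ACL76007.1", 2, 788),
]


def query_coverage(q_start: int, q_end: int, length: int = PROTEIN_LENGTH) -> float:
    """Fraction of the query covered by an inclusive alignment span."""
    return (q_end - q_start + 1) / length


for hit_id, q_start, q_end in hits:
    print(f"{hit_id}: {query_coverage(q_start, q_end):.1%}")
```

No hit extends past residue 788, so every alignment covers only the N-terminal part of the protein, roughly 40% of its length at most. This is the part that contains the GH141 signature domain.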

PDB Hits

[Figure: hit alignment map along the 1953-residue sequence; 5MQP_A at residues 30-628]

Hit ID  E-value   Query Start  Query End  Hit Start  Hit End
5MQP_A  5.74e-43  30           628        12         601

Description: Glycoside hydrolase BT_1002 [Bacteroides thetaiotaomicron]. Chains 5MQP_B, 5MQP_C, 5MQP_D, 5MQP_E, 5MQP_F, 5MQP_G, and 5MQP_H carry the same description.

Swiss-Prot Hits

MGYG000001748_00130 has no Swiss-Prot hit.

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Class            Score
Other            0.000230
SP_Sec_SPI       0.998937
LIPO_Sec_SPII    0.000290
TAT_Tat_SPI      0.000179
TATLIP_Sec_SPII  0.000155
PILIN_Sec_SPIII  0.000135
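
The SP call can be related to the per-class scores by picking the highest-scoring class. The following is a minimal sketch, assuming the reported call is simply the top-scoring class; that is a common way to read this kind of output, not something this page states.

```python
# Per-class scores copied from the SignalP and Lipop table
scores = {
    "Other": 0.000230,
    "SP_Sec_SPI": 0.998937,
    "LIPO_Sec_SPII": 0.000290,
    "TAT_Tat_SPI": 0.000179,
    "TATLIP_Sec_SPII": 0.000155,
    "PILIN_Sec_SPIII": 0.000135,
}

# Assumption: the predicted class is the one with the highest score
best_class = max(scores, key=scores.get)
print(best_class, scores[best_class])
```

Here the top class is SP_Sec_SPI, which is consistent with the SP call above.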

TMHMM Annotations

Start  End
7      29