logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000642_00110

You are here: Home > Sequence: MGYG000000642_00110

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Actinobacteriota; Actinomycetia; Actinomycetales; Actinomycetaceae; Arcanobacterium_A;
CAZyme ID MGYG000000642_00110
CAZy Family GH33
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2180 234909.52 4.5409
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000642 1644124 MAG Madagascar Africa
Gene Location Start: 124963;  End: 131505  Strand: -

Full Sequence      Download help

MQHNHRVNTT  KCFARLLSAL  YVTALMLSGL  LVTPPYAAAA  PLASTITVEN  ANVKLVLTPT60
TSPAGETWAV  GEVVKYNLTL  TNKTTKKRGF  KIRSTNFDWD  ASNTATAEGY  RWTGAEPKKA120
MPCKGKLAHT  ITQADVAAGG  FTPSLVYNMY  NAASYESAPM  LAGGTAIPAT  SKIKLLNPKA180
SDAGWNVGEV  MRFSVELTND  TGIKRSFEID  STNLSGTDKC  RWGEFNNGVT  ANCNSIEITH240
KVDDSDFTAG  YFTPSIAWKM  YPVAQYKGAA  EILAPTVGEP  IKIGENLALV  SFTQSEDSEK300
EIYSAGDELV  FTLIVKNSGD  VNVAVKPELA  GGNVTFDESC  TVANLAPNAE  LTCTVRYALT360
NESLAQGKAT  PIAILTANNP  DGAELDRVTA  SAEIPVKGTF  AKADAFTAYD  ANPAGASVLS420
DLTMIAKTDS  DNNMRIPAIA  VTQNGDILAA  YDARPRAGEI  GKKVKRAGGD  SPNENSIMQK480
RSTDGGKTWQ  SATTIAAGKI  GSHGYSDPSY  VVDHTTGTIF  NFHVYSQDAG  LFTGQPTSYV540
YKEGKIDETD  KNAMNLGLSI  SRDNGYTWEQ  RVVTNQVLNG  IAAQHKFVGC  FATSGAGTQK600
QQEPHKGRLL  QQVACITDTN  GNGKGDNNVD  DVVAFTMYSD  DHGQTWNAGT  PTDPYALGRD660
NSKRMRFDEN  KVVELSDGSL  MLNSRTATWN  GKGYRLVGLS  KDGGQTWSDY  RVDETLIDPG720
NNAQLIRAFP  NANAGTLRSQ  VVLFSNTAHA  KSRTNGTISL  SCDDGKTWGV  RKQFRGAGTG780
YTTMAVQSDG  NIGLLMEPNG  GGWQDIGYQS  FNLSWVAGEG  QLCSEVSPVD  TQQVEASSDV840
AINPIKVADL  FTHNDPLLAD  TVSVEGLPEG  LSYNAQTGVI  EGTPTEKLNK  ITDYTVSVTM900
SEESDGTKYE  RKATAKINVR  VTPTGGRTWY  VAADGSDSND  GLSPERPFQT  LAPVNKLDLL960
PGDAVYLKRG  DVFNDQYMHL  KGSGSADAPI  IISAYGEASD  PLPIINTNGE  GVWYEDYNTK1020
LDNPSHKQKG  NVSTSILLKD  VEYIEVSELE  ITNKRQDDQE  PILKNQNAMD  RTGVAVIAEN1080
IGTVNHVVLK  KLYVHDVDGN  VYNKHMANGG  IYVIAHKPKD  ESVTGIARFD  DVQVIDNRVE1140
DVSRWGIGVG  YTGYTGEFHT  AQLSDETVAR  YGQTNVVVRG  NYVKRAGGDA  ITVFYALRPL1200
VEYNVSVDAS  KYINTEDYPG  TFGRVAAGIW  PWKTKDAVFQ  YNEAFNTLNG  AHGNGDAMPW1260
DSDWSDGTLY  QYNYSAGNTG  GTYMICGVQA  INSTFRYNIS  QNDIGGLLDP  VDRNPNGHIY1320
NNTFYVAKDV  PILKTIHTPN  GRATIENNIF  YYTGDTPRNE  DWNRSDGGGF  TKTYRNNLYY1380
NYANTPASDS  SAIKVTAGTQ  VFAGEQMQLA  PSEARADFLT  YRLPSDVVSA  NDTLVTNNAN1440
EAPASQSDVF  AGYRLADNSP  ATDTGKTITD  ENGFTPEADF  FGDPLVGATD  IGAVQHAKVG1500
ELSSTIYLVT  RDSDGVQTIH  VPQTEKNPTT  VSEVLENVAF  TPKLTLSVVD  ASGNAVDGSK1560
YLTAGMKVRA  TDDSGKYIEI  PLEITNEYSW  VENYVNAQQG  NIWFGQHQEN  SDAEWTNIDK1620
YDSQWPNWQV  DTYYGPGVTG  SQNLPPVGKR  SNLHGLLSDS  PDNAQDSTAM  AFRAPKTGWV1680
SFQVKENEPQ  LRQSSNSGGT  VELSLYKNGE  LLQKVELKDS  NTVAQDWKNY  LEDNPIYLEK1740
GDFLRVVVHS  VGKPTKPSLW  ISPVITYLDK  ELPVKAQAEY  AAASAKPQQL  ATATPVVKNA1800
DGEEIDVPQG  TTFAFGDLPE  GVTLADSADN  LQPGEVYIDS  AAGVVSFVPT  SQQANSTLTF1860
PVTVTYRNGS  ANDAEVTFTV  GEIKPVLVNI  TEVPTSNVTA  DAEDPASCTV  KPFVKITATE1920
GVEFYLNGVR  VDPGVIKYGY  GETVVVEAKA  LEGYAFSEDA  TTKWKWTAPT  WESLQCEEPQ1980
PDQTDAEKIV  LGLPAATDLT  KPEKTEVSFA  GVVLSEGQTL  PEGTVLTAEG  LPDGLSMNKT2040
TGEISGTLPE  VDVDADYEIT  VTVTYPDNSI  GTVTYTLKVT  NVPDAKPNPG  SDPDPEVKPN2100
PNPDSGSGEV  TPGSELNPEG  NDQDQRKAKD  PVTSDAASSG  TIAKTGTSDL  ALVSGLGLMF2160
MIAGAIMVRG  RKLRVARHNS  2180

Enzyme Prediction      help

EC 3.2.1.18 3.2.1.-

CAZyme Signature Domains help

Created with Snap10921832743654565476387298110901199130814171526163517441853196220719261494GH136428800GH33
Family Start End Evalue family coverage
GH136 926 1494 5.6e-148 0.9938900203665988
GH33 428 800 9.2e-73 0.9181286549707602

CDD Domains      download full data without filtering help

Created with Snap1092183274365456547638729811090119913081417152616351744185319622071435812Sialidase_non-viral423813NanH445794BNR_219942080RibLong17861880RibLong
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd15482 Sialidase_non-viral 1.64e-76 435 812 15 339
Non-viral sialidases. Sialidases or neuraminidases function to bind and hydrolyze terminal sialic acid residues from various glycoconjugates, they play vital roles in pathogenesis, bacterial nutrition and cellular interactions. They have a six-bladed, beta-propeller fold with the non-viral sialidases containing 2-5 Asp-box motifs (most commonly Ser/Thr-X-Asp-[X]-Gly-X-Thr- Trp/Phe). This CD includes eubacterial and eukaryotic sialidases.
COG4409 NanH 1.48e-32 423 813 261 659
Neuraminidase (sialidase) [Carbohydrate transport and metabolism, Cell wall/membrane/envelope biogenesis].
pfam13088 BNR_2 5.96e-17 445 794 1 280
BNR repeat-like domain. This family of proteins contains BNR-like repeats suggesting these proteins may act as sialidases.
pfam18957 RibLong 1.64e-11 1994 2080 6 93
Long Rib domain. This entry represents the Long Rib domain that is closely related to the pfam08428 Rib domain but has a conserved insertion. These domains are found in bacterial cell surface proteins.
pfam18957 RibLong 1.83e-07 1786 1880 14 92
Long Rib domain. This entry represents the Long Rib domain that is closely related to the pfam08428 Rib domain but has a conserved insertion. These domains are found in bacterial cell surface proteins.

CAZyme Hits      help

Created with Snap10921832743654565476387298110901199130814171526163517441853196220719091890ESV34046.1|GH1369091969QSG89108.1|GH1369091969VEG79925.1|GH1369091969BAJ67172.1|GH136|3.2.1.1409091890BBV24031.1|GH136
Hit ID E-Value Query Start Query End Hit Start Hit End
ESV34046.1 1.14e-201 909 1890 29 1064
QSG89108.1 1.27e-200 909 1969 29 1155
VEG79925.1 1.27e-200 909 1969 29 1155
BAJ67172.1 1.27e-200 909 1969 29 1155
BBV24031.1 1.57e-200 909 1890 29 1064

PDB Hits      download full data without filtering help

Created with Snap109218327436545654763872981109011991308141715261635174418531962207192414997V6M_A91114975GQC_A92615017V6I_A92613486KQT_A92613486KQS_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
7V6M_A 1.21e-180 924 1499 6 580
ChainA, Fibronectin type III domain-containing protein [Tyzzerella nexilis]
5GQC_A 2.82e-157 911 1497 2 598
Crystalstructure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_C Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_D Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_E Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_F Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_G Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_H Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQF_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQF_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum]
7V6I_A 5.26e-151 926 1501 13 614
ChainA, Lacto-N-biosidase [Bifidobacterium saguini DSM 23967]
6KQT_A 9.45e-113 926 1348 244 660
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - native protein [Eubacterium ramulus ATCC 29099]
6KQS_A 1.67e-108 926 1348 244 660
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - selenomethionine derivative [Eubacterium ramulus ATCC 29099]

Swiss-Prot Hits      download full data without filtering help

Created with Snap1092183274365456547638729811090119913081417152616351744185319622071433818sp|Q02834|NANH_MICVI432816sp|P29767|NANH_CLOSE430798sp|Q5RAF4|NEUR1_PONAB430798sp|Q99519|NEUR1_HUMAN430768sp|A6BMK7|NEUR1_BOVIN
Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q02834 1.31e-82 433 818 66 403
Sialidase OS=Micromonospora viridifaciens OX=1881 GN=nedA PE=1 SV=1
P29767 5.64e-17 432 816 393 826
Sialidase OS=Clostridium septicum OX=1504 PE=3 SV=1
Q5RAF4 7.79e-15 430 798 73 387
Sialidase-1 OS=Pongo abelii OX=9601 GN=NEU1 PE=2 SV=1
Q99519 7.79e-15 430 798 73 387
Sialidase-1 OS=Homo sapiens OX=9606 GN=NEU1 PE=1 SV=1
A6BMK7 1.38e-14 430 768 73 356
Sialidase-1 OS=Bos taurus OX=9913 GN=NEU1 PE=2 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000466 0.998706 0.000225 0.000234 0.000175 0.000154

TMHMM  Annotations      download full data without filtering help

start end
12 34
2146 2168