logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002916_00171

You are here: Home > Sequence: MGYG000002916_00171

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Collinsella sp900545055
Lineage Bacteria; Actinobacteriota; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella; Collinsella sp900545055
CAZyme ID MGYG000002916_00171
CAZy Family GT2
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
828 92629.57 4.7966
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002916 1930338 MAG United States North America
Gene Location Start: 2023;  End: 4509  Strand: -

Full Sequence      Download help

MSIKRLALCR  SQGRLFVLLR  FAGQDVAALI  EREGSQAFAH  ATTSGSSVPS  LVLPVDHGRV60
LALCPSVSDY  ERELAVLVLP  FLDGSTIDVT  FASGDQRLGS  IRLDSRVAKL  ESKINYKAKP120
ALCALIRDAQ  RGEHCGLYEI  DAIRYLPADA  GAVWRYEVTW  AGDPQCAPEL  LIFDTHMNAI180
DATVHVFESQ  FDVPQQNGCR  VNKTYLSVEM  PRDIRDFVAI  VSDPTGLIQS  GFCAMDGRLY240
NGMVDDSWNR  MKDARADDAA  YRRWFEQHRA  KPGDMACQRV  ASAAFAYRPL  VSIVVPCYKT300
DRVYLRELLD  SVLAQSYDNW  ELLLMDASPE  WDAVANLAAD  ANDERIRRIE  LPGNGGIVVN360
TNAGIEQATG  DYIAFLDHDD  ILEPDALFHY  VAALNKAAEG  ERPQVLFCDE  DMFQKTGEWG420
QPVFKTKLNV  DLLYSHNCVT  HFLMVEKALI  DRIGTSPEDV  AGAQDYDLTL  RCLSADARFE480
HVAHVLYHWR  VHPGSTADGS  ADSKPYAIEA  GRLALQRHFN  ALGICGTVED  TETPFVYRMR540
YALPEPAPLA  SIVIPTKDHI  ETLDACVMSI  AQKATYANYE  IVLVENNSEA  PETFAYYETL600
PERVAATSEG  KGIARVVYWP  GEFNYSQIIN  FGVEHAKGDY  LLLLNNDTEV  ISPDFIEEMM660
GYLQRPDAGV  VGAKLYFADH  LVQHAGILVG  VRGALAHANQ  DFSAKREGYL  ARAVRPGNFS720
AVTGACQMVR  RDVFEQVGGY  NEEFAVGFND  ADFCLRVREA  GYRTIFTPYA  ELYHYEFTSR780
GREEANEEKL  RRWKREQALF  TQRWPEFFLT  GDPWLGPNLS  SESEYFSL828

Enzyme Prediction      help

No EC number prediction in MGYG000002916_00171.

CAZyme Signature Domains help

Created with Snap4182124165207248289331372414455496538579621662703745786292451GT2551737GT2
Family Start End Evalue family coverage
GT2 292 451 1.6e-23 0.9764705882352941
GT2 551 737 4.1e-22 0.9882352941176471

CDD Domains      download full data without filtering help

Created with Snap4182124165207248289331372414455496538579621662703745786289491GT2_RfbC_Mx_like552776GT_2_like_c548800GT2292496GT_2_WfgS_like551734Glycos_transf_2
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd04184 GT2_RfbC_Mx_like 8.93e-91 289 491 1 202
Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
cd04186 GT_2_like_c 5.68e-41 552 776 1 166
Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
COG1216 GT2 7.16e-30 548 800 3 247
Glycosyltransferase, GT2 family [Carbohydrate transport and metabolism].
cd06433 GT_2_WfgS_like 1.24e-23 292 496 1 200
WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
pfam00535 Glycos_transf_2 1.66e-20 551 734 1 164
Glycosyl transferase family 2. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.

CAZyme Hits      help

Created with Snap41821241652072482893313724144554965385796216627037457861827ATP53737.1|GT21828QIA33434.1|GT21828AZH69434.1|GT21828QOY61397.1|GT21828ACV56006.1|GT2
Hit ID E-Value Query Start Query End Hit Start Hit End
ATP53737.1 0.0 1 827 1 827
QIA33434.1 0.0 1 828 1 828
AZH69434.1 0.0 1 828 1 828
QOY61397.1 5.76e-295 1 828 1 801
ACV56006.1 6.33e-181 1 828 1 829

PDB Hits      download full data without filtering help

Created with Snap41821241652072482893313724144554965385796216627037457862903945HEA_A2784972Z87_A2784972Z86_A2893853BCV_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
5HEA_A 7.03e-10 290 394 6 109
CgTstructure in hexamer [Streptococcus parasanguinis FW213],5HEA_B CgT structure in hexamer [Streptococcus parasanguinis FW213],5HEA_C CgT structure in hexamer [Streptococcus parasanguinis FW213],5HEC_A CgT structure in dimer [Streptococcus parasanguinis FW213],5HEC_B CgT structure in dimer [Streptococcus parasanguinis FW213]
2Z87_A 1.61e-09 278 497 363 578
Crystalstructure of chondroitin polymerase from Escherichia coli strain K4 (K4CP) complexed with UDP-GalNAc and UDP [Escherichia coli],2Z87_B Crystal structure of chondroitin polymerase from Escherichia coli strain K4 (K4CP) complexed with UDP-GalNAc and UDP [Escherichia coli]
2Z86_A 1.62e-09 278 497 364 579
Crystalstructure of chondroitin polymerase from Escherichia coli strain K4 (K4CP) complexed with UDP-GlcUA and UDP [Escherichia coli],2Z86_B Crystal structure of chondroitin polymerase from Escherichia coli strain K4 (K4CP) complexed with UDP-GlcUA and UDP [Escherichia coli],2Z86_C Crystal structure of chondroitin polymerase from Escherichia coli strain K4 (K4CP) complexed with UDP-GlcUA and UDP [Escherichia coli],2Z86_D Crystal structure of chondroitin polymerase from Escherichia coli strain K4 (K4CP) complexed with UDP-GlcUA and UDP [Escherichia coli]
3BCV_A 1.48e-06 289 385 5 100
Crystalstructure of a putative glycosyltransferase from Bacteroides fragilis [Bacteroides fragilis NCTC 9343],3BCV_B Crystal structure of a putative glycosyltransferase from Bacteroides fragilis [Bacteroides fragilis NCTC 9343]

Swiss-Prot Hits      download full data without filtering help

Created with Snap4182124165207248289331372414455496538579621662703745786258828sp|P55465|Y4GI_SINFN271820sp|Q50864|RFBC_MYXXA287498sp|Q68X33|Y329_RICTY287498sp|Q9ZDI9|Y339_RICPR287664sp|Q4UM29|Y543_RICFE
Hit ID E-Value Query Start Query End Hit Start Hit End Description
P55465 3.63e-116 258 828 338 897
Uncharacterized protein y4gI OS=Sinorhizobium fredii (strain NBRC 101917 / NGR234) OX=394 GN=NGR_a03550 PE=4 SV=1
Q50864 8.28e-76 271 820 301 832
O-antigen biosynthesis protein RfbC OS=Myxococcus xanthus OX=34 GN=rfbC PE=4 SV=1
Q68X33 6.32e-16 287 498 6 218
Uncharacterized glycosyltransferase RT0329 OS=Rickettsia typhi (strain ATCC VR-144 / Wilmington) OX=257363 GN=RT0329 PE=3 SV=1
Q9ZDI9 1.54e-15 287 498 6 218
Uncharacterized glycosyltransferase RP339 OS=Rickettsia prowazekii (strain Madrid E) OX=272947 GN=RP339 PE=3 SV=1
Q4UM29 1.62e-15 287 664 12 397
Uncharacterized glycosyltransferase RF_0543 OS=Rickettsia felis (strain ATCC VR-1525 / URRWXCal2) OX=315456 GN=RF_0543 PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000049 0.000001 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000002916_00171.