Environment: Human Gut

CAZyme Information: MGYG000000003_02015


Basic Information

Species Alistipes shahii
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Rikenellaceae; Alistipes; Alistipes shahii
CAZyme ID MGYG000000003_02015
CAZy Family PL29
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 858
CGC MGYG000000003_4|CGC3
Molecular Weight 93100.08
Isoelectric Point 4.8772
Genome Property
Genome Assembly ID MGYG000000003
Genome Size 3229518
Genome Type Isolate
Country United Kingdom
Continent Europe
Gene Location Start: 327289; End: 329865; Strand: +
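The gene coordinates and the protein length listed above can be cross-checked against each other. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the gene span includes a terminal stop codon; the page states neither, and the function name `protein_length_from_span` is hypothetical.

```python
def protein_length_from_span(start: int, end: int, includes_stop: bool = True) -> int:
    """Estimate protein length (residues) from gene coordinates.

    Assumes 1-based inclusive coordinates (an assumption, not stated by the page).
    """
    span = end - start + 1  # nucleotides covered by the gene
    if span % 3 != 0:
        raise ValueError(f"span of {span} nt is not a whole number of codons")
    codons = span // 3
    # Assumption: the last codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons


# Coordinates taken from the Gene Location entry above.
length = protein_length_from_span(327289, 329865)
print(length)  # 858
```

Under these assumptions the span is 2577 nt, or 859 codons, which matches the reported protein length of 858 residues plus one stop codon.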

Full Sequence

MNKLFLWIAA  ASAFVLQSCE  NTDGIRDEIG  SLRDRVIALE  AKIGSVNTSI  VALHKLMDES  60
TIIVGIEPNA  KGYEIELSDG  TRLPVILGEK  IEALVPVMGI  DAEGYWTVSL  DGGATSERLK  120
VGGEYVSAWP  VSGGDHKPGA  EGVTPQLKVS  ADGEWLVSLD  GGATYAPLLQ  NGQPVNALGD  180
KVVVSYSSAF  KSVTYDATTG  LLAVELLDGE  KLTLPVFDDF  GLTVTASDNE  TFRLGETRAF  240
EVVQNNVAEA  VIDAPAGWTA  VLGETTLTVK  APATFDAASQ  QAAVSVTVYS  DRKYRKLVTL  300
NVTLLDEQVD  ANAALAWRNF  KAGTANNVLL  DYSYAGYKHG  EEAPADVWGL  GYKVYNVVDY  360
GADPTGVRSS  RGALAALLKE  LKLSGRSDAG  ANLANANARA  VIYFPEGRFV  LHNDDDNVVD  420
PTSANQKYTD  SKGNNRSEEI  FIRGGYFVLK  GAGRGKTTLV  MDTPNLPNNS  EQMWSSPMMI  480
NIKHNSGLSD  LTTVTGDAAR  GTFSVEVASA  AGIGKGDWVC  LSLSNNDPTL  VAQELAPHRV  540
EGNMTDIQTI  TVEDYHQVAS  VSGNRVTFAE  PIMYAVEAKW  GWKIRKYPHY  EHVGVEDLTF  600
EGRSKENFGH  HASWEDDGAY  KPLNMMRLTD  SWIRRVDFRG  VSEALSIVSS  ANCSAYDIEI  660
SGNRGHSGVR  SQSSSRIFIG  KVCDRSRGQA  VSPPYTSTGY  FENAGQYHAS  GVSNTSLGAV  720
LWNNTWGDDA  FFESHSRQPR  ATLVDRCTGG  FVQWRFGGDE  TNVPNHLGDL  TIWNLNATRA  780
AHDFGAEPFK  WWLSSDKWWK  TMPPIIVGFH  GAAVTFDESA  EQVKYLESNG  AAVEPLSLYE  840
AQLRQRLGYV  PAWLNSLK  858
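The sequence block is laid out in groups of ten residues with a running position counter at the end of each line, so it needs cleaning before it can be used as input to other tools. The following is a minimal sketch of one way to do that; the helper name `clean_sequence` is hypothetical, and the approach simply assumes that every letter in the block is a residue and everything else is layout.

```python
import re


def clean_sequence(block: str) -> str:
    """Keep only residue letters, dropping whitespace and position counters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


# Final line of the sequence block above, including its position counter.
last_line = "AQLRQRLGYV  PAWLNSLK858"
residues = clean_sequence(last_line)
print(residues)       # AQLRQRLGYVPAWLNSLK
print(len(residues))  # 18
```

Applied to the whole block, the cleaned string should be 858 residues long, which agrees with the reported protein length (840 residues on the preceding lines plus these 18).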

Enzyme Prediction

No EC number is predicted for MGYG000000003_02015.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
PL29 354 684 2.5e-111 0.9966777408637874

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam16315 DUF4955 9.16e-78 705 857 1 149
Domain of unknown function (DUF4955). This family consists of uncharacterized proteins around 850 residues in length and is mainly found in various Bacteroides species. The function of this protein is unknown.
pfam16378 DUF4988 9.02e-42 25 215 1 181
Domain of unknown function. This family, around 200 residues long, is located at the N-terminus of some uncharacterized proteins in various Bacteroides and Alistipes species. The function of this family remains unknown. The N-terminus of this model has been clipped by ~30 residues because it was capturing parts of collagen sequences (pfam01391).

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
CBK63379.1 0.0 1 858 1 858
QNL40303.1 5.46e-265 1 858 6 833
QDM10828.1 3.10e-264 1 858 6 833
QUT78855.1 3.10e-264 1 858 6 833
QRQ54963.1 1.77e-221 1 858 6 862

PDB Hits

MGYG000000003_02015 has no PDB hit.

Swiss-Prot Hits

MGYG000000003_02015 has no Swiss-Prot hit.

SignalP and Lipop Annotations

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000000 1.000062 0.000000 0.000000 0.000000
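The LIPO call above is consistent with taking the highest-scoring class in the score row. The following is a minimal sketch of that reading, assuming the prediction is simply the top-scoring column; the page does not state the decision rule, and the helper name `top_class` is hypothetical.

```python
def top_class(labels, scores):
    """Return the (label, score) pair with the highest score."""
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    best_index = max(range(len(scores)), key=scores.__getitem__)
    return labels[best_index], scores[best_index]


# Column names and values copied from the score table above.
labels = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
          "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
scores = [0.000000, 0.000000, 1.000062, 0.000000, 0.000000, 0.000000]

best_label, best_score = top_class(labels, scores)
short_name = best_label.split("_")[0]  # "LIPO_Sec_SPII" -> "LIPO"
print(short_name, best_score)
```

The short name is obtained by splitting the column label on the first underscore, which reproduces the LIPO label used in the prediction line.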

TMHMM Annotations

There are no transmembrane helices in MGYG000000003_02015.