You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000000313_00094

Basic Information

Species Collinsella vaginalis
Lineage Bacteria; Actinobacteriota; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella; Collinsella vaginalis
CAZyme ID MGYG000000313_00094
CAZy Family GH154
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 591 aa
CGC MGYG000000313_7|CGC1
Molecular Weight 65502.07 Da
Isoelectric Point 4.889
Genome Property
Genome Assembly ID MGYG000000313
Genome Size 1450348 bp
Genome Type MAG
Country Sweden
Continent Europe
Gene Location Start: 3527;  End: 5302  Strand: +
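
The gene coordinates can be cross-checked against the reported protein length. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive and that the interval includes the terminal stop codon; neither convention is stated in this record, and the function name is hypothetical.

```python
def cds_protein_length(start: int, end: int) -> int:
    """Protein length implied by a CDS interval.

    Assumes 1-based inclusive coordinates and a stop codon inside
    the interval (both are assumptions, not stated in the record).
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError(f"CDS length {nt_len} is not a multiple of 3")
    # Subtract one codon for the stop codon.
    return nt_len // 3 - 1


# Start and End taken from the Gene Location line of this record.
print(cds_protein_length(3527, 5302))
```

Under these assumptions the interval spans 1776 nt, or 592 codons, which gives 591 residues and agrees with the Protein Length field.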

Full Sequence

MSGMKSKDDR  VHDVKRRAAS  VLDRFASDAS  GLDHMSREEM  AAIAEGYINP  LRPFFSDACA  60
RVSLPGAGVS  YELDTASFEA  FARPLWAFAP  MWAGGDELAA  YAEVYRSGLI  AGTDPAHAEY  120
WGECRDYDQK  FVEMAAIAYA  LLLAPDILWE  PLPADAQERV  ATWLGQVNRY  EVWDNNWLFF  180
PTLVNLALRS  LGMPWSPAVL  KRCLDGIDAC  YRGDGWYTDG  PVGGPNANTD  YYNPFAFHFY  240
GLVYAVFARD  VDLERSEEFA  RRASAFEPDF  RRWFSSRGES  IAYGRSLTYR  FAQSAFYSMA  300
LLARAEGVPV  DVDPVIAKGI  VARNIVAWAS  LPCTDSSGAL  QVGYHYPNLH  MAEGYNAPGS  360
PLWACKAFAM  LAIQSCDPIW  ETPVSPLELA  DGVFPVVGGS  MLVRRDRGEA  TLYTGGRTRR  420
RRFTHCEEKY  CKFAYSTRWG  FSVSVSSYSL  KEAAPDSMLA  FEVDGMIRVR  SESEFVRLLD  480
GELESVWSPC  SGVHVTTRIR  PDAEGHVRTH  VVESSLVCRA  FDCGFAVPAE  SLEEAEGVCE  540
VLPAEDGIQG  EPFSFKAEPN  TNLMAPKTAI  HAVVYDIPVG  ISRIVTKVRE  R  591
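
The sequence is displayed in blocks of ten residues with a running position counter at the end of each line, so it needs cleaning before use in other tools. The following is a minimal sketch of one way to do that; the function name is hypothetical, and the example input is only the first two lines of the sequence above.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join a block-formatted protein sequence into one string.

    Drops spaces and trailing position counters, and checks each
    counter (when present) against the cumulative residue count.
    """
    sequence = ""
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        counter = re.search(r"(\d+)\s*$", line)
        # Keep letters only; this removes spaces and the counter.
        sequence += re.sub(r"[^A-Za-z]", "", line).upper()
        if counter and int(counter.group(1)) != len(sequence):
            raise ValueError(
                f"counter {counter.group(1)} does not match "
                f"cumulative length {len(sequence)}"
            )
    return sequence


# First two lines of the record, as an excerpt.
excerpt = """
MSGMKSKDDR  VHDVKRRAAS  VLDRFASDAS  GLDHMSREEM  AAIAEGYINP  LRPFFSDACA60
RVSLPGAGVS  YELDTASFEA  FARPLWAFAP  MWAGGDELAA  YAEVYRSGLI  AGTDPAHAEY120
"""
seq = parse_numbered_sequence(excerpt)
print(len(seq))
```

Applied to all ten lines, the same check should end at the final counter, 591, which is the reported protein length.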

Enzyme Prediction

No EC number is predicted for MGYG000000313_00094.

CAZyme Signature Domains

Family Start End E-value Family Coverage
GH154 37 383 9.5e-116 0.985632183908046

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam10022 DUF2264 2.01e-125 36 387 1 351
Uncharacterized protein conserved in bacteria (DUF2264). Members of this family of hypothetical bacterial proteins have no known function.
COG4289 COG4289 3.73e-98 50 489 41 456
Uncharacterized protein [Function unknown].

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QQQ91691.1 1.64e-160 48 588 20 547
QJU15850.1 1.64e-160 48 588 20 547
ANU78343.1 1.64e-160 48 588 20 547
ASU31151.1 1.64e-160 48 588 20 547
QBE95336.1 2.63e-159 36 588 8 547

PDB Hits

MGYG000000313_00094 has no PDB hit.

Swiss-Prot Hits

MGYG000000313_00094 has no Swiss-Prot hit.

SignalP and Lipop Annotations

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.999717 0.000323 0.000006 0.000000 0.000000 0.000000
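
The reported class can be related to the score row with a short sketch. It assumes the prediction is simply the highest-scoring class, which is consistent with this record (OTHER, score 0.999717) but is not stated as the tool's decision rule; the function name is hypothetical.

```python
def top_class(labels, scores):
    """Return the (label, score) pair with the highest score."""
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    return max(zip(labels, scores), key=lambda pair: pair[1])


# Column names and values copied from the table above.
labels = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
          "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
scores = [0.999717, 0.000323, 0.000006, 0.000000, 0.000000, 0.000000]

label, score = top_class(labels, scores)
print(label, score)
```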

TMHMM Annotations

There are no transmembrane helices in MGYG000000313_00094.