You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001823_00120


Basic Information

Species UMGS1603 sp900553265
Lineage Bacteria; Firmicutes_A; Clostridia_A; Christensenellales; CAG-74; UMGS1603; UMGS1603 sp900553265
CAZyme ID MGYG000001823_00120
CAZy Family GH123
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 548
CGC (no value listed)
Molecular Weight 62695.5
Isoelectric Point 5.1205
Genome Property
Genome Assembly ID MGYG000001823
Genome Size 2784400
Genome Type MAG
Country Denmark
Continent Europe
Gene Location Start: 141008;  End: 142654  Strand: -
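
As a quick consistency check, the gene coordinates can be compared against the reported protein length. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; neither assumption is stated on this page, and the function name is hypothetical.

```python
def protein_length_from_span(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a 1-based, inclusive gene span.

    Assumption: the span is a single uninterrupted coding region.
    """
    span_nt = end - start + 1  # inclusive coordinates
    if span_nt % 3 != 0:
        raise ValueError(f"span of {span_nt} nt is not a whole number of codons")
    codons = span_nt // 3
    # Drop one codon when the span is assumed to include the stop codon.
    return codons - 1 if includes_stop else codons


print(protein_length_from_span(141008, 142654))  # 548
```

Under these assumptions the 1647-nt span corresponds to 549 codons, which matches the listed protein length of 548 plus a stop codon.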

Full Sequence

MLEVRVLSPV  AKVFPDYAPQ  PCTPRFCGLR  NEVISFQLAF  RSAEPLNPDK  PYLRLEIDSP  60
VKARIHARRV  KYLPVRFPQM  PNVDDNYLRL  GRPGLYPDAL  TEIPPHGIRA  FPTSWESLWI  120
DFEPDGMDAG  EYPAALRLID  EATEQVVGEA  EVSLRVLNAF  LPRQTLIHTK  WFHSDCLATY  180
YGVEIFSEEY  WRITENFVRC  AVRHGINTIL  TPIHTPPLDT  RVGSYRPTVQ  LVDVVRENGG  240
YRFGFDRLRR  WVDMCKRCGV  EYYEIAHLFS  QWGAKYAPQI  CLTTEHGFER  LFGWDTEALG  300
EEYTGFVRAY  LPAVLAEMKA  LGVDDKCIFH  ISDEPSREHL  EGYLAAKAVV  GPILKGYKII  360
DALSDASFYD  SGAVEHPVPA  TNHIEPFLER  RVPGLWTYYC  IGQGKDVSNL  FFAMPGARTR  420
VLGAQLYKFE  IEGFLQWGFN  FYYSQGSDYP  INPWLDTDCD  GFTPAGDAYQ  VYPSPGGQPV  480
ESMRLMLVDQ  ALQDLRAMQL  LESLSDRETV  LRLIDEDVPP  IRFASYPHED  AWVLGLRERI  540
NREIEKRI  548
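
The displayed sequence is grouped in blocks of ten residues with a running position counter at the end of each line, so it needs cleaning before use in other tools. A minimal sketch, assuming the only non-residue characters are whitespace and digits (the helper name is hypothetical); for brevity it is applied to the first and last display lines only.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip whitespace and digit position counters from a displayed sequence block."""
    return re.sub(r"[\s\d]+", "", block)


# First and last display lines of the sequence; the counters may be fused
# to the residues or separated by spaces, and both cases are handled.
sample = """
MLEVRVLSPV  AKVFPDYAPQ  PCTPRFCGLR  NEVISFQLAF  RSAEPLNPDK  PYLRLEIDSP60
NREIEKRI548
"""

seq = clean_sequence(sample)
print(len(seq))  # 68
```

Applied to the full block, the cleaned string should have 548 characters if the page's protein length is consistent with the sequence shown.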

Enzyme Prediction

No EC number is predicted for MGYG000001823_00120.

CAZyme Signature Domains

Family  Start  End  E-value  Family coverage
GH123   95     506  9e-25    0.7267657992565055
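
The signature-domain row can be screened programmatically. The sketch below assumes cutoffs of E-value < 1e-15 and coverage > 0.35, the values commonly quoted as dbCAN HMMER defaults; this page does not state which cutoffs were applied, and the function name is hypothetical.

```python
def passes_cutoffs(evalue: float, coverage: float,
                   max_evalue: float = 1e-15, min_coverage: float = 0.35) -> bool:
    """Return True when a domain hit satisfies both assumed cutoffs."""
    return evalue < max_evalue and coverage > min_coverage


# Values copied from the signature-domain row above.
hit = {"family": "GH123", "start": 95, "end": 506,
       "evalue": 9e-25, "coverage": 0.7267657992565055}

# Number of query residues spanned by the domain (inclusive coordinates).
domain_len = hit["end"] - hit["start"] + 1
print(domain_len)  # 412
print(passes_cutoffs(hit["evalue"], hit["coverage"]))  # True
```

The GH123 domain spans 412 of the 548 residues and passes both assumed cutoffs.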

CDD Domains

Cdd ID     Domain   E-Value   qStart  qEnd  sStart  sEnd
pfam13320  DUF4091  1.80e-21  431     502   2       66
Domain Description: Domain of unknown function (DUF4091). This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 70 amino acids in length. There is a single completely conserved residue G that may be functionally important.

CAZyme Hits

Hit ID      E-Value    Query Start  Query End  Hit Start  Hit End
AUS97906.1  1.21e-184  2            548        10         553
ALS26464.1  3.00e-184  2            542        5          542
ANY75033.1  2.49e-181  3            546        8          549
AZN39976.1  3.05e-180  3            548        9          554
QHW32778.1  1.63e-179  2            547        8          552
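
To judge how much of the query each hit covers, the rows above can be parsed and the aligned query span divided by the protein length (548). This is a minimal sketch; the function names are hypothetical and the coordinates are assumed to be 1-based and inclusive.

```python
PROTEIN_LENGTH = 548  # protein length listed under CAZyme Property

# Data rows of the CAZyme hits table, whitespace-separated.
hits_text = """\
AUS97906.1 1.21e-184 2 548 10 553
ALS26464.1 3.00e-184 2 542 5 542
ANY75033.1 2.49e-181 3 546 8 549
AZN39976.1 3.05e-180 3 548 9 554
QHW32778.1 1.63e-179 2 547 8 552
"""


def parse_hits(text: str) -> list[dict]:
    """Parse whitespace-separated hit rows into dictionaries."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        hit_id, evalue, q_start, q_end, h_start, h_end = line.split()
        rows.append({
            "hit_id": hit_id,
            "evalue": float(evalue),
            "q_start": int(q_start), "q_end": int(q_end),
            "h_start": int(h_start), "h_end": int(h_end),
        })
    return rows


def query_coverage(row: dict, length: int = PROTEIN_LENGTH) -> float:
    """Fraction of the query covered by the alignment (inclusive coordinates)."""
    return (row["q_end"] - row["q_start"] + 1) / length


rows = parse_hits(hits_text)
for row in rows:
    print(row["hit_id"], f"{query_coverage(row):.3f}")
```

All five hits align across nearly the full length of the query.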

PDB Hits

MGYG000001823_00120 has no PDB hit.

Swiss-Prot Hits

MGYG000001823_00120 has no Swiss-Prot hit.

SignalP and Lipop Annotations

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000055 0.000001 0.000000 0.000000 0.000000 0.000000
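
The predicted class is the label with the highest score in the row above. A minimal sketch of that selection follows, with a hypothetical function name; the scores are copied as reported, including the value slightly above 1 for "Other".

```python
labels = "Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII".split()
scores = [float(x) for x in
          "1.000055 0.000001 0.000000 0.000000 0.000000 0.000000".split()]


def top_class(labels: list[str], scores: list[float]) -> tuple[str, float]:
    """Return the (label, score) pair with the highest score."""
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    return max(zip(labels, scores), key=lambda pair: pair[1])


print(top_class(labels, scores))  # ('Other', 1.000055)
```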

TMHMM Annotations

There are no transmembrane helices in MGYG000001823_00120.