logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002965_00128

You are here: Home > Sequence: MGYG000002965_00128

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UMGS599 sp900543705
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UMGS599; UMGS599 sp900543705
CAZyme ID MGYG000002965_00128
CAZy Family GH23
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1638 186492.7 5.3381
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002965 2046279 MAG United States North America
Gene Location Start: 107639;  End: 112555  Strand: -

Full Sequence      Download help

MRDITFQNEA  GFPINRVEKL  PRVREWLKAQ  EQASPLPEKE  KAPAVTSDRS  NYRITEDTLG60
VGGAKEKFKA  NMAAVNLLHD  LEIENRLATP  EEQKILAGYV  GWGGLSMAFD  ENNAAWANEF120
KELYVTLSPE  EYRAAKESTL  TAFYTPPIVI  KAMYEALDRL  GFSEGNILEP  SCGTGNFFGL180
LPDSMAKSKL  HGVEIDSLTG  RIAKQLYQKA  NIAIEGFEKT  KLPDDHFDVV  LGNVPFGDFR240
VNDSRYNAQN  FRIHDYFFAK  ALDKVRAGGV  MMFITSKGTM  DKASPEVRKY  IAQRAELLGA300
VRLPDNTFRA  NAGTEVTSDI  LILQKRDRVM  DIEPDWIHLD  TDENGITMNS  YFVQHPEMIM360
GDMVMESTRF  GKMDAACKAK  KDISLAELLH  EAVQNIQGEI  PEYEIDEISD  EQEEVIPADP420
SVKNFSYTLV  NGKVYFREND  VMTPANLSMT  AESRVKGLVE  IRDCVRKLIS  YQTEDYPEEL480
IRTEQENLNR  LYDAYTAKYG  LINSRGNYLA  FASDESYFLL  CSLEVLDDEG  NFKRKADMFS540
KRTIKPHRAV  TSVETASEAL  AISIGEKARV  DLPFMAELTG  KEQAEIIRDL  QGVIFKIPHS600
EPAKYVTADE  YLSGNVRNKL  VIAETEAKTD  PELIINAEAL  KKVIPKDLSA  GEISVRLGAT660
WIPQEDIQQF  VMELLTPSSY  AAGKIKVRYT  AMNGDWFIEN  KSSDYGNVKA  DSTFGTKRAS720
AYRIIEDTLN  LRDVRIFDYV  YDEHGNKKAV  FNHKETTAAQ  AKQEVIKQVF  QDWIWKDPER780
RNRLVRYYND  TFNSIRPREY  DGSHITFGGI  SPEIQLRPHQ  VNAIAHILYG  GNTLLAHKVG840
AGKTFEMVAA  AQESKRLGLC  QKSMFVVPNH  LVGQWASEYL  RLYPSANILV  SRKQDFETAN900
RKKFCGRIAT  GDYDAVIIGH  SQFEKVPMSL  ERQYNMLQEQ  LDDIERGIDD  IQKSHGEQFT960
VKQLMKTRKS  IQAKLDKLND  TKRKDTVIDF  EELGVDRLFI  DESHFYKNLY  LYTKMRNVGG1020
IAQTEAQKSS  DLFMKCRYLD  EITGNRGTIF  ATGTPVSNSM  VELYSVQRYL  QYDTLMQNGL1080
QHFDSWASTF  GETVTALELA  PEGTNYRAKT  RFAKFYNLPE  LMQMFREVAD  IQTADMLKLP1140
VPKVNYHNIK  TKPSEIQTEM  VANLAKRAEK  VRARLVEPNI  DNMLKITNDG  RKLALDQRMI1200
DPMLPDDPDS  KVNACVDNVY  RIWEEHADTK  ATQLVFCDLS  TPKNDGSFNI  YDDIRGKLTE1260
RGIPAEQVRF  IHEATTDAQK  KELFAKVRSG  EVRVLLGSTP  KMGAGTNVQD  RLIAIHNLDC1320
PWRPSDLEQR  QGRIERQGNM  FPEVEVYRYV  TEQTFDAYLY  QLVESKQKFI  SQIMTSKSPV1380
RSAEDVDEVA  LSFAEVKMLA  TGDERFKEKM  DLDIQVSKLR  VLKQSYLSEH  YDLEDRILQQ1440
FPKLIKEYEE  RITGYEADTA  LAEQHKPQGE  DKFCPMTLQG  VTYTEKADAG  QMLLALCKEN1500
PLSKPTEIGS  YRGFKLEVYY  DPVNSNYCLN  LCGKSRHKVE  LGSDAFGNLT  RIENELSRLP1560
AMLTAAKTRK  EEIVTQLENA  KVEVKKPFAF  EEEMKEKTDR  LNALNIELNL  NEKDPSAIDI1620
EPEQGNELSE  RKNKDLER1638

Enzyme Prediction      help

No EC number prediction in MGYG000002965_00128.

CDD Domains      download full data without filtering help

Created with Snap811632453274094915736557378199009821064114612281310139214741556348870COG464610111202COG4646143325N6_Mtase143325HsdM12521338HELICc
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
COG4646 COG4646 1.22e-75 348 870 1 540
Adenine-specific DNA methylase, N12 class [Replication, recombination and repair].
COG4646 COG4646 1.42e-31 1011 1202 439 637
Adenine-specific DNA methylase, N12 class [Replication, recombination and repair].
pfam02384 N6_Mtase 3.24e-09 143 325 28 234
N-6 DNA Methylase. Restriction-modification (R-M) systems protect a bacterial cell against invasion of foreign DNA by endonucleolytic cleavage of DNA that lacks a site specific modification. The R-M system is a complex containing three polypeptides: M (this family), S (pfam01420), and R. This family consists of N-6 adenine-specific DNA methylase EC:2.1.1.72 from Type I and Type IC restriction systems. These methylases have the same sequence specificity as their corresponding restriction enzymes.
COG0286 HsdM 4.13e-09 143 325 168 375
Type I restriction-modification system, DNA methylase subunit [Defense mechanisms].
smart00490 HELICc 7.71e-09 1252 1338 1 82
helicase superfamily c-terminal domain.

CAZyme Hits      help

Created with Snap811632453274094915736557378199009821064114612281310139214741556421577ASV45029.1|GH23421577QIW86704.1|GH23421577QIW86628.1|GH23431630AXF51455.1|GH23431630AEY69616.1|GH23
Hit ID E-Value Query Start Query End Hit Start Hit End
ASV45029.1 0.0 42 1577 1536 3160
QIW86704.1 0.0 42 1577 1604 3228
QIW86628.1 0.0 42 1577 1604 3228
AXF51455.1 0.0 43 1630 1773 3453
AEY69616.1 0.0 43 1630 1680 3360

PDB Hits      help

has no PDB hit.

Swiss-Prot Hits      download full data without filtering help

Created with Snap811632453274094915736557378199009821064114612281310139214741556441440sp|Q71TF8|DARB_BPP1143236sp|P52284|MTR1_PBCVX136274sp|P10835|MTC3_PBCVC
Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q71TF8 1.12e-44 44 1440 23 1616
Defense against restriction protein B OS=Escherichia phage P1 OX=2886926 GN=darB PE=3 SV=1
P52284 5.68e-06 143 236 26 115
Type II methyltransferase M.CvrRI OS=Paramecium bursaria Chlorella virus XZ-6E OX=36360 GN=CVIRIM PE=3 SV=1
P10835 9.87e-06 136 274 21 164
Modification methylase CviBIII OS=Paramecium bursaria Chlorella virus NC1A OX=46020 GN=CVIBIIIM PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000048 0.000000 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000002965_00128.