logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000685_00626

You are here: Home > Sequence: MGYG000000685_00626

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species CAG-590 sp900552885
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-590; CAG-590 sp900552885
CAZyme ID MGYG000000685_00626
CAZy Family GH9
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1799 191235.96 4.6871
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000685 2863940 MAG Kazakhstan Asia
Gene Location Start: 54753;  End: 60152  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 348 772 5.8e-91 0.9976076555023924

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 9.64e-100 351 771 1 374
Glycosyl hydrolase family 9.
PLN02345 PLN02345 7.75e-50 352 775 1 459
endoglucanase
PLN02613 PLN02613 3.34e-49 348 791 26 495
endoglucanase
PLN02420 PLN02420 5.72e-43 346 785 39 518
endoglucanase
PLN02340 PLN02340 1.84e-42 346 775 28 494
endoglucanase

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CBK83841.1 3.58e-129 96 1089 112 1152
AFK82697.1 1.67e-122 96 811 112 812
QNL98526.1 3.06e-112 350 1066 214 1001
ADD61854.1 1.95e-111 343 896 188 739
QWT52133.1 5.51e-101 347 788 36 468

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2YIK_A 5.66e-95 347 775 38 513
ChainA, Endoglucanase [Acetivibrio thermocellus]
1IA6_A 3.23e-60 347 772 4 424
CrystalStructure Of The Cellulase Cel9m Of C. Cellulolyticum [Ruminiclostridium cellulolyticum],1IA7_A Crystal Structure Of The Cellulase Cel9m Of C. Cellulolyticium In Complex With Cellobiose [Ruminiclostridium cellulolyticum]
2XFG_A 7.41e-51 347 775 24 460
ChainA, ENDOGLUCANASE 1 [Acetivibrio thermocellus]
5GXX_A 1.41e-47 347 777 5 428
ChainA, Glucanase [Acetivibrio thermocellus],5GXX_B Chain B, Glucanase [Acetivibrio thermocellus],5GXY_A Chain A, Glucanase [Acetivibrio thermocellus],5GXY_B Chain B, Glucanase [Acetivibrio thermocellus],5GXZ_A Chain A, Glucanase [Acetivibrio thermocellus],5GXZ_B Chain B, Glucanase [Acetivibrio thermocellus]
5GY0_A 8.27e-47 347 777 5 428
ChainA, Glucanase [Acetivibrio thermocellus],5GY0_B Chain B, Glucanase [Acetivibrio thermocellus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q02934 3.98e-47 343 775 72 512
Endoglucanase 1 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celI PE=1 SV=2
Q5YLG1 1.19e-46 347 774 47 482
Endoglucanase A OS=Bacillus pumilus OX=1408 GN=eglA PE=1 SV=1
P26224 1.39e-44 347 796 30 486
Endoglucanase F OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celF PE=3 SV=1
P26221 1.90e-44 339 777 42 489
Endoglucanase E-4 OS=Thermobifida fusca OX=2021 GN=celD PE=1 SV=2
P28622 4.09e-44 347 813 28 502
Endoglucanase 4 OS=Bacillus sp. (strain KSM-522) OX=120046 PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000338 0.998951 0.000183 0.000196 0.000158 0.000141

TMHMM  Annotations      download full data without filtering help

start end
7 26