logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000140_00294

You are here: Home > Sequence: MGYG000000140_00294

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UMGS1375 sp900066615
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; UMGS1375; UMGS1375 sp900066615
CAZyme ID MGYG000000140_00294
CAZy Family CBM32
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1732 MGYG000000140_1|CGC6 189844.96 5.8808
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000140 3539979 Isolate United Kingdom Europe
Gene Location Start: 371966;  End: 377164  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000140_00294.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM32 112 248 1.7e-17 0.9596774193548387

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00754 F5_F8_type_C 1.37e-18 110 248 1 127
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam09479 Flg_new 2.51e-15 1378 1441 1 65
Listeria-Bacteroides repeat domain (List_Bact_rpt). This model describes a conserved core region of about 43 residues, which occurs in at least two families of tandem repeats. These include 78-residue repeats which occur from 2 to 15 times in some proteins of Bacteroides forsythus ATCC 43037, and 70-residue repeats found in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few other bacteria.
cd00057 FA58C 7.91e-11 98 250 1 139
Substituted updates: Jan 31, 2002
pfam13306 LRR_5 1.75e-10 1642 1698 1 55
Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
sd00036 LRR_3 2.04e-10 1640 1698 24 81
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QUT26415.1 2.66e-253 257 1131 86 914
ALG48827.1 1.87e-250 1 1144 3 1182
SQG39183.1 1.01e-249 1 1144 3 1182
AMN35790.1 1.97e-249 1 1144 3 1182
AXH52486.1 2.76e-249 1 1144 3 1182

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1K3I_A 1.92e-14 83 258 8 170
CrystalStructure of the Precursor of Galactose Oxidase [Fusarium sp.]
2WQ8_A 3.34e-14 90 258 17 175
ChainA, GALACTOSE OXIDASE [Fusarium graminearum]
2EIB_A 7.35e-14 104 258 9 153
ChainA, Galactose oxidase [Fusarium graminearum]
1GOF_A 7.35e-14 104 258 9 153
NOVELTHIOETHER BOND REVEALED BY A 1.7 ANGSTROMS CRYSTAL STRUCTURE OF GALACTOSE OXIDASE [Hypomyces rosellus],1GOG_A NOVEL THIOETHER BOND REVEALED BY A 1.7 ANGSTROMS CRYSTAL STRUCTURE OF GALACTOSE OXIDASE [Hypomyces rosellus],1GOH_A NOVEL THIOETHER BOND REVEALED BY A 1.7 ANGSTROMS CRYSTAL STRUCTURE OF GALACTOSE OXIDASE [Hypomyces rosellus],2EIE_A Chain A, Galactose oxidase [Fusarium graminearum],2JKX_A Chain A, GALACTOSE OXIDASE [Fusarium graminearum],2VZ1_A Chain A, GALACTOSE OXIDASE [Fusarium graminearum],2VZ3_A Chain A, Galactose Oxidase [Fusarium graminearum]
2EIC_A 7.35e-14 104 258 9 153
ChainA, Galactose oxidase [Fusarium graminearum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
I1S2N3 8.29e-14 83 258 32 194
Galactose oxidase OS=Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) OX=229533 GN=GAOA PE=3 SV=1
P0CS93 1.09e-13 83 258 32 194
Galactose oxidase OS=Gibberella zeae OX=5518 GN=GAOA PE=1 SV=1
P0DTR4 1.91e-10 104 251 509 644
A type blood N-acetyl-alpha-D-galactosamine deacetylase OS=Flavonifractor plautii OX=292800 PE=1 SV=1
Q02834 2.60e-09 102 250 502 643
Sialidase OS=Micromonospora viridifaciens OX=1881 GN=nedA PE=1 SV=1
Q0TR53 4.27e-09 111 305 631 814
O-GlcNAcase NagJ OS=Clostridium perfringens (strain ATCC 13124 / DSM 756 / JCM 1290 / NCIMB 6125 / NCTC 8237 / Type A) OX=195103 GN=nagJ PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000418 0.998715 0.000213 0.000255 0.000225 0.000189

TMHMM  Annotations      download full data without filtering help

start end
9 31