logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002949_01018

You are here: Home > Sequence: MGYG000002949_01018

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UMGS1251 sp900549995
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; UMGS1251; UMGS1251 sp900549995
CAZyme ID MGYG000002949_01018
CAZy Family GH136
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2700 MGYG000002949_9|CGC2 291474.08 4.2068
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002949 4765419 MAG United States North America
Gene Location Start: 30259;  End: 38361  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002949_01018.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH136 59 616 5.3e-153 0.9918533604887984

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00754 F5_F8_type_C 2.17e-08 1060 1166 1 123
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam11721 Malectin 2.07e-07 905 1031 4 138
Di-glucose binding within endoplasmic reticulum. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognizes and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies has shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan.
pfam13229 Beta_helix 1.80e-06 262 429 9 156
Right handed beta helix region. This region contains a parallel beta helix region that shares some similarity with Pectate lyases.
pfam11721 Malectin 2.27e-06 758 892 6 164
Di-glucose binding within endoplasmic reticulum. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognizes and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies has shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan.
COG5281 COG5281 1.23e-05 2219 2460 458 665
Phage-related minor tail protein [Mobilome: prophages, transposons].

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AIY85170.1 0.0 51 1906 47 1900
AQM59543.1 0.0 51 1906 47 1900
ASU31098.1 0.0 1 1728 1 1775
ANU78288.1 0.0 1 1728 1 1775
QQQ91631.1 0.0 1 1728 1 1775

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7V6M_A 6.60e-52 58 618 8 577
ChainA, Fibronectin type III domain-containing protein [Tyzzerella nexilis]
5GQC_A 9.44e-42 58 618 17 597
Crystalstructure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_C Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_D Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_E Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_F Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_G Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_H Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQF_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQF_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum]
7V6I_A 3.69e-41 58 618 13 609
ChainA, Lacto-N-biosidase [Bifidobacterium saguini DSM 23967]
6KQS_A 3.96e-29 58 457 244 638
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - selenomethionine derivative [Eubacterium ramulus ATCC 29099]
6KQT_A 3.96e-29 58 457 244 638
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - native protein [Eubacterium ramulus ATCC 29099]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
E8MGH9 3.86e-11 2261 2453 1674 1871
Beta-L-arabinobiosidase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=hypBA2 PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000299 0.998892 0.000206 0.000219 0.000174 0.000158

TMHMM  Annotations      download full data without filtering help

start end
7 29
2679 2696