logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002414_03044

You are here: Home > Sequence: MGYG000002414_03044

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Paenibacillus odorifer
Lineage Bacteria; Firmicutes; Bacilli; Paenibacillales; Paenibacillaceae; Paenibacillus; Paenibacillus odorifer
CAZyme ID MGYG000002414_03044
CAZy Family GH136
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1641 MGYG000002414_1|CGC64 175677.05 4.5691
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002414 6802552 Isolate South Korea Asia
Gene Location Start: 3439661;  End: 3444586  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002414_03044.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH136 328 816 9.2e-124 0.9898167006109979
CBM9 968 1117 8e-18 0.7857142857142857

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd09621 CBM9_like_5 9.82e-51 965 1130 21 188
DOMON-like type 9 carbohydrate binding module. Family 9 carbohydrate-binding modules (CBM9) play a role in the microbial degradation of cellulose and hemicellulose (materials found in plants). The domain has previously been called cellulose-binding domain. The polysaccharide binding sites of CBMs with available 3D structure have been found to be either flat surfaces with interactions formed by predominantly aromatic residues (tryptophan and tyrosine), or extended shallow grooves. CBM9 domains found in this uncharacterized heterogeneous subfamily are often located at the C-terminus of longer proteins and may co-occur with various other functional domains such as glycosyl hydrolases. The CBM9 module in these architectures may be involved in binding to carbohydrates.
pfam12733 Cadherin-like 2.73e-13 1160 1227 10 88
Cadherin-like beta sandwich domain. This domain is found in several bacterial, metazoan and chlorophyte algal proteins. A profile-profile comparison recovered the cadherin domain and a comparison of the predicted structure of this domain with the crystal structure of the cadherin showed a congruent seven stranded secondary structure. The domain is widespread in bacteria and seen in the firmicutes, actinobacteria, certain proteobacteria, bacteroides and chlamydiae with an expansion in Clostridium. In contrast, it is limited in its distribution in eukaryotes suggesting that it was derived through lateral transfer from bacteria. In prokaryotes, this domain is widely fused to other domains such as FNIII (Fibronectin Type III), TIG, SLH (S-layer homology), discoidin, cell-wall-binding repeat domain and alpha-amylase-like glycohydrolases. These associations are suggestive of a carbohydrate-binding function for this cadherin-like domain. In animal proteins it is associated with an ATP-grasp domain.
cd09619 CBM9_like_4 3.92e-13 967 1132 31 187
DOMON-like type 9 carbohydrate binding module. Family 9 carbohydrate-binding modules (CBM9) play a role in the microbial degradation of cellulose and hemicellulose (materials found in plants). The domain has previously been called cellulose-binding domain. The polysaccharide binding sites of CBMs with available 3D structure have been found to be either flat surfaces with interactions formed by predominantly aromatic residues (tryptophan and tyrosine), or extended shallow grooves. CBM9 domains found in this uncharacterized heterogeneous subfamily are often located at the C-terminus of longer proteins and may co-occur with various other domains.
cd00241 DOMON_like 4.05e-12 957 1112 3 158
Domon-like ligand-binding domains. DOMON-like domains can be found in all three kindgoms of life and are a diverse group of ligand binding domains that have been shown to interact with sugars and hemes. DOMON domains were initially thought to confer protein-protein interactions. They were subsequently found as a heme-binding motif in cellobiose dehydrogenase, an extracellular fungal oxidoreductase that degrades both lignin and cellulose, and in ethylbenzene dehydrogenase, an enzyme that aids in the anaerobic degradation of hydrocarbons. The domain interacts with sugars in the type 9 carbohydrate binding modules (CBM9), which are present in a variety of glycosyl hydrolases, and it can also be found at the N-terminus of sensor histidine kinases.
NF033190 inl_like_NEAT_1 2.14e-10 1512 1641 570 693
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AWV33845.1 0.0 1 1641 1 1641
AIQ24061.1 0.0 1 1641 1 1640
BBH22310.1 0.0 1 1639 1 1534
QTH41501.1 0.0 1 1131 2 1128
BCA49028.1 4.13e-179 329 852 43 556

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7V6M_A 4.47e-59 329 822 11 581
ChainA, Fibronectin type III domain-containing protein [Tyzzerella nexilis]
5GQC_A 5.49e-39 329 815 20 594
Crystalstructure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_C Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_D Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_E Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_F Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_G Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_H Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQF_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQF_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum]
7V6I_A 4.97e-38 329 815 16 606
ChainA, Lacto-N-biosidase [Bifidobacterium saguini DSM 23967]
6KQT_A 4.82e-29 329 687 247 638
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - native protein [Eubacterium ramulus ATCC 29099]
6KQS_A 1.12e-28 329 687 247 638
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - selenomethionine derivative [Eubacterium ramulus ATCC 29099]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
C6CRV0 9.47e-36 1311 1641 1125 1461
Endo-1,4-beta-xylanase A OS=Paenibacillus sp. (strain JDR-2) OX=324057 GN=xynA1 PE=1 SV=1
P38536 2.42e-25 1332 1640 1558 1856
Amylopullulanase OS=Thermoanaerobacterium thermosulfurigenes OX=33950 GN=amyB PE=3 SV=2
P38535 6.05e-22 1443 1640 887 1082
Exoglucanase XynX OS=Acetivibrio thermocellus OX=1515 GN=xynX PE=3 SV=1
P19424 2.52e-16 1448 1635 16 210
Endoglucanase OS=Bacillus sp. (strain KSM-635) OX=1415 PE=1 SV=1
P38537 9.67e-15 1432 1640 1 207
Surface-layer 125 kDa protein OS=Lysinibacillus sphaericus OX=1421 PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000239 0.999063 0.000204 0.000183 0.000162 0.000141

TMHMM  Annotations      download full data without filtering help

start end
5 27