logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001404_03402

You are here: Home > Sequence: MGYG000001404_03402

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Paenibacillus_L senegalensis
Lineage Bacteria; Firmicutes; Bacilli; Paenibacillales; Paenibacillaceae; Paenibacillus_L; Paenibacillus_L senegalensis
CAZyme ID MGYG000001404_03402
CAZy Family GH163
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1527 MGYG000001404_4|CGC3 166797.62 4.1683
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001404 5495033 Isolate not provided not provided
Gene Location Start: 430553;  End: 435136  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001404_03402.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH163 483 755 2.9e-66 0.9920318725099602

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam16126 DUF4838 5.07e-44 476 755 2 263
Domain of unknown function (DUF4838). This family consists of several uncharacterized proteins found in various Bacteroides and Chloroflexus species. The function of this family is unknown.
cd09618 CBM9_like_2 7.27e-13 856 1009 22 175
DOMON-like type 9 carbohydrate binding module. Family 9 carbohydrate-binding modules (CBM9) play a role in the microbial degradation of cellulose and hemicellulose (materials found in plants). The domain has previously been called cellulose-binding domain. The polysaccharide binding sites of CBMs with available 3D structure have been found to be either flat surfaces with interactions formed by predominantly aromatic residues (tryptophan and tyrosine), or extended shallow grooves. CBM9 domains found in this uncharacterized subfamily are typically found at the N-terminus of longer proteins that lack additional annotation with domain footprints.
NF033190 inl_like_NEAT_1 1.04e-12 1346 1519 582 749
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
NF033190 inl_like_NEAT_1 3.74e-08 1340 1455 637 751
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
cd09620 CBM9_like_3 3.10e-06 856 1025 10 186
DOMON-like type 9 carbohydrate binding module. Family 9 carbohydrate-binding modules (CBM9) play a role in the microbial degradation of cellulose and hemicellulose (materials found in plants). The domain has previously been called cellulose-binding domain. The polysaccharide binding sites of CBMs with available 3D structure have been found to be either flat surfaces with interactions formed by predominantly aromatic residues (tryptophan and tyrosine), or extended shallow grooves. CBM9 domains found in this uncharacterized heterogeneous subfamily may co-occur with various other domains.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
ANY71258.1 0.0 8 1035 48 1068
QNK60258.1 3.40e-303 9 1033 427 1444
QJD84461.1 6.64e-302 9 1033 488 1661
QTH43592.1 1.25e-301 9 1033 470 1647
BBH18867.1 8.78e-294 8 1033 450 1634

PDB Hits      help

has no PDB hit.

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P38536 1.13e-30 1250 1522 1577 1855
Amylopullulanase OS=Thermoanaerobacterium thermosulfurigenes OX=33950 GN=amyB PE=3 SV=2
P38535 3.53e-25 1353 1522 914 1081
Exoglucanase XynX OS=Acetivibrio thermocellus OX=1515 GN=xynX PE=3 SV=1
C6CRV0 5.69e-21 1328 1521 1265 1458
Endo-1,4-beta-xylanase A OS=Paenibacillus sp. (strain JDR-2) OX=324057 GN=xynA1 PE=1 SV=1
P19424 5.75e-19 1346 1526 41 222
Endoglucanase OS=Bacillus sp. (strain KSM-635) OX=1415 PE=1 SV=1
P38537 6.72e-15 1354 1522 42 206
Surface-layer 125 kDa protein OS=Lysinibacillus sphaericus OX=1421 PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000062 0.000000 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000001404_03402.