logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000150_06131

You are here: Home > Sequence: MGYG000000150_06131

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Hungatella sp005845265
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Hungatella; Hungatella sp005845265
CAZyme ID MGYG000000150_06131
CAZy Family GH136
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1931 MGYG000000150_47|CGC1 211662.55 4.2557
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000150 7470188 Isolate United Kingdom Europe
Gene Location Start: 12324;  End: 18119  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000150_06131.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH136 81 618 1.4e-118 0.9918533604887984

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033838 PspC_subgroup_1 7.50e-26 1804 1911 481 585
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033840 PspC_relate_1 2.57e-25 1798 1928 499 646
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.
NF033930 pneumo_PspA 1.17e-23 1804 1911 438 542
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
COG5263 COG5263 1.51e-23 1806 1928 191 312
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033930 pneumo_PspA 4.17e-23 1811 1911 485 582
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CQR57267.1 5.10e-178 79 1202 39 1145
ASA23882.1 1.68e-175 78 1265 38 1212
QDH21977.1 2.64e-174 82 1204 53 1155
QFY07415.1 2.32e-159 76 1382 29 1262
BBC61707.1 2.50e-156 76 618 75 598

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7V6M_A 4.00e-39 75 618 4 576
ChainA, Fibronectin type III domain-containing protein [Tyzzerella nexilis]
5GQC_A 5.22e-28 78 618 16 596
Crystalstructure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_C Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_D Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_E Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_F Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_G Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQC_H Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, ligand-free form [Bifidobacterium longum subsp. longum],5GQF_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQF_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, lacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_A Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum],5GQG_B Crystal structure of lacto-N-biosidase LnbX from Bifidobacterium longum subsp. longum, galacto-N-biose complex [Bifidobacterium longum subsp. longum]
6KQS_A 1.65e-26 79 500 244 663
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - selenomethionine derivative [Eubacterium ramulus ATCC 29099]
6KQT_A 2.19e-26 79 500 244 663
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - native protein [Eubacterium ramulus ATCC 29099]
7V6I_A 1.93e-23 75 618 9 608
ChainA, Lacto-N-biosidase [Bifidobacterium saguini DSM 23967]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A0A401ETL2 3.61e-08 1698 1768 1381 1458
Exo-beta-1,6-galactobiohydrolase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=bl1,6Gal PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000320 0.998933 0.000212 0.000193 0.000185 0.000158

TMHMM  Annotations      download full data without filtering help

start end
13 30