logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000152_02492

You are here: Home > Sequence: MGYG000000152_02492

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Lacrimispora sp902363835
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Lacrimispora; Lacrimispora sp902363835
CAZyme ID MGYG000000152_02492
CAZy Family GH136
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1571 MGYG000000152_10|CGC1 172703.16 4.4637
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000152 6260407 Isolate United Kingdom Europe
Gene Location Start: 73077;  End: 77792  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000152_02492.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH136 74 635 2.3e-122 0.9959266802443992

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033930 pneumo_PspA 5.73e-32 1424 1570 485 660
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033840 PspC_relate_1 5.04e-30 1424 1571 512 648
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.
NF033838 PspC_subgroup_1 1.56e-29 1424 1570 508 683
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
COG5263 COG5263 1.97e-26 1415 1570 187 313
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033679 DNRLRE_dom 5.35e-20 805 988 1 163
DNRLRE domain. The DNRLRE domain, with a length of about 160 amino acids, appears typically in large, repetitive surface proteins of bacteria and archaea, sometimes repeated several times. It occurs, notably, three times in the C-terminal region of the enzyme disaggregatase from the archaeal species Methanosarcina mazei, each time with the motif DNRLRE, for which the domain is named. Archaeal proteins within this family are described particularly well by the currently more narrowly defined Pfam model, PF06848. Note that the catalytic region of disaggregatase, in the N-terminal portion of the protein, is modeled by a different HMM, PF08480.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
SET83278.1 0.0 1 1571 1 1571
ADL05164.1 0.0 1 1571 1 1579
QRV20653.1 0.0 1 1571 1 1579
QTI51982.1 1.24e-153 70 1122 47 1128
BBD28515.1 2.38e-153 70 1122 47 1128

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7V6M_A 6.28e-32 73 638 7 579
ChainA, Fibronectin type III domain-containing protein [Tyzzerella nexilis]
6KQT_A 1.98e-21 27 415 188 599
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - native protein [Eubacterium ramulus ATCC 29099]
6KQS_A 1.81e-20 74 415 244 599
CrystalStructure of GH136 lacto-N-biosidase from Eubacterium ramulus - selenomethionine derivative [Eubacterium ramulus ATCC 29099]

Swiss-Prot Hits      help

has no Swissprot hit.

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000477 0.998758 0.000198 0.000209 0.000170 0.000148

TMHMM  Annotations      download full data without filtering help

start end
13 35