logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002356_03609

You are here: Home > Sequence: MGYG000002356_03609

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Bacillus_A wiedmannii
Lineage Bacteria; Firmicutes; Bacilli; Bacillales; Bacillaceae_G; Bacillus_A; Bacillus_A wiedmannii
CAZyme ID MGYG000002356_03609
CAZy Family CBM16
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2202 246221.62 6.6097
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002356 5414628 Isolate Netherlands Europe
Gene Location Start: 261537;  End: 268145  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002356_03609.

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
NF033679 DNRLRE_dom 1.43e-33 308 472 1 164
DNRLRE domain. The DNRLRE domain, with a length of about 160 amino acids, appears typically in large, repetitive surface proteins of bacteria and archaea, sometimes repeated several times. It occurs, notably, three times in the C-terminal region of the enzyme disaggregatase from the archaeal species Methanosarcina mazei, each time with the motif DNRLRE, for which the domain is named. Archaeal proteins within this family are described particularly well by the currently more narrowly defined Pfam model, PF06848. Note that the catalytic region of disaggregatase, in the N-terminal portion of the protein, is modeled by a different HMM, PF08480.
COG3209 RhsA 3.31e-27 1487 2098 39 690
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only].
TIGR03696 Rhs_assc_core 5.01e-22 1988 2071 1 77
RHS repeat-associated core domain. This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
COG3209 RhsA 9.27e-09 877 1119 41 273
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only].
COG3209 RhsA 2.95e-07 964 1120 1 134
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only].

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QGF42541.1 0.0 64 2127 50 2118
QRT48460.1 7.00e-36 1560 2082 1794 2336
QRT30109.1 5.88e-35 1612 2126 1699 2214
QNM08600.1 1.03e-34 1561 2091 1789 2336
ATW25292.1 1.48e-34 1798 2071 37 322

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7PQ5_L 9.97e-09 1848 2072 1098 1343
ChainL, Tre23 [Photorhabdus laumondii subsp. laumondii TTO1]
7Q97_A 1.47e-07 1852 2080 1062 1295
ChainA, Rhs family protein [Pseudomonas protegens Pf-5],7Q97_B Chain B, Rhs family protein [Pseudomonas protegens Pf-5]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
D4G3R4 0.0 63 2115 83 2239
tRNA(Glu)-specific nuclease WapA OS=Bacillus subtilis subsp. natto (strain BEST195) OX=645657 GN=wapA PE=1 SV=2
G4NYJ6 0.0 63 2127 70 2238
tRNA3(Ser)-specific nuclease WapA OS=Bacillus spizizenii (strain DSM 15029 / JCM 12233 / NBRC 101239 / NRRL B-23049 / TU-B-10) OX=1052585 GN=wapA PE=1 SV=1
Q07833 0.0 63 2127 70 2238
tRNA nuclease WapA OS=Bacillus subtilis (strain 168) OX=224308 GN=wapA PE=1 SV=2
P42018 8.17e-54 7 257 5 245
Wall-associated protein OS=Geobacillus stearothermophilus OX=1422 GN=wapA' PE=3 SV=1
P0DUH5 6.40e-11 1850 2102 1039 1327
Double-stranded DNA deaminase toxin A OS=Burkholderia cenocepacia (strain H111) OX=1055524 GN=dddA PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000227 0.999191 0.000144 0.000158 0.000143 0.000127

TMHMM  Annotations      download full data without filtering help

start end
2070 2092