logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000160_00806

You are here: Home > Sequence: MGYG000000160_00806

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Collinsella sp003479805
Lineage Bacteria; Actinobacteriota; Coriobacteriia; Coriobacteriales; Coriobacteriaceae; Collinsella; Collinsella sp003479805
CAZyme ID MGYG000000160_00806
CAZy Family GH170
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
361 MGYG000000160_5|CGC1 40208.64 4.5389
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000160 2098512 Isolate China Asia
Gene Location Start: 53849;  End: 54934  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000160_00806.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH170 3 358 3.5e-98 0.9885714285714285

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
COG3589 COG3589 1.73e-104 1 359 2 356
Uncharacterized protein [Function unknown].
pfam19200 DUF871_N 5.14e-102 4 239 2 235
DUF871 N-terminal domain. This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown.
pfam05913 DUF871 1.04e-34 252 357 8 113
Bacterial protein of unknown function (DUF871). This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown.
cd00551 AmyAc_family 0.005 57 97 85 117
Alpha amylase catalytic domain family. The Alpha-amylase family comprises the largest family of glycoside hydrolases (GH), with the majority of enzymes acting on starch, glycogen, and related oligo- and polysaccharides. These proteins catalyze the transformation of alpha-1,4 and alpha-1,6 glucosidic linkages with retention of the anomeric center. The protein is described as having 3 domains: A, B, C. A is a (beta/alpha) 8-barrel; B is a loop between the beta 3 strand and alpha 3 helix of A; and C is the C-terminal extension characterized by a Greek key. The majority of the enzymes have an active site cleft found between domains A and B where a triad of catalytic residues (Asp, Glu and Asp) performs catalysis. Other members of this family have lost this catalytic activity as in the case of the human 4F2hc, or only have 2 residues that serve as the catalytic nucleophile and the acid/base, such as Thermus A4 beta-galactosidase with 2 Glu residues (GH42) and human alpha-galactosidase with 2 Asp residues (GH31). The family members are quite extensive and include: alpha amylase, maltosyltransferase, cyclodextrin glycotransferase, maltogenic amylase, neopullulanase, isoamylase, 1,4-alpha-D-glucan maltotetrahydrolase, 4-alpha-glucotransferase, oligo-1,6-glucosidase, amylosucrase, sucrose phosphorylase, and amylomaltase.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
BCT45938.1 1.03e-124 4 360 5 360
QNM11117.1 1.46e-124 4 360 5 360
QIX10458.1 9.61e-123 4 360 5 360
ASU20682.1 1.93e-122 4 360 5 360
ANU70831.1 1.93e-122 4 360 5 360

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1X7F_A 3.48e-112 2 357 28 381
Crystalstructure of an uncharacterized B. cereus protein [Bacillus cereus ATCC 14579]
2P0O_A 6.65e-29 3 359 5 357
Crystalstructure of a conserved protein from locus EF_2437 in Enterococcus faecalis with an unknown function [Enterococcus faecalis V583]

Swiss-Prot Hits      help

has no Swissprot hit.

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000035 0.000000 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000160_00806.