Basic Information
The basic information descibes the properties of CAZyme and its genome.
Some of them have the external links to other databases.
Genomic Context
Jbrowse2 uses the GFF3 file to display the genomic location of the gene and its neighboring genes on the chromosome.
Full sequence
We provide the full-length sequence of the protein and a download link.
Enzyme prediction
eCAMI annotates protein sequences with Enzyme Function classes (EC numbers).
CAZyme Signature Domains
These are CAZyme domains annotated by dbCAN.
CDD domain
RPS-BLAST was run with full-length CAZyme protein sequences as query and the NCBI CDD database as the database.
CDD is a protein annotation resource that contains well annotated sequence models.
E-value < 1e-2 was used to keep the CDD domain hits.
CAZyme Hits
We use the DIAMOND program to search against the CAZy annotated CAZyme sequences
PDB Hits
The Protein Data Bank protein sequences was downloaded and searched against with DIAMOND program.
E-value < 1e-5 was used to keep significant hits.
Swiss-Prot Hits
Swiss-Prot database was downloaded. E-value < 1e-5 was used to keep significant hits.
SignalP and LipoP Annotations
Signal peptide and lipoprotein were predicted using SignalP.
TMHMM annotation
Full-length sequences were taken to run TMHMMto predict the transmembrane regions.