CAZyme Information

Basic Information
SpeciesLinum usitatissimum
Cazyme IDLus10007212
FamilyGH17
Protein PropertiesLength: 519 Molecular Weight: 57087.2 Isoelectric Point: 6.0923
ChromosomeChromosome/Scaffold: 674 Start: 194886 End: 199754
DescriptionCysteine proteinases superfamily protein
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH17141263.6e-29
  LTGKLVSLYDTLQVSTAIRLDLLGTSYTPSAGEIADSASAFIVPIVQYLANDNTLLLANVYPYFAYIGNPRQVDLKYSNFGFLGQIVVQDGPYGYSNLFH
  AILDSLYATLEKF
Full Sequence
Protein Sequence     Length: 519     Download
MIHGIILMLF SLNLTGKLVS LYDTLQVSTA IRLDLLGTSY TPSAGEIADS ASAFIVPIVQ    60
YLANDNTLLL ANVYPYFAYI GNPRQVDLKY SNFGFLGQIV VQDGPYGYSN LFHAILDSLY    120
ATLEKFAALN VQSQIPPRII KLHPLPPTLP PKLHYPSSSS STMAPKLCLA ALFLLVSLFH    180
FQVSATEIKL NLGSRILQES IVDVVNGNPS AGWKAEISPR FSNYTVAEFK YILGAKPTPK    240
KELLGVPVMR HPKTLALPKE FDARKAWPQC STLTRILDQG HCGSCWAFGA VEALSDRFCI    300
QFGMNISLSA NDLLACCGFL CGEGCDGGYP ISAWRYFVQN GVVTEECDPY FDDIGCSHPG    360
CEPAFPTPKC SRKCVEKNQL WSEEKHYGVN AYRVSSSDVD NIMAEVYKNG PVEVAFTVYE    420
DFAHYKSGVY KHITGGEMGG HAVKLIGWGT TEDGEDYWLL ANQWNRSWGE NGYFRIRRGT    480
NECGIEEDVV AGMPSSRNIV KEVISNDATD KREAIAAA* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
cd02698Peptidase_C1A_CathepsinX4.0e-40257495253+
cd02621Peptidase_C1A_CathepsinC2.0e-49257494258+
cd02248Peptidase_C1A2.0e-60258490235+
pfam00112Peptidase_C13.0e-93257493239+
cd02620Peptidase_C1A_CathepsinB3.0e-130258492242+
Gene Ontology
GO TermDescription
GO:0004197cysteine-type endopeptidase activity
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
GO:0006508proteolysis
GO:0008234cysteine-type peptidase activity
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankACJ84734.101635073352unknown [Medicago truncatula]
RefSeqNP_563648.1016750716355cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
RefSeqXP_002301457.101635141357predicted protein [Populus trichocarpa]
RefSeqXP_002320244.101825146339predicted protein [Populus trichocarpa]
RefSeqXP_002515139.1017951021372cathepsin B, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3pbh_A019749610316A Chain A, Crystal Structure At 1.45- Resolution Of The Major Allergen Endo-Beta-1,3-Glucanase Of Banana As A Molecular Basis For The Latex-Fruit Syndrome
PDB2pbh_A019749610316A Chain A, Crystal Structure At 1.45- Resolution Of The Major Allergen Endo-Beta-1,3-Glucanase Of Banana As A Molecular Basis For The Latex-Fruit Syndrome
PDB1pbh_A019749610316A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At 3.2 Angstrom Resolution
PDB3ai8_A02554961255B Chain B, Cathepsin B In Complex With The Nitroxoline
PDB3ai8_B02554961255B Chain B, Cathepsin B In Complex With The Nitroxoline
Signal Peptide
Cleavage Site
20
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
GW8659542831994810
GW8650543031824840
GW8677482741824550
GW8648582981574540
GW8672992901574460
Orthologous Group
SpeciesID
Fragaria vescamrna20137.1-v1.0-hybrid
Linum usitatissimumLus10007208
Vitis viniferaGSVIVT01031544001.129.203GSVIVT01031544001.129.203
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny