CAZyme Information

Basic Information
Species: Gossypium raimondii
CAZyme ID: Gorai.010G216500.1
Family: GH2
Protein Properties: Length: 1115; Molecular Weight: 125726; Isoelectric Point: 5.8456
Chromosome: Chromosome/Scaffold: 10; Start: 58450609; End: 58461810
Description: glycoside hydrolase family 2 protein
Signature Domain
Family  Start  End  Evalue
GH2     88     980  0
  VKSLSGYWKFLLASNPTAVPKNFYESSFQDSDWETLPVPSNWQMHGYDRPIYTNVVYPFPLDPPHVPTDNPTGCYRTYFHIPKEWKGRRILLHFEAVDSA
  FCAWVNGVPIGYSQDSRLPAEFEITDYCYSCDSDKKNVLSVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFIADYFFKSNLADNFSYADIQLEVK
  IDCSRETPKDIVLTDFIIEAALYDAGSWYNCDGNVDLLSSNVANIELNRFPTQTLGFHGYMLEGKLENPKLWSAEHPNLYTLVIILKDASGKIVDCESCL
  VGIRQVSKAPKQLLVNGHPVVIRGVNRHEHHPRLGKTNIEACMVKDLVVMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHP
  TQEPSWAAAMMDRVIGMVERDKNHACIFSWSLGNEAGYGPNHSASAGWIRGRDPSRVVHYEGGGSRTPSTDIVCPMYMRVWDVVKIAKDPNESRPLILCE
  YSHAMGNSCGNIHEYWEAIDNIFGLQGGFIWDWVDQALLKDNGNGSKYWAYGGDFGDSPNDLNFCLNGITWPDRTPHPTLHEVKYVYQPIKVYLRESTVK
  IKNTNFYETTEGLVFEWAVLGDGCELGCGILSLPVIEPQSSYDIEWKSGPWYPLGASSDAEEIFLTITTKLLHSKRWVEVGHVVSSTQVQLPSKRDIVPH
  IIKTKDDVLSTEILGDNIIISQSKLWEITFNTKTGSLDSWKVEGVPIMKNGLFPCFWRAPTDNDKGGGPSSYQTKWKAACIDEIVFLTESCSIQNKTDNV
  VKIAVVYLGFIKGEDGTLDESKKASALFKVDMLYTIHASGDIVIESNVKPSSGLPPLPRVGVEFHLEKSVDQVKWYGRGPFECYPDRKAAAHV
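
The Start and End values in the Signature Domain table appear to be 1-based, inclusive positions in the full protein sequence: the domain listing begins with the residues found at position 88 of the sequence under Full Sequence. A minimal Python sketch of that mapping follows, assuming this coordinate convention. The helper name `extract_region` and the variable names are hypothetical, chosen only for illustration.

```python
def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of sequence (1-based, inclusive coordinates)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Residues 81-100 of the full protein sequence, copied from this record.
fragment = "WVKGLPFVKSLSGYWKFLLA"
fragment_offset = 81  # sequence position of the fragment's first residue

# Domain Start = 88 in the table; shift it into fragment coordinates.
local_start = 88 - fragment_offset + 1
domain_head = extract_region(fragment, local_start, len(fragment))

# Number of residues spanned by an inclusive Start/End pair.
domain_length = 980 - 88 + 1

print(domain_head)
print(domain_length)
```

With the whole protein sequence in a single string, `extract_region(full_sequence, 88, 980)` would return the complete domain under the same assumption.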
Full Sequence
Protein Sequence     Length: 1115
MASLIVSQLG FPSENGYKVW EDQSFIKWRK RDPHVTLHCH ESVEGSLKYW YERNKVDLSV    60
SKSAVWNDDA VQSALESAAF WVKGLPFVKS LSGYWKFLLA SNPTAVPKNF YESSFQDSDW    120
ETLPVPSNWQ MHGYDRPIYT NVVYPFPLDP PHVPTDNPTG CYRTYFHIPK EWKGRRILLH    180
FEAVDSAFCA WVNGVPIGYS QDSRLPAEFE ITDYCYSCDS DKKNVLSVQV FRWSDGSYLE    240
DQDHWWLSGI HRDVLLLSKP QVFIADYFFK SNLADNFSYA DIQLEVKIDC SRETPKDIVL    300
TDFIIEAALY DAGSWYNCDG NVDLLSSNVA NIELNRFPTQ TLGFHGYMLE GKLENPKLWS    360
AEHPNLYTLV IILKDASGKI VDCESCLVGI RQVSKAPKQL LVNGHPVVIR GVNRHEHHPR    420
LGKTNIEACM VKDLVVMKQN NINAVRNSHY PQHPRWYELC DLFGMYMIDE ANIETHGFDL    480
SGHLKHPTQE PSWAAAMMDR VIGMVERDKN HACIFSWSLG NEAGYGPNHS ASAGWIRGRD    540
PSRVVHYEGG GSRTPSTDIV CPMYMRVWDV VKIAKDPNES RPLILCEYSH AMGNSCGNIH    600
EYWEAIDNIF GLQGGFIWDW VDQALLKDNG NGSKYWAYGG DFGDSPNDLN FCLNGITWPD    660
RTPHPTLHEV KYVYQPIKVY LRESTVKIKN TNFYETTEGL VFEWAVLGDG CELGCGILSL    720
PVIEPQSSYD IEWKSGPWYP LGASSDAEEI FLTITTKLLH SKRWVEVGHV VSSTQVQLPS    780
KRDIVPHIIK TKDDVLSTEI LGDNIIISQS KLWEITFNTK TGSLDSWKVE GVPIMKNGLF    840
PCFWRAPTDN DKGGGPSSYQ TKWKAACIDE IVFLTESCSI QNKTDNVVKI AVVYLGFIKG    900
EDGTLDESKK ASALFKVDML YTIHASGDIV IESNVKPSSG LPPLPRVGVE FHLEKSVDQV    960
KWYGRGPFEC YPDRKAAAHV GVYEQSIEGM HVPYIVPGES GGRADVRWVT FQNKDGCGIY    1020
ASTYGKSPPM QLNASYFSTA ELDRAVRNEE LIKGDTIEVH LDHKHMGIGG DDSWTPSVHE    1080
NYLVPAVPYS FSIRLCPVTS ATSGQNLYRS QLQN* 
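
The sequence above is laid out in blocks of ten residues with a running position counter at the end of each line. A minimal Python sketch for turning such lines into a plain residue string follows. The function name `clean_sequence` is hypothetical, and treating the trailing `*` as a stop marker to be dropped is an assumption rather than something this record states.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip position counters, whitespace, and a trailing '*' from formatted lines."""
    # Keep only letters and '*'; this removes spaces, newlines, and digits.
    residues = re.sub(r"[^A-Za-z*]", "", block)
    # Assumption: a trailing '*' marks the stop position and is not a residue.
    return residues.rstrip("*").upper()


# First and last lines of the sequence block, copied from this record.
first_line = "MASLIVSQLG FPSENGYKVW EDQSFIKWRK RDPHVTLHCH ESVEGSLKYW YERNKVDLSV    60"
last_line = "NYLVPAVPYS FSIRLCPVTS ATSGQNLYRS QLQN* "

cleaned_first = clean_sequence(first_line)
cleaned_last = clean_sequence(last_line)

print(len(cleaned_first), cleaned_first[:10])
print(len(cleaned_last), cleaned_last)
```

Passing all sequence lines joined with newlines to `clean_sequence` yields one continuous string suitable for writing to a FASTA file.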
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     1.0e-92   813    1095  284
pfam02836   Glyco_hydro_2_C  1.0e-107  397    678   292
COG3250     LacZ             5.0e-164  88     978   896
PRK09525    lacZ             0         86     1097  1051
PRK10340    ebgA             0         89     1098  1014
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005975  carbohydrate metabolic process
GO:0009341  beta-galactosidase complex
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
RefSeq  XP_002266400.1  0        1            1114       1          1114     PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeq  XP_002266434.1  0        1            1114       1          1115     PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeq  XP_002299206.1  0        1            1114       1          1110     predicted protein [Populus trichocarpa]
RefSeq  XP_002303929.1  0        1            1114       1          1113     predicted protein [Populus trichocarpa]
RefSeq  XP_002513059.1  0        1            1114       1          1110     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1f4h_D  0        88           1096       49         1019     A Chain A, Crystal Structure Of Puromycin Hydrolase S511a Mutant
PDB     1f4h_C  0        88           1096       49         1019     A Chain A, Crystal Structure Of Puromycin Hydrolase S511a Mutant
PDB     1f4h_B  0        88           1096       49         1019     A Chain A, Crystal Structure Of Puromycin Hydrolase S511a Mutant
PDB     1f4h_A  0        88           1096       49         1019     A Chain A, Crystal Structure Of Puromycin Hydrolase S511a Mutant
PDB     1f4a_D  0        88           1096       49         1019     A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
Hydropathy
EST
Hit       Length  Start  End   EValue
ES794486  406     452    852   0
DW517824  301     762    1062  0
CO122111  294     651    944   0
HO804274  385     130    511   0
HO804274  50      502    550   0.00000001
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.2, AT3G54440.3
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1, Cucsa.307940.12
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Oryza sativa                LOC_Os01g72340.1
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2, Potri.001G027400.1
Prunus persica              ppa000508m, ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny