CAZyme Information

Basic Information
Species: Medicago truncatula
Cazyme ID: AC148471_30.1
Family: GH109
Protein Properties: Length: 1188; Molecular Weight: 129595; Isoelectric Point: 9.4438
Chromosome: Chromosome/Scaffold: 14847115; Start: 83648; End: 87477
Description
External Links
NCBI Taxonomy
Plaza
CAZyDB
Signature Domain
Family  Start  End  Evalue
GH109   580    695  6.7e-23
  VGVGLIGVGNRGSVLLQNLLQIPGVDVRAVCDVDGGRLERALKTVEAAGKKRPEGSTDGTWKELLAKDGLDAIVSAIPCDLHAHCYLDMIAAGKDLYGEK
  PMCLNRADLDALAKAA
Full Sequence
Protein Sequence     Length: 1188
MRNLLPHRLL HLSLCCAAVG IAAGAEAAGS NAAQAAKKPN ILVIWGDDIG SWNISHNNRG    60
MMGYRTPNID RIAREGVSFT DYYGQQSCTA GRAAFIGGNV PVRTGMTKVG LPGSKEGWQK    120
TDVTMATVLK SLGYATGQFG KNHQGDRDEH LPTMHGFDEF FGNLYHLNAQ EEPENRDYPR    180
DLKLPNGKTF LETYGPRGIL KCKADGRGGQ TIEDTGPLTK KRMETIDDET VAAAKEFITR    240
QKNANQPFFC WWNGTRMHFR THVKESNRGK SGQDEYGDGM VEHDAHVGEL LKLIDDLGLA    300
EDTIVFYSTD NGPHYNSWPD AGTTPFRSEK NSNWEGAYRV PAFVRWTGKS PPARRSTASS    360
RTRTAPRRRR AERPQIPQLH RRPQPARLPG RQGGEVPAQG VRLRQRRRPD RRHAGRGLEG    420
GVPGEPRPGV RGLARAVHRA PRPLAVQPPP RPVREVPAQL EHLQRLVPRP GVPPHPHAAA    480
GREVPHVDEG LPAEPDAGIV QPGEDPEDDR GGRIREVSCR RSSLRRRGRL ADRGAAARAG    540
PAGDSPMTNM PRRSFLGATA GALATSPLLA AGARGANDTV GVGLIGVGNR GSVLLQNLLQ    600
IPGVDVRAVC DVDGGRLERA LKTVEAAGKK RPEGSTDGTW KELLAKDGLD AIVSAIPCDL    660
HAHCYLDMIA AGKDLYGEKP MCLNRADLDA LAKAARESKQ IVQIGHQRRA DPHFIEPIAA    720
VLPGEIGDTL PGPHPVVEVV GAAIRCFLNS AHVCLTMSDP WPSIDDIVAS SGVDRELIRR    780
AADVYIKSDR VIVCWAMGLT QHKNGVANIQ EVVNLLLLRG NLGRPGAGAC PVRGHSNVQG    840
DRTMGIWERP KPEFLDRLGE AFGFEPPREH GHDVVAAIKA MHDGEAQVFF GMGGNFQMAS    900
PDTAYTSEAL RRCRLTAHVA TKLNRSHLTT GRQALILPCL GRTELDVQAT GEQFVTVEDS    960
MSAVHASHGG LRPASEHLLS EPAIVARLAR AVLGPESKVD WEGLAADYDR IRDRIAEVVP    1020
GFHDFNERVR RPGGFTLVNA AAERRFRTST GKARFMVHAI PPSGPPPGRL LLMTIRSHDQ    1080
YNTTIYGLDD RYRGVYHERR VVFMNPKDVA SLGLVDRQVV DLIGEFRGET RVAHRFIVVS    1140
HDIPVGCAAA YYPETNVLVA VDDYADGSRT PASKSIVVRV VPTRKER* 
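The Signature Domain table gives Start 580 and End 695, which read as 1-based, inclusive positions in the protein sequence above. The following is a minimal sketch, not part of the database record, that checks this reading against three lines copied from the sequence block (residues 541-720). The helper names `clean_sequence` and `slice_domain` and the `offset` parameter are hypothetical, introduced only for illustration.

```python
import re

# Three lines copied verbatim from the Protein Sequence block (residues 541-720).
BLOCK = """
PAGDSPMTNM PRRSFLGATA GALATSPLLA AGARGANDTV GVGLIGVGNR GSVLLQNLLQ    600
IPGVDVRAVC DVDGGRLERA LKTVEAAGKK RPEGSTDGTW KELLAKDGLD AIVSAIPCDL    660
HAHCYLDMIA AGKDLYGEKP MCLNRADLDA LAKAARESKQ IVQIGHQRRA DPHFIEPIAA    720
"""
FIRST_RESIDUE = 541  # 1-based position of the first residue in BLOCK


def clean_sequence(block: str) -> str:
    """Strip spaces, newlines, position counters, and any stop '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the 1-based position of seq[0] in the full protein,
    so a fragment can be sliced with full-length coordinates.
    """
    return seq[start - offset : end - offset + 1]


fragment = clean_sequence(BLOCK)
domain = slice_domain(fragment, 580, 695, offset=FIRST_RESIDUE)
print(len(domain), domain[:12], domain[-7:])
```

With the full 1188-residue sequence pasted into `BLOCK`, the default `offset=1` applies and the same call would take the table coordinates unchanged.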
Functional Domains
Cdd ID     Domain         E-Value   Start  End   Length
COG0243    BisC           2.0e-54   763    1182  475
COG3119    AslA           8.0e-73   37     349   337
PRK09939   PRK09939       7.0e-124  757    1183  433
cd02767    MopB_ydeP      1.0e-136  760    1055  297
TIGR01701  Fdhalpha-like  3.0e-160  765    1183  424
Gene Ontology
GO Term     Description
GO:0008152  metabolic process
GO:0008484  sulfuric ester hydrolase activity
GO:0016491  oxidoreductase activity
GO:0030151  molybdenum ion binding
GO:0055114  oxidation-reduction process
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
RefSeq  YP_003165484.1  0        34           351        29         348      sulfatase [Candidatus Accumulibacter phosphatis clade IIA str. UW-1]
RefSeq  ZP_01090276.1   0        5            354        3          344      Aryl-sulfate sulphohydrolase [Blastopirellula marina DSM 3645]
RefSeq  ZP_01852926.1   0        10           354        6          346      Aryl-sulfate sulphohydrolase [Planctomyces maris DSM 8797]
RefSeq  ZP_02927743.1   0        30           351        2          315      sulfatase [Verrucomicrobium spinosum DSM 4136]
RefSeq  ZP_04429778.1   0        36           354        29         346      arylsulfatase A family protein [Planctomyces limnophilus DSM 3776]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1auk_A  2e-33    37           366        1          320      A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes Takasagoensis, At Ph 2.5.
PDB     1e2s_P  2e-33    37           366        1          320      P Chain P, Crystal Structure Of An Arylsulfatase A Mutant C69a
PDB     1e33_P  3e-33    37           366        1          320      P Chain P, Crystal Structure Of An Arylsulfatase A Mutant P426l
PDB     1n2l_A  3e-33    37           366        1          320      P Chain P, Crystal Structure Of An Arylsulfatase A Mutant P426l
PDB     1n2k_A  3e-33    37           366        1          320      A Chain A, Crystal Structure Of A Covalent Intermediate Of Endogenous Human Arylsulfatase A
Signal Peptide
Cleavage Site: 27
Hydropathy
EST
Hit       Length  Start  End   EValue
BQ155470  230     875    1101  0
CV252421  219     892    1107  0
CV252444  217     892    1105  0
CV252470  217     892    1105  0
CV252744  208     892    1096  0