PUL ID

PUL0602

PubMed

26442136, Stand Genomic Sci. 2015 Oct 5;10:73. doi: 10.1186/s40793-015-0031-z. eCollection 2015.

Characterization method

sequence homology analysis

Genomic accession number

CP002835.1

Nucleotide position range

2158671-2196700

Substrate

xylan

Loci

Geoth_2242-Geoth_2272

Species

Parageobacillus thermoglucosidasius/1426

Degradation or Biosynthesis

degradation

| Gene Name | Locus Tag | Protein ID | Gene Position | GenBank Contig Range | EC Number |
|---|---|---|---|---|---|
| - | Geoth_2242 | AEH48175.1 | 0 - 1500 (-) | CP002835.1:2158671-2160171 | - |
| - | Geoth_2243 | AEH48176.1 | 1518 - 2844 (-) | CP002835.1:2160189-2161515 | 5.3.1.5 |
| - | Geoth_2246 | AEH48177.1 | 5616 - 6279 (-) | CP002835.1:2164287-2164950 | - |
| - | Geoth_2247 | AEH48178.1 | 6321 - 7116 (-) | CP002835.1:2164992-2165787 | - |
| - | Geoth_2250 | AEH48179.1 | 9318 - 10557 (+) | CP002835.1:2167989-2169228 | 3.2.1.8 |
| - | Geoth_2251 | AEH48180.1 | 10627 - 11272 (-) | CP002835.1:2169298-2169943 | - |
| - | Geoth_2252 | AEH48181.1 | 11492 - 12338 (-) | CP002835.1:2170163-2171009 | 1.1.1.100 |
| - | Geoth_2253 | AEH48182.1 | 12313 - 13429 (-) | CP002835.1:2170984-2172100 | 4.2.1.8 |
| - | Geoth_2254 | AEH48183.1 | 13447 - 14851 (-) | CP002835.1:2172118-2173522 | 5.3.1.12 |
| - | Geoth_2255 | AEH48184.1 | 14883 - 15582 (-) | CP002835.1:2173554-2174253 | - |
| - | Geoth_2256 | AEH48185.1 | 15714 - 16365 (-) | CP002835.1:2174385-2175036 | - |
| - | Geoth_2257 | AEH48186.1 | 16384 - 17338 (-) | CP002835.1:2175055-2176009 | 2.7.1.45 |
| - | Geoth_2258 | AEH48187.1 | 17401 - 18913 (-) | CP002835.1:2176072-2177584 | 3.2.1.37 |
| - | Geoth_2259 | AEH48188.1 | 18928 - 20980 (-) | CP002835.1:2177599-2179651 | 3.2.1.139 |
| - | Geoth_2260 | AEH48189.1 | 20995 - 21886 (-) | CP002835.1:2179666-2180557 | - |
| - | Geoth_2261 | AEH48190.1 | 21899 - 22847 (-) | CP002835.1:2180570-2181518 | - |
| - | Geoth_2262 | AEH48191.1 | 22964 - 24611 (-) | CP002835.1:2181635-2183282 | - |
| - | Geoth_2264 | AEH48192.1 | 26774 - 27770 (-) | CP002835.1:2185445-2186441 | 3.2.1.8 |
| - | Geoth_2265 | AEH48193.1 | 27772 - 29890 (-) | CP002835.1:2186443-2188561 | 3.2.1.37 |
| - | Geoth_2266 | AEH48194.1 | 30054 - 30675 (-) | CP002835.1:2188725-2189346 | - |
| - | Geoth_2267 | AEH48195.1 | 30690 - 31740 (-) | CP002835.1:2189361-2190411 | 5.1.3.3 |
| - | Geoth_2268 | AEH48196.1 | 32086 - 32953 (-) | CP002835.1:2190757-2191624 | - |
| - | Geoth_2269 | AEH48197.1 | 32970 - 33837 (-) | CP002835.1:2191641-2192508 | - |
| - | Geoth_2270 | AEH48198.1 | 34001 - 35324 (-) | CP002835.1:2192672-2193995 | - |
| - | Geoth_2271 | AEH48199.1 | 35436 - 36225 (-) | CP002835.1:2194107-2194896 | - |
| - | Geoth_2272 | AEH48200.1 | 36224 - 38030 (-) | CP002835.1:2194895-2196701 | - |
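
In the gene table, each GenBank Contig Range equals the Gene Position shifted by the cluster start on CP002835.1 (2158671). The following Python sketch reproduces that arithmetic for three loci taken from the table. The function names are hypothetical, and reading the Gene Position start as 0-based is an assumption, based on the cluster table listing the same genes with starts one higher (for example 1519 instead of 1518).

```python
# Start of PUL0602 on CP002835.1, taken from the record above.
CLUSTER_START = 2158671


def to_contig_range(rel_start, rel_end, offset=CLUSTER_START):
    """Shift cluster-relative coordinates onto the contig (hypothetical helper)."""
    return rel_start + offset, rel_end + offset


def to_one_based(rel_start, rel_end):
    """Convert an assumed 0-based start to the 1-based start used in the cluster table."""
    return rel_start + 1, rel_end


# (start, end) pairs copied from the Gene Position column.
examples = {
    "Geoth_2242": (0, 1500),
    "Geoth_2250": (9318, 10557),
    "Geoth_2272": (36224, 38030),
}

for locus, (start, end) in examples.items():
    contig_start, contig_end = to_contig_range(start, end)
    print(f"{locus}: CP002835.1:{contig_start}-{contig_end}")
```

The same offset can be subtracted to go from contig coordinates back to cluster-relative positions.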

Cluster number

1

| Gene name | Gene position | Gene type | Found by CGCFinder? |
|---|---|---|---|
| - | 1 - 1500 (-) | CDS | No |
| - | 1519 - 2844 (-) | CDS | No |
| - | 5617 - 6279 (-) | CDS | No |
| - | 6322 - 7116 (-) | CDS | No |
| - | 9319 - 10557 (+) | CAZyme: CBM22\|GH10 | Yes |
| - | 10628 - 11272 (-) | TC: gnl\|TC-DB\|G4PA66\|9.B.28.1.5 | Yes |
| - | 11493 - 12338 (-) | other | Yes |
| - | 12314 - 13429 (-) | other | Yes |
| - | 13448 - 14851 (-) | other | Yes |
| - | 14884 - 15582 (-) | TF: DBD-Pfam\|GntR,DBD-SUPERFAMILY\|0039384 | Yes |
| - | 15715 - 16365 (-) | other | Yes |
| - | 16385 - 17338 (-) | STP: STP\|PfkB | Yes |
| - | 17402 - 18913 (-) | CAZyme: GH39 | Yes |
| - | 18929 - 20980 (-) | CAZyme: GH67 | Yes |
| - | 20996 - 21886 (-) | TC: gnl\|TC-DB\|Q09LY6\|3.A.1.1.9 | Yes |
| - | 21900 - 22847 (-) | TC: gnl\|TC-DB\|Q09LY7\|3.A.1.1.9 | Yes |
| - | 22965 - 24611 (-) | TC: gnl\|TC-DB\|C9RT46\|3.A.1.1.9 | Yes |
| - | 26775 - 27770 (-) | CAZyme: GH10 | Yes |
| - | 27773 - 29890 (-) | CAZyme: GH52 | Yes |
| - | 30055 - 30675 (-) | CAZyme: CE4 | Yes |
| - | 30691 - 31740 (-) | other | Yes |
| - | 32087 - 32953 (-) | TC: gnl\|TC-DB\|G4FGN6\|3.A.1.1.41 | Yes |
| - | 32971 - 33837 (-) | TC: gnl\|TC-DB\|Q8RJU9\|3.A.1.1.18 | Yes |
| - | 34002 - 35324 (-) | STP: STP\|SBP_bac_1 | Yes |
| - | 35437 - 36225 (-) | TF: DBD-Pfam\|HTH_AraC,DBD-Pfam\|HTH_AraC,DBD-SUPERFAMILY\|0036286,DBD-SUPERFAMILY\|0035607 | Yes |
| - | 36225 - 38030 (-) | TC: gnl\|TC-DB\|F4LXP4\|8.A.59.2.1 | Yes |
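
The Gene type values in the cluster table follow a `category: detail` pattern, with CAZyme details listing one or more families separated by `|` (for example `CAZyme: CBM22|GH10`). As a rough illustration, the Python sketch below splits a handful of those strings and tallies categories and CAZyme families. The helper names are hypothetical, the input list is only a subset of the rows, and the parsing rule is inferred from the values shown here rather than from any documented format.

```python
from collections import Counter


def parse_gene_type(gene_type):
    """Split 'CAZyme: CBM22|GH10' into ('CAZyme', 'CBM22|GH10'); bare labels get an empty detail."""
    category, _, detail = gene_type.partition(": ")
    return category, detail


def cazy_families(gene_type):
    """Return the CAZyme families in a gene-type string, or an empty list for other categories."""
    category, detail = parse_gene_type(gene_type)
    return detail.split("|") if category == "CAZyme" else []


# A subset of Gene type values copied from the cluster table.
gene_types = [
    "CDS",
    "CAZyme: CBM22|GH10",
    "TC: gnl|TC-DB|G4PA66|9.B.28.1.5",
    "other",
    "TF: DBD-Pfam|GntR,DBD-SUPERFAMILY|0039384",
    "STP: STP|PfkB",
    "CAZyme: GH39",
    "CAZyme: GH67",
    "CAZyme: GH10",
    "CAZyme: GH52",
    "CAZyme: CE4",
]

categories = Counter(parse_gene_type(g)[0] for g in gene_types)
families = Counter(f for g in gene_types for f in cazy_families(g))

print(categories)
print(families)
```

Counting families this way makes it easy to see, for instance, that GH10 occurs in two separate genes of the cluster.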

Title

Complete genome sequence of Geobacillus thermoglucosidasius C56-YS93, a novel biomass degrader isolated from obsidian hot spring in Yellowstone National Park.

Author

Brumm PJ, Land ML, Mead DA

Abstract

Geobacillus thermoglucosidasius C56-YS93 was one of several thermophilic organisms isolated from Obsidian Hot Spring, Yellowstone National Park, Montana, USA under permit from the National Park Service. Comparison of 16S rRNA sequences confirmed the classification of the strain as a G. thermoglucosidasius species. The genome was sequenced, assembled, and annotated by the DOE Joint Genome Institute and deposited at the NCBI in December 2011 (CP002835). The genome of G. thermoglucosidasius C56-YS93 consists of one circular chromosome of 3,893,306 bp and two circular plasmids of 80,849 and 19,638 bp and an average G+C content of 43.93%. G. thermoglucosidasius C56-YS93 possesses a xylan degradation cluster not found in the other G. thermoglucosidasius sequenced strains. This cluster appears to be related to the xylan degradation cluster found in G. stearothermophilus. G. thermoglucosidasius C56-YS93 possesses two plasmids not found in the other two strains. One plasmid contains a novel gene cluster coding for proteins involved in proline degradation and metabolism, the other contains a collection of mostly hypothetical proteins.