Skip to content

Releases: CogStack/tika-service

v0.5.3

14 Jan 18:31
Compare
Choose a tag to compare

Changes:

  • Tika updated to 2.6.0
  • New settings in application.yaml please read the README.md carefully, these settings affect performance under certain circumstances.

Full Changelog: v0.5.2...v0.5.3

v0.5.2

12 Oct 09:50
Compare
Choose a tag to compare

Minor release:

  • fixed issue where some documents did not get buffered into the stream properly, resulting in 0 length errors.

Full Changelog: v0.5.1...v0.5.2

v0.5.1

07 Oct 09:09
Compare
Choose a tag to compare

The is a minor release, summary of changes:

  • default docker base image change from openjdk to ubuntu:kinetic after noticing drastic performance improvements (the performance of the image is now closer to that of running the .JAR file directly on one's system).
  • possible bugfix to RenderState errors.

Full Changelog: 0.5.0...v0.5.1

v0.5.0

05 Oct 12:42
Compare
Choose a tag to compare

Updates:

  • Tika updated to 2.5.0
  • Dependencies updated to latest available versions
  • Added extra configurable options for PDF OCR extractions
  • Encoding output issue fixes
  • Docker service Tesseract OCR env updates (dynamic number of threads enabled, do not change unless necessary).

Full Changelog: 0.4.0...0.5.0

v0.4.0

12 Apr 11:21
Compare
Choose a tag to compare

Improvements/updates:

  • Tika updated to version 2.3.0
  • post OCR text encoding output that is configurable and supports custom string encoding
  • added OpenCL support on the docker image for Tesseract

0.3.1

04 Jan 15:03
Compare
Choose a tag to compare

Tika 2.2.1
Various application configuration changes (see README.md) that allows more granular control over how images are processed (DPI, resizing, etc.)
Batch processing improvements.

Full Changelog: https://github.com/CogStack/tika-service/commits/0.3.1

Version 0.2.0

06 Jul 15:34
72e69d3
Compare
Choose a tag to compare
Version 0.2.0 Pre-release
Pre-release

Key changes include:

  • Added threading control for Tomcat serving endpoint
  • Update Tika to 1.24 version
  • Update Tesseract to 4.1.1 version

Initial version 0.1.0

15 Aug 10:15
56727da
Compare
Choose a tag to compare
Initial version 0.1.0 Pre-release
Pre-release

Initial stable version release