Skip to content

Releases: ufal/morphodita

MorphoDiTa 1.11.2

15 Nov 17:25
Compare
Choose a tag to compare

Changes since 1.11.0:

  • Fix a bug in the tokenizers, which did not end a sentence on an empty line if the preceding line ended with whitespace. However, only the REST server and the API was affected; the run_* binaries read input a paragraph at a time, and so circumvent the problem.
  • Fix warnings in Clang 15 by qualifying std::move.
  • Do not assume char is signed; fixes silent errors on aarch64 when compiled from the source (e.g. in Docker on ARM macOS).

MorphoDiTa 1.11.0

01 Sep 14:43
Compare
Choose a tag to compare

Changes since 1.10.1:

  • When analyzing numerals, if the form ends with a dot, keep the dot in the lemma (previously the dot was discarded).
  • Correctly handle lemma number which is not null-terminated in generate.
  • On Windows, the file paths are now UTF-8 encoded, instead of ANSI. This change affects the API, binary arguments, and program outputs.
  • Add ARM64 macOS build.
  • The Windows binaries are now compiled with VS 2019, older systems than Windows 7 are no longer supported.
  • Add billing information to the REST server.
  • Add type information to the Python package.
  • The Python wheels are provided for Pythons 3.6-3.11.

MorphoDiTa 1.10.1

30 Jun 08:12
Compare
Choose a tag to compare

Changes since 1.9.2:

  • Allow overriding path to Java library in the bindings.
  • Document statistical guesser training using train_guesser binary.
  • Improve the GRU tokenizer.
  • Do not generate empty lemmas in guesser.
  • Add explicit GUESSER_UNSPECIFIED guesser mode already used by tagger.
  • Fix unaligned pointer access and other UndefinedBehaviourSanitizer findings.
  • Fix bug in tag_filter incorrectly matching negates sets of 2+ characters.
  • Add several REST server options.
  • Add methods running tagset_converter during derivation_formatter.
    This allows correct handling of their interactions.
  • Add binary wheels of the Python package.

MorphoDiTa 1.9.2

22 Sep 12:36
Compare
Choose a tag to compare

Changes since 1.9.1:

  • Fix regression in morpho::generate caused in 1.9.1.
  • Add check in encode_dictionary that no lemma is repeated.

MorphoDiTa 1.9.1 - contains a regression!

20 Sep 12:37
Compare
Choose a tag to compare

Please use MorphoDiTa 1.9.2 which fixes regression caused by 1.9.1.

Changes since 1.9.0-prerelease:

  • Allow loading larger dictionaries by increasing the internal limit of size of search structures created during loading.
  • Export string_piece using correct namespace in src_lib_only/morphodita.h.

MorphoDiTa 1.9.0-prerelease

15 May 14:49
Compare
Choose a tag to compare
Pre-release

Changes since 1.3.0:

MorphoDiTa 1.3.0

30 Aug 12:00
Compare
Choose a tag to compare

Changes since 1.2.0:

  • Add support for OS X 10.7 and later.
  • Use UniLib instead of MorphoDiTa specific utf8 class.
  • Update Unicode in tokenizer from 6.3.0 to 7.0.0.
  • Fix English tokenizer bug #1.
  • Use PyTypeObject in Python bindings instead of proxy classes.

MorphoDiTa 1.2.0

22 Apr 08:01
Compare
Choose a tag to compare

Changes since 1.1.0:

  • Add strip_lemma_comment and strip_lemma_id tag set converters.

MorphoDiTa 1.1.0

09 Apr 09:24
Compare
Choose a tag to compare

Changes since 1.0.0:

  • Allow English negative prefix rules to overlap, allowing to use also "non-" as negative prefix in additon to "non".
  • Allow passing tagger model instead of morphological model to run_morpho_analyze, run_morpho_cli and run_morpho_generate binaries.
  • Add --version option printing current version of binaries.
  • Clarify stable/development versions.

MorphoDiTa 1.0.0

07 Apr 13:38
Compare
Choose a tag to compare

First public release of MorphoDiTa 1.0.0.