Skip to content

Releases: nicodv/kmodes

0.12.2

06 Sep 19:52
Compare
Choose a tag to compare

What's changed

  • Improve estimation of gamma for k-prototypes (#186)

Full Changelog: 0.12.1...0.12.2

0.12.1

14 Apr 21:25
Compare
Choose a tag to compare

What's changed

  • Fix for broken fit_predict on KPrototypes (#176)
  • Improved validation of sample weights (#176)

Full Changelog: 0.12.0...0.12.1

0.12.0

31 Mar 00:58
Compare
Choose a tag to compare

What's changed

  • Support for sample weights for both k-modes and k-prototypes algorithms, courtesy of @kklein (#174, #171)
  • Add official support for Python 3.10 (#170)
  • Bugfix for algorithm convergence (370d64b)
  • Switch internally to pytest from nose (#170)
  • Some small fixes and cleanups

Full Changelog: 0.11.1...0.12.0

0.11.1

08 Oct 04:41
Compare
Choose a tag to compare

What's Changed

  • 155: Make _labels_cost function public by @nicodv in #156
  • Iterations were running for 1 more than expected by @nicodv in #160
  • Change feature array initialization dtype to uint32 by @rggelles in #166. This reduces memory footprint significantly.
  • Drop support for missing values, following scikit-learn: a20f6ed

Full Changelog: 0.11.0...0.11.1

0.11.0

13 Feb 03:34
Compare
Choose a tag to compare
  • Python 3.9 support
  • Minimum sklearn version upgrade to 0.22
  • Default init method for k-prototypes is now the Cao method (same as k-modes and in line with documentation), courtesy of @larroy
  • Optimizations

0.10.2

25 Feb 21:38
Compare
Choose a tag to compare
  • Added Jaccard dissimilarity function, courtesy of @BikashPandey17 (#129 )
  • Return the costs per epoch after training, courtesy of @daffidwilde (#79 )
  • Python 3.8 now supported
  • Python 3.4 no longer supported because sklearn dropped it too
  • Various bugfixes and improvements

0.10.1

25 Feb 20:58
Compare
Choose a tag to compare
  • Improved pandas compatibility, courtesy of @Genie-Liu
  • Forward compatible with future scikit-learn versions that will no longer include joblib, courtesy of @trevorstephens

0.10.0

25 Feb 20:59
Compare
Choose a tag to compare
  • Support for more than 256 clusters
  • Optional parallel execution of the multiple initialization runs (courtesy of @rphes )
  • Enhanced error checking when using pandas DataFrames as inputs to the algorithms
  • Various bug fixes and improvements
  • Semantic versioning from now on

0.9

25 Feb 21:04
Compare
Choose a tag to compare
0.9
  • Bugfixes

0.8

25 Feb 21:00
Compare
Choose a tag to compare
0.8
  • Huge speedup for k-prototypes, especially for large numbers of samples (#45). A k-prototypes benchmark script is included in examples now.
  • Offer an implementation of Ng's dissimilarity measure, which could improve convergence (#37).
  • Allow pandas DataFrames to be presented to the algorithm, instead of just numpy arrays (#40).
  • Improved handling of dependencies (#49, #53).
  • Various small bugfixes and improvements.