updated README
proycon committed Jan 23, 2017
1 parent a672bcf commit db357de
Showing 1 changed file with 16 additions and 2 deletions.
18 changes: 16 additions & 2 deletions README.md
@@ -1,9 +1,23 @@
# uctodata 0.1 CLST/ILK/CLiPS 1998 - 2016
# uctodata 0.4 CLST/ILK/CLiPS 1998 - 2016
https://github.com/LanguageMachines/uctodata/

Website and documentation: https://languagemachines.github.io/ucto

uctodata provides datafiles for the tokeniser ucto for several languages
uctodata provides datafiles for the tokeniser ucto for several languages. The
language code can be supplied to ucto using the ``-L`` parameter
(e.g. ``ucto -L nld input.txt``):

* ``eng`` - English
* ``nld`` - Dutch
* ``deu`` - German
* ``fra`` - French
* ``ita`` - Italian
* ``spa`` - Spanish
* ``por`` - Portuguese
* ``rus`` - Russian
* ``swe`` - Swedish
* ``tur`` - Turkish
* ``fry`` - Frisian
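
As a minimal sketch of how a script might pick one of the language codes above and pass it to ucto, the following Python snippet builds the ``ucto -L <code> <file>`` command line shown in the README. The helper names (``build_ucto_command``, ``run_ucto``) and the ``LANGUAGES`` table are hypothetical and only illustrate the idea; the sketch assumes ucto is installed and on the ``PATH``, and relies on nothing beyond the ``-L`` parameter mentioned above.

```python
import shutil
import subprocess

# Language codes as listed in this README (hypothetical lookup table).
LANGUAGES = {
    "eng": "English",
    "nld": "Dutch",
    "deu": "German",
    "fra": "French",
    "ita": "Italian",
    "spa": "Spanish",
    "por": "Portuguese",
    "rus": "Russian",
    "swe": "Swedish",
    "tur": "Turkish",
    "fry": "Frisian",
}


def build_ucto_command(lang, input_path):
    """Return the argument list for ``ucto -L <lang> <input_path>``."""
    if lang not in LANGUAGES:
        raise ValueError(f"unsupported language code: {lang!r}")
    return ["ucto", "-L", lang, input_path]


def run_ucto(lang, input_path):
    """Run ucto if it is available on PATH (assumption: it is installed)."""
    if shutil.which("ucto") is None:
        raise FileNotFoundError("ucto executable not found on PATH")
    return subprocess.run(build_ucto_command(lang, input_path), check=True)
```

Calling ``build_ucto_command("nld", "input.txt")`` yields the same invocation as the example in the text; validating the code first avoids handing ucto a language for which uctodata ships no datafile.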

uctodata is architecture independent.
