glem

GLEM is a lemmatizer for Ancient Greek.

It has been created in the project Unraveling the Language of Perspective, which is supported by the EU under FP7, ERC Starting Grant 338421-Perspective.

The paper 'A memory-based lemmatizer for Ancient Greek' reports on how it works, what material it uses, and what the accuracy is. It can be found in the repository and at http://dl.acm.org/citation.cfm?id=3078100.

A webservice where you can upload texts that you want to have lemmatized can be found at https://webservices.cls.ru.nl/. Or you can host your own.

Dependencies

Just Python 3 for the simple word list based lemmatizer.

To add machine learning based lemmatization that also takes into account the context, glem uses Frog via its python binding.

Installation

Run: pip install .

We recommend using a Python virtual environment of your own. Alternatively for a global installation, prepend sudo.

Example usage

Glem comes with a pretrained model, based on lemmas chosen by humans (in the UiO PROIEL project, PI: Dag Haug), for Herodotus. You can use it (with or without Frog) as follows:

glem -f input.txt

The files for this model can be found in glem/pretrained_models/herodotus .

Webservice

A Dockerfile is provided for deployment of the GLEM webservice in production environments.

From the repository root, build as follows:

$ docker build -t webglem .

Consult the Dockerfile for various build-time parameters that you may want to set for your own production environment.

When running, mount the path where you want the user data stored into the container, a directory webglem-userdata will be created here:

$ docker run -p 8080:80 -v /path/to/data/dir:/data webglem

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
doc		doc
glem		glem
webglem		webglem
.gitignore		.gitignore
Dockerfile		Dockerfile
GLEM_paper.pdf		GLEM_paper.pdf
README.md		README.md
WEBSERVICE_INSTRUCTIONS.rst		WEBSERVICE_INSTRUCTIONS.rst
codemeta.json		codemeta.json
setup.py		setup.py
startserver_development.sh		startserver_development.sh
startserver_production.sh		startserver_production.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

glem

Dependencies

Installation

Example usage

Webservice

About

Releases 6

Packages

Contributors 5

Languages

GreekPerspective/glem

Folders and files

Latest commit

History

Repository files navigation

glem

Dependencies

Installation

Example usage

Webservice

About

Resources

Stars

Watchers

Forks

Releases 6

Packages 0

Contributors 5

Languages

Packages