Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add absolute confidence metric #235

Open
pemistahl opened this issue Aug 14, 2024 · 0 comments · May be fixed by #241
Open

Add absolute confidence metric #235

pemistahl opened this issue Aug 14, 2024 · 0 comments · May be fixed by #241

Comments

@pemistahl
Copy link
Owner

Currently, the library only provides a relative confidence metric that tells you how likely a language is in comparison to another language. It is desirable to have an additional absolute confidence metric that works with a single language only and independently from any other language. With such an absolute confidence metric, a LanguageDetector instance could be built from a single language. This instance would then be able to provide binary decisions, i.e. tell whether some text is written in a specific language or not.

An absolute confidence metric could be based on unique or the most common n ngrams of a language.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant