Skip to content

colarusso/text2label

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text2Label: A Quick and Dirty Client-Side Text "Classifier"

This is not a true text classifier, hence the quotes. It is a JavaScript-only implementation that mostly does the job. I needed a solution that would run on something like GitHub Pages (where you only can do client-side scripting). Anyhow, you start with a two-column CSV file. The first column contains text examples, and the second column contains label for these texts. We then build vectors based on the texts and associate them with their labels. Instead of using this as training data for an honest-to-goodness classifier, we just match the vectors of novel texts with existing examples, looking for the closest match.

You can "train" you classifier and test it out here: https://colarusso.github.io/text2label/

I got the idea for this after seeing word2vecjson, which is where the word2vec json files came from originally. See https://github.com/turbomaze/word2vecjson

About

A Quick and Dirty Client-Side (JavaScript) Text "Classifier"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published