You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Purpose of implementation request:
To try something very challenging at the interface of X-bar theory and Finite
State Machines.
When implementing the request, please focus on these
steps/functions/components:
* design a module (it will be rather complicated) to break unrecognized words into morphemes, and ask the user if that is why the word wasn't recognized (ex: the word tokenization is not in most English wordlists, but it is an English word. In this module you would present token-ize-ation to the user and they could approve the word and add it to the generative lexicon) (3 hours)
* get feedback on your design (2 hours)
* try to implement your design (10 hours)
* try to find existing projects which implement parts of your design (4 hours) [NRC's inuktitutcomputing.ca, hunspell, SIL's ToolBox, Goldsmith's Linguistica, Creutz' Morphessor]
* try to compile/run those existing projects (15 hours)
* pair program your module using whatever you have learned (10 hours)
When implementing the request, watch out for the following potential
security/lack of access/lack of data/formatting etc hiccups:
* this will be a near impossible task (it requires expertise in NLP, Machine Learning and Software Engineering)
Expected next steps see Iteration 4 - Re-factor word recognition to use the API
for the NRC's morphological analyser
Original issue reported on code.google.com by [email protected] on 25 Nov 2011 at 7:10
Original issue reported on code.google.com by
[email protected]
on 25 Nov 2011 at 7:10The text was updated successfully, but these errors were encountered: