Universal dictionary with information about the world #80

msklvsk · 2018-10-02T16:44:05Z

If there is (for example) a valency dictionary, one can tag each verb in the gold standard with its valency, train the parser using that additional annotation, and then provide the dictionary at the inference stage so that the parser can make better-informed decisions, as UDPipe already does with a morphological dictionary. I wonder whether putting everything into the FEATS column is suboptimal. Should there be a dedicated way to aid the parser with additional non-morphological annotation, or should using FEATS suffice? What if one does not have a morphological dictionary but does have a valency dictionary?
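To make the FEATS workaround concrete, here is a minimal sketch of how verbs in a CoNLL-U file could be tagged with valency before training. It is not UDPipe code: the `Valency` feature name, the dictionary contents, and the `add_valency` helper are all assumptions made for illustration.

```python
# Hypothetical valency dictionary: lemma -> valency label (illustrative values only).
VALENCY = {"give": "Ditr", "sleep": "Intr"}


def add_valency(conllu_line, valency=VALENCY):
    """Add a made-up Valency=... feature to FEATS (6th column) of a CoNLL-U token line."""
    line = conllu_line.rstrip("\n")
    # Comment lines and sentence-separating blank lines pass through unchanged.
    if not line.strip() or line.startswith("#"):
        return line
    cols = line.split("\t")
    if len(cols) != 10:
        return line  # not a well-formed token line; leave it alone
    lemma, upos, feats = cols[2], cols[3], cols[5]
    if upos == "VERB" and lemma in valency:
        pairs = [] if feats == "_" else feats.split("|")
        pairs.append("Valency=" + valency[lemma])
        # CoNLL-U expects features sorted alphabetically by name (case-insensitive).
        cols[5] = "|".join(sorted(pairs, key=str.lower))
    return "\t".join(cols)
```

Running every line of the training data through `add_valency` would give the tagger and parser the extra annotation through FEATS, with the drawback discussed in this thread: the whole FEATS string changes, not just one feature.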

foxik · 2018-10-03T08:27:43Z

That is an interesting idea. Currently UDPipe can utilize only some columns in the CoNLL-U file, so using FEATS is now probably the only possibility. But as you say, it is suboptimal, especially since we consider FEATS as a whole instead of being able to look at individual features.

So either we could implement utilizing individual features from FEATS (which we should anyway), or support explicit "external" knowledge (i.e., a mapping from FORM, or maybe any other column, to a value, which is passed to the tagger/parser/...).
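As a rough illustration of the two options, the sketch below splits a FEATS string into individual features and looks up an "external" value keyed on an arbitrary column. Both helpers (`split_feats`, `external_value`) and the example mapping are hypothetical and do not reflect anything in the UDPipe code base.

```python
def split_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict ('_' means no features)."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def external_value(cols, column_index, mapping, default="_"):
    """Look up external knowledge keyed on one column of a token (e.g. 1 = FORM, 2 = LEMMA)."""
    return mapping.get(cols[column_index], default)


# Example token line, already split on tabs into its 10 CoNLL-U columns.
cols = "1\tgave\tgive\tVERB\tVBD\tTense=Past|VerbForm=Fin\t0\troot\t_\t_".split("\t")
features = split_feats(cols[5])
valency = external_value(cols, 2, {"give": "Ditr"})  # keyed on LEMMA here
```

With features available individually, a model could attend to `Tense` or a valency value on its own rather than treating the full FEATS string as one opaque symbol.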

I will be improving support for the morphological dictionary in several months (because currently it needs to be specified during training and is embedded in the model; we want to be able to utilize any given dictionary during inference, and I wanted to add support for providing only some of the columns). Maybe during the rewrite I could generalize the dictionary to also provide "additional" columns (like valency), which would be passed to the tagger/lemmatizer/parser. I will think about it, and I am leaving this open as a reminder.

msklvsk · 2018-10-03T15:40:19Z

A fun example

You can provide a dictionary of average lengths of objects. The parser will deep-learn that bigger objects are rarely inside smaller ones, which should help to disambiguate, e.g., the classic "Alice drove down the street in her car."

foxik modified the milestones: UDPipe 1.2, UDPipe 3.0 Oct 3, 2018
