This repository has been archived by the owner on Jul 1, 2024. It is now read-only.

Discriminative learning rates for FineTuningTask #289

Open
vreis opened this issue Dec 5, 2019 · 0 comments
Labels
enhancement New feature or request

Comments

@vreis
Contributor

vreis commented Dec 5, 2019

🚀 Feature

Right now we only support fine-tuning by either freezing the trunk weights or training all weights together. Discriminative learning rates let us apply a different learning rate to each part of the model, which usually leads to better performance.
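As a minimal sketch of the idea in plain PyTorch (the `trunk`/`head` toy model below is an assumption for illustration, not Classy Vision's actual model API), discriminative learning rates are just per-parameter-group learning rates passed to the optimizer:

```python
import torch
import torch.nn as nn

class FineTuneNet(nn.Module):
    # Toy model: a pretrained "trunk" plus a freshly initialized "head".
    def __init__(self):
        super().__init__()
        self.trunk = nn.Linear(16, 8)
        self.head = nn.Linear(8, 2)

model = FineTuneNet()

# One parameter group per model section, each with its own learning rate:
# a small lr preserves the pretrained trunk features, while a larger lr
# lets the new head adapt quickly.
optimizer = torch.optim.SGD(
    [
        {"params": model.trunk.parameters(), "lr": 1e-4},
        {"params": model.head.parameters(), "lr": 1e-2},
    ],
    momentum=0.9,
)

print([group["lr"] for group in optimizer.param_groups])  # [0.0001, 0.01]
```

Freezing the trunk and training all weights together then fall out as the special cases `lr = 0` and equal learning rates for every group.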

Motivation

ULMFiT (https://arxiv.org/pdf/1801.06146.pdf) introduced discriminative fine-tuning in NLP. Since then it has been found to be useful in computer vision as well.

Pitch

This could be implemented in either FineTuningTask or ClassyModel. I'd rather keep ClassyModel as simple as possible and move this type of logic to the task level.
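One way the task-level approach could look (a hypothetical sketch; `build_param_groups`, the config shape, and the `trunk`/`head` names are all assumptions, not existing Classy Vision API) is a helper that maps top-level submodules to learning-rate multipliers, keeping the model itself unaware of the policy:

```python
import torch
import torch.nn as nn

def build_param_groups(model, base_lr, lr_multipliers):
    """Hypothetical task-level helper: split a model's parameters into
    optimizer groups by top-level submodule name, scaling base_lr by the
    configured multiplier (defaulting to 1.0 for unlisted submodules)."""
    groups = []
    for name, module in model.named_children():
        groups.append({
            "params": module.parameters(),
            "lr": base_lr * lr_multipliers.get(name, 1.0),
        })
    return groups

model = nn.Sequential()
model.add_module("trunk", nn.Linear(16, 8))
model.add_module("head", nn.Linear(8, 2))

# The multipliers could come from the task's config, e.g. train the trunk
# at 1% of the base learning rate.
optimizer = torch.optim.SGD(
    build_param_groups(model, base_lr=0.01, lr_multipliers={"trunk": 0.01}),
    momentum=0.9,
)
```

Because the helper only relies on `named_children()`, any model works unchanged, which is the appeal of keeping this logic out of ClassyModel.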

Alternatives

N/A

Additional context

N/A

@vreis vreis added the enhancement New feature or request label Dec 5, 2019