DPO Fine-tuning #489

jens5588 · 2024-05-07T15:36:14Z

Is it possible to adapt the fine-tuning script for DPO fine-tuning? The current version seems to support only next-token-prediction fine-tuning.
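
For reference, DPO differs from next-token prediction mainly in the loss: each example is a preference pair (a chosen and a rejected response), scored under both the model being trained and a frozen reference model. Below is a minimal sketch of the per-pair loss in plain Python, assuming the summed sequence log-probabilities have already been computed. The function names and the `beta` default are illustrative and are not part of the existing fine-tuning script.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one preference pair (hypothetical helper).

    Each argument is the summed log-probability of a full response
    under the trained policy or the frozen reference model.
    """
    # Log-ratio of policy to reference for each response.
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    # Scaled margin by which the policy prefers the chosen response.
    logits = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(logits)) == softplus(-logits)
    return softplus(-logits)
```

In a real training loop the same expression would be computed on batched tensors, and the reference log-probabilities would come from a second, non-trainable copy of the model.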

init27 · 2024-08-19T18:19:47Z

Thanks for the feedback! We are working on some examples and will let you know once they are integrated!

mreso added the enhancement New feature or request label May 10, 2024

init27 self-assigned this Aug 19, 2024