Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release training dataset and loss log to help reproduce results #26

Open
srzhu97 opened this issue Oct 9, 2024 · 1 comment
Open

Release training dataset and loss log to help reproduce results #26

srzhu97 opened this issue Oct 9, 2024 · 1 comment

Comments

@srzhu97
Copy link

srzhu97 commented Oct 9, 2024

I am trying to reproduce the Mistral-7B-SPPO Iter1 model. However, after my first iteration, the model I trained diverged significantly from the published Mistral-7B-SPPO Iter1 model when comparing the results on a benchmark dataset.
To help with diagnosing the issue and improving my training, could you kindly provide the training dataset and the loss log so that I can compare my run to?

I was using this prompt dataset, but it doesn't contains the columns chosen, rejected, chosen_probs, chosen_probs_win, chosen_probs_lose. While running the generate.sh script can produce these columns, it would be incredibly helpful if the actual training dataset could be released. This would make it much easier for me to debug and identify where things might have gone wrong in my training process.

@srzhu97
Copy link
Author

srzhu97 commented Oct 9, 2024

@angelahzyuan. ^ thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant