We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The aim is for all trainers to apply the same procedure in their init function:
BCOTrainer
CPOTrainer
DPOTrainer
GKDTrainer
SFTTrainer
IterativeSFTTrainer
KTOTrainer
NashMDTrainer
OnlineDPOTrainer
ORPOTrainer
PPOv2Trainer
RewardTrainer
"dataset_text_field"
XPOTrainer
get_formatting_func_from_dataset
collate_fn
dataset_kwargs={"skip_prepare_dataset": True},
docs/dataset_format.mdx
The text was updated successfully, but these errors were encountered:
No branches or pull requests
The aim is for all trainers to apply the same procedure in their init function:
Support todo:
Standard dataset
BCOTrainer
CPOTrainer
DPOTrainer
GKDTrainer
(same asSFTTrainer
)IterativeSFTTrainer
KTOTrainer
NashMDTrainer
OnlineDPOTrainer
ORPOTrainer
PPOv2Trainer
RewardTrainer
[RewardTrainer] Tokenize inputs within trainer #2102SFTTrainer
(via"dataset_text_field"
, needs refactoring)XPOTrainer
Conversational dataset
BCOTrainer
BCOTrainer
conversational dataset support #2107CPOTrainer
DPOTrainer
Conversational dataset support forDPOTrainer
#2131GKDTrainer
IterativeSFTTrainer
KTOTrainer
NashMDTrainer
Conversational dataset support for Online DPO #2075OnlineDPOTrainer
Conversational dataset support for Online DPO #2075ORPOTrainer
PPOv2Trainer
RewardTrainer
[RewardTrainer] Tokenize inputs within trainer #2102SFTTrainer
(yes, viaget_formatting_func_from_dataset
for now, needs refactoring)XPOTrainer
Conversational dataset support for Online DPO #2075Tokenized dataset (to be discussed, do we want this?)
BCOTrainer
CPOTrainer
DPOTrainer
GKDTrainer
IterativeSFTTrainer
KTOTrainer
NashMDTrainer
OnlineDPOTrainer
ORPOTrainer
PPOv2Trainer
RewardTrainer
SFTTrainer
(yes, via customcollate_fn
anddataset_kwargs={"skip_prepare_dataset": True},
, needs refactoring)XPOTrainer
Misc
docs/dataset_format.mdx
The text was updated successfully, but these errors were encountered: