genxcode
RL — HuggingFace
RLHF with HuggingFace TRL
Steps:
1. Project Name
2. RL Method
3. Model & Dataset
4. LoRA Config
5. Method Params
6. Sequence Config
7. Training
8. Generate
Train with DPO, GRPO, PPO, ORPO, or KTO using the HuggingFace TRL trainers.
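As a rough illustration of how the method choice relates to TRL, the sketch below maps each method name to the TRL trainer class of the same name. The dictionary and the `trainer_class_name` helper are hypothetical and are not part of genxcode or TRL; only the class names come from TRL, and their import paths can differ between TRL versions, so the code deliberately does not import the library.

```python
# Hypothetical lookup table (an assumption for illustration, not genxcode's code).
# Values are the trainer class names published by TRL; import paths vary by version.
TRAINER_CLASS_NAMES = {
    "dpo": "DPOTrainer",
    "grpo": "GRPOTrainer",
    "ppo": "PPOTrainer",
    "orpo": "ORPOTrainer",
    "kto": "KTOTrainer",
}


def trainer_class_name(method: str) -> str:
    """Return the TRL trainer class name for a method such as 'DPO' or 'kto'."""
    key = method.strip().lower()
    if key not in TRAINER_CLASS_NAMES:
        supported = ", ".join(sorted(TRAINER_CLASS_NAMES))
        raise ValueError(f"Unknown RL method {method!r}; expected one of: {supported}")
    return TRAINER_CLASS_NAMES[key]


if __name__ == "__main__":
    for name in ("DPO", "GRPO", "PPO", "ORPO", "KTO"):
        print(name, "->", trainer_class_name(name))
```

Resolving a name rather than a class keeps the sketch runnable without TRL installed; a real project would import the chosen class from whichever module the installed TRL version exposes it in.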