genxcode
RL — Unsloth
RLHF with Unsloth
Project Name
RL Method
Model & Dataset
LoRA Config
Method Params
Sequence Config
Training
Generate
DPO, GRPO, or ORPO at 4x speed with Unsloth.