Distillation — PyTorch
Step 1 of 6
Distill GPT-4/Claude/Gemini into a GPT-2 student built from scratch.