Training Lab
Use this after an eval has already identified a real failure mode worth fixing. VERL plus execution_mode=real is the launch path; deterministic or simulated runs are useful for local lab work, but they are not launch-proof evidence.
Start a training experiment
No training jobs yet.
Begin with an eval run, then use the form above to create a training experiment if the evidence supports it.
LoRA Adapter Management
Loading adapters...