running

  • Need to figure out the OOM issues. ✅ 2026-03-09
    • Something is leaking memory between steps.
      • Trying to fix this by forcing garbage collection in prime.py and ray_trainer.py. Currently running on grpo-train-yuki.
  • Look into the evaluation harness ✅ 2026-03-15
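
For the OOM item above, a minimal sketch of forcing garbage collection between steps. This is not the actual code in prime.py or ray_trainer.py: `free_memory_between_steps`, `train`, and `step_fn` are hypothetical names, and the PyTorch call assumes the trainer uses torch with CUDA (it is skipped if torch is not installed).

```python
import gc


def free_memory_between_steps():
    """Run a full GC pass and return how many unreachable objects it found."""
    # Collect reference cycles that plain refcounting cannot free.
    collected = gc.collect()
    try:
        # Assumption: the trainer uses PyTorch. Skipped if torch is missing.
        import torch

        if torch.cuda.is_available():
            # Hands cached, unused blocks back to the driver. It does not
            # free tensors that are still referenced somewhere.
            torch.cuda.empty_cache()
    except ImportError:
        pass
    return collected


def train(num_steps, step_fn):
    """Hypothetical training loop: run one step, then clean up."""
    for step in range(num_steps):
        step_fn(step)
        free_memory_between_steps()
```

If memory still grows with this in place, the leak is probably a live reference (for example a list that accumulates per-step tensors) rather than uncollected garbage, since neither call can release objects that are still reachable.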