ProcVLM: Learning Procedure-Grounded Progress Rewards for Robotic Manipulation Paper • 2605.08774 • Published May 9 • 2
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training Paper • 2606.30406 • Published 4 days ago • 10