Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models Paper • 2603.22212 • Published 1 day ago • 114
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published 8 days ago • 100
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents Paper • 2603.18815 • Published 6 days ago • 10
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 6 days ago • 57
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published 9 days ago • 179
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery Paper • 2603.08127 • Published 16 days ago • 15
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 13 days ago • 52
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 15 days ago • 78
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Paper • 2603.09206 • Published 15 days ago • 52
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 27 days ago • 44
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 22 days ago • 188
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 21 days ago • 89