ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing? Paper • 2606.19531 • Published 7 days ago • 18
DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects Paper • 2606.15133 • Published 11 days ago • 72
Rethinking the Role of Efficient Attention in Hybrid Architectures Paper • 2606.15378 • Published 11 days ago • 17
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 8 days ago • 60
Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models Paper • 2606.16281 • Published 9 days ago • 33
Direct 3D-Aware Object Insertion via Decomposed Visual Proxies Paper • 2606.06601 • Published 20 days ago • 26
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning Paper • 2606.04923 • Published 21 days ago • 40
From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain Paper • 2605.23895 • Published May 22 • 52
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 29 days ago • 144
Running on Zero Agents Featured 62 L2P - Z-Image 6B Pixel-Space 🎨 62 End-to-end pixel-space 6B diffusion via L2P
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published May 18 • 93
GATES: Self-Distillation under Privileged Context with Consensus Gating Paper • 2602.20574 • Published Feb 24 • 1
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 196