RiT: Vanilla Diffusion Transformers Suffice in Representation Space Paper • 2605.21981 • Published 4 days ago • 9
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 12 days ago • 264
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 25 days ago • 57
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 121
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers Paper • 2604.02648 • Published Apr 3 • 47