Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction Paper • 2604.27221 • Published 7 days ago • 29
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 5 days ago • 71
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 5 days ago • 20
World2Minecraft: Occupancy-Driven Simulated Scenes Construction Paper • 2604.27578 • Published 6 days ago • 3
Synthetic Computers at Scale for Long-Horizon Productivity Simulation Paper • 2604.28181 • Published 6 days ago • 15
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 6 days ago • 85
Coevolving Representations in Joint Image-Feature Diffusion Paper • 2604.17492 • Published 17 days ago • 5
EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model Paper • 2604.10268 • Published 25 days ago • 12
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition Paper • 2604.21689 • Published 13 days ago • 24
WorldMark: A Unified Benchmark Suite for Interactive Video World Models Paper • 2604.21686 • Published 13 days ago • 36
LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics Paper • 2604.17295 • Published 17 days ago • 84
Exploring Spatial Intelligence from a Generative Perspective Paper • 2604.20570 • Published 14 days ago • 21
A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression Paper • 2604.19572 • Published 15 days ago • 21
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 14 days ago • 239
Elucidating the SNR-t Bias of Diffusion Probabilistic Models Paper • 2604.16044 • Published 19 days ago • 74
GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens Paper • 2604.15284 • Published 20 days ago • 24
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 21 days ago • 117
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 22 days ago • 90