OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper β’ 2512.07802 β’ Published 3 days ago β’ 36
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper β’ 2511.12207 β’ Published 26 days ago β’ 8
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published 10 days ago β’ 61
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper β’ 2511.12207 β’ Published 26 days ago β’ 8
Scaling Zero-Shot Reference-to-Video Generation Paper β’ 2512.06905 β’ Published 4 days ago β’ 28 β’ 4
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published 10 days ago β’ 61
Running on Zero MCP Featured 1.57k Qwen Image Edit Camera Control π¬ 1.57k Fast 4 step inference with Qwen Image Edit 2509
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper β’ 2511.10629 β’ Published 28 days ago β’ 122
Running on Zero MCP Featured 2.38k Wan2.2 14B Fast π₯ 2.38k generate a video from an image with a text prompt