Allen Zhang's picture

9 10 8

Allen Zhang

allencbzhang

·

AI & ML interests

object detection, regularization, vision and language

Recent Activity

liked a dataset 13 days ago

HuggingFaceFV/finevideo

liked a dataset 13 days ago

HuanjinYao/Mulberry-SFT

upvoted a paper about 1 month ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 223

upvoted a paper 4 months ago

Mr. DETR: Instructive Multi-Route Training for Detection Transformers

Paper • 2412.10028 • Published Dec 13, 2024 • 1

upvoted 3 papers 7 months ago

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Paper • 2506.09049 • Published Jun 10, 2025 • 37

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

Paper • 2506.10890 • Published Jun 12, 2025 • 9

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Paper • 2505.22129 • Published May 28, 2025 • 15

upvoted a paper 8 months ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30, 2025 • 46

upvoted a paper 9 months ago

v-CLR: View-Consistent Learning for Open-World Instance Segmentation

Paper • 2504.01383 • Published Apr 2, 2025 • 1

upvoted 3 papers 10 months ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25, 2025 • 73

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21, 2025 • 61

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20, 2025 • 42