Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zijie Xin's picture
2 7 12

Zijie Xin

xxayt
·
https://xxayt.github.io/
  • xxayt

AI & ML interests

multi-modal learning, AIGC

Recent Activity

upvoted a paper 11 days ago
OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
upvoted a collection about 1 month ago
Qwen3-Omni
authored a paper 3 months ago
Multi-Object Sketch Animation by Scene Decomposition and Motion Planning
View all activity

Organizations

SeekWorld's profile picture

upvoted a paper 11 days ago

OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models

Paper • 2511.14582 • Published Nov 18, 2025 • 18
upvoted a collection about 1 month ago

Qwen3-Omni

Collection
6 items • Updated 2 days ago • 176
upvoted a paper 3 months ago

Multi-Object Sketch Animation by Scene Decomposition and Motion Planning

Paper • 2503.19351 • Published Mar 25, 2025 • 1
upvoted a collection 4 months ago

MGSV

Collection
[ICCV 2025] Music Grounding by Short Video • 3 items • Updated Sep 9, 2025 • 1
upvoted 3 papers 5 months ago

Music Grounding by Short Video

Paper • 2408.16990 • Published Aug 30, 2024 • 2

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Paper • 2506.10821 • Published Jun 12, 2025 • 19

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs