UALM: Unified Audio Language Model for Understanding, Generation and Reasoning Paper • 2510.12000 • Published Oct 13, 2025 • 1
Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 19 days ago • 7
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 9 days ago • 28
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos Paper • 2603.14145 • Published Mar 14 • 14
Music Flamingo 🎵: Analyze music and answer questions from audio or YouTube links Space • Running on Zero • Agents • 171
Music Flamingo: Scaling Music Understanding in Audio Language Models Paper • 2511.10289 • Published Nov 13, 2025 • 19
Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding Paper • 2508.11818 • Published Aug 15, 2025
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 92
GPT-OSS-120B on AMD MI300X 💻: gpt-oss-120b on AMD MI300X GPUs Space • Running on CPU Upgrade • Agents • Featured • 334
Audio Collection Research related to audio (speech, sounds, and music) • 1 item • Updated Sep 1, 2025