Ghosh

Sreyan88

AI & ML interests

None yet

Recent Activity

new activity 4 days ago

nvidia/audio-flamingo-next-hf: KeyError: 'audioflamingonext'

authored a paper 5 days ago

UALM: Unified Audio Language Model for Understanding, Generation and Reasoning

authored a paper 5 days ago

Do Audio-Visual Large Language Models Really See and Hear?

New activity in nvidia/audio-flamingo-next-hf 4 days ago

KeyError: 'audioflamingonext'

#3 opened 4 days ago by

lby01

authored 3 papers 5 days ago

UALM: Unified Audio Language Model for Understanding, Generation and Reasoning

Paper • 2510.12000 • Published Oct 13, 2025 • 1

Do Audio-Visual Large Language Models Really See and Hear?

Paper • 2604.02605 • Published 19 days ago • 7

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published 9 days ago • 28

upvoted a paper 7 days ago

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published 9 days ago • 28

submitted a paper to Daily Papers 7 days ago

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published 9 days ago • 28

liked a dataset about 1 month ago

nvidia/MMOU

Viewer • Updated 25 days ago • 15k • 1.67k • 15

authored a paper about 1 month ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published Mar 14 • 14

submitted a paper to Daily Papers about 1 month ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published Mar 14 • 14

liked a Space 3 months ago

Music Flamingo

🎵

171

Analyze music and answer questions from audio or YouTube links

liked a model 3 months ago

nvidia/music-flamingo-2601-hf

Audio-Text-to-Text • 8B • Updated 12 days ago • 80.7k • 96

authored a paper 5 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 19

commented a paper 5 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 19

authored 3 papers 6 months ago

Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding

Paper • 2508.11818 • Published Aug 15, 2025

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 92

upvoted a paper 6 months ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 92

liked a Space 8 months ago

GPT-OSS-120B on AMD MI300X

💻

334

gpt-oss-120b on AMD MI300X GPUs

updated a collection 8 months ago

Audio

Collection

liked a dataset 8 months ago

gamma-lab-umd/MMAU-Pro

Viewer • Updated Aug 28, 2025 • 5.31k • 6.46k • 18
