Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 3 days ago • 104
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 12 days ago • 78
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published 2 days ago • 50
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG Paper • 2603.23497 • Published 2 days ago • 73
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding Paper • 2603.22458 • Published 3 days ago • 117
Kimodo-v1 Collection Models for human(oid) motion generation • 6 items • Updated 1 day ago • 15
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published 14 days ago • 90
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 17 days ago • 21
AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery Paper • 2603.07300 • Published 19 days ago • 17
TADA Collection TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 7 items • Updated 2 days ago • 69
Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations Paper • 2602.19320 • Published Feb 22 • 9
Utonia: Toward One Encoder for All Point Clouds Paper • 2603.03283 • Published 23 days ago • 183
Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices Paper • 2509.02523 • Published Sep 2, 2025 • 21
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 1 day ago • 101
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 1 day ago • 239
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset Paper • 2508.15096 • Published Aug 20, 2025 • 7
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20, 2025 • 46