sigma

sigma7863

AI & ML interests

None yet

Recent Activity

liked a Space about 22 hours ago

SII-GAIR/daVinci-MagiHuman

upvoted a paper about 22 hours ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

liked a dataset about 22 hours ago

NimrodShabtay1986/AwaRes

Organizations

None yet

upvoted 2 papers about 22 hours ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 3 days ago • 104

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

Paper • 2603.16932 • Published 12 days ago • 78

upvoted 2 papers about 24 hours ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 2 days ago • 50

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published 2 days ago • 73

upvoted a paper 1 day ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 3 days ago • 117

upvoted a collection 2 days ago

Kimodo-v1

Models for human(oid) motion generation • 6 items • Updated 1 day ago • 15

upvoted a paper 13 days ago

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published 14 days ago • 90

upvoted 2 papers 16 days ago

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

Paper • 2603.08397 • Published 17 days ago • 21

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

Paper • 2603.07300 • Published 19 days ago • 17

upvoted an article 16 days ago

Introducing Storage Buckets on the Hugging Face Hub

Article • 17 days ago • 184

upvoted a collection 16 days ago

TADA

TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 7 items • Updated 2 days ago • 69

upvoted a paper 18 days ago

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Paper • 2602.19320 • Published Feb 22 • 9

upvoted a collection 22 days ago

Bim

49 items • Updated about 20 hours ago • 7

upvoted 2 papers 22 days ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 23 days ago • 183

Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices

Paper • 2509.02523 • Published Sep 2, 2025 • 21

upvoted a paper 27 days ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published about 1 month ago • 517

upvoted 2 collections 29 days ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 1 day ago • 101

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 1 day ago • 239

upvoted 2 papers 29 days ago

Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset

Paper • 2508.15096 • Published Aug 20, 2025 • 7

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 46