Aviral Kumar

aviralku

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

liked a model 2 months ago

nvidia/Nemotron-Cascade-2-30B-A3B

liked a dataset 3 months ago

microsoft/webgym_tasks

Organizations

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego +7 • Mar 10 • 158

upvoted 2 collections 4 months ago

InT

7 items • Updated Feb 4 • 1

POPE

5 items • Updated Mar 2 • 3

upvoted a paper about 1 year ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 48