Dongfu Jiang

DongfuJiang

https://jdf-prog.github.io/

AI & ML interests

Large Language Models, Modality Reasoning, and their evaluation

Recent Activity

authored 3 papers 4 days ago

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

Paper • 2603.12698 • Published 16 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 10 days ago • 61

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published 12 days ago • 88

upvoted a paper 5 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published 12 days ago • 88

New activity in nvidia/Nemotron-Cascade-2-30B-A3B 6 days ago

Add documentation on how to use with vLLM to README.md

#7 opened 7 days ago by

liked a model 9 days ago

nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated 4 days ago • 74.8k • 367

upvoted a paper 9 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 10 days ago • 61

upvoted a paper 12 days ago

Attention Residuals

Paper • 2603.15031 • Published 13 days ago • 165

liked a dataset 14 days ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated 15 days ago • 1.62M • 49.9k • 285

liked a model 17 days ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Text Generation • 124B • Updated 5 days ago • 171k • 310

upvoted a paper about 1 month ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

liked 2 models about 1 month ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 14 days ago • 1.38M • 1.39k

Qwen/Qwen3.5-35B-A3B

Image-Text-to-Text • 36B • Updated about 1 month ago • 3.03M • 1.28k

liked a dataset about 1 month ago

OpenResearcher/OpenResearcher-Dataset

Viewer • Updated 4 days ago • 97.6k • 4.38k • 115

liked 2 models about 2 months ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 30 days ago • 4.51M • 2.38k

stepfun-ai/Step-3.5-Flash

Text Generation • 199B • Updated 12 days ago • 91.9k • 743

updated 4 models about 2 months ago

DongfuJiang/nano_v3_search_incorrect_only_347_steps

32B • Updated Jan 28

DongfuJiang/nano_v3_search_correct_only_347_steps

32B • Updated Jan 28

DongfuJiang/nano_v3_search_200_steps

32B • Updated Jan 28 • 1

DongfuJiang/nano_v3_search_347_steps

32B • Updated Jan 28