1 6 8

Zhuoran Yang

zhuoranyang

AI & ML interests

reinforcement learning, game theory, AGI

Recent Activity

authored a paper about 2 months ago

A Theoretical Analysis of Deep Q-Learning

authored a paper about 2 months ago

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

authored a paper about 2 months ago

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

liked 6 datasets 3 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8 • 39.9k • 102 • 4

interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Aug 5 • 51k • 823 • 141

smirki/Agentic-Coding-Tessa

Viewer • Updated Aug 12 • 44.1k • 117 • 12

interstellarninja/tool-use-multiturn-reasoning

Viewer • Updated Jul 27 • 14.6k • 202 • 22

openbmb/UltraInteract_sft

Viewer • Updated Apr 5, 2024 • 289k • 596 • 124

AquilaX-AI/security_assistant_data

Viewer • Updated Apr 27 • 18.3k • 36 • 2

liked a dataset 4 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25 • 25.7M • 11.3k • 164

liked a dataset 10 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 13.7k • 681