Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
6
8
Zhuoran Yang
zhuoranyang
Follow
0 followers
·
13 following
AI & ML interests
reinforcement learning, game theory, AGI
Recent Activity
authored
a paper
about 2 months ago
A Theoretical Analysis of Deep Q-Learning
authored
a paper
about 2 months ago
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
authored
a paper
about 2 months ago
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems
View all activity
Organizations
zhuoranyang
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
6 datasets
3 months ago
ReasoningTransferability/math_sft_40K
Viewer
•
Updated
Jul 8
•
39.9k
•
102
•
4
interstellarninja/hermes_reasoning_tool_use
Viewer
•
Updated
Aug 5
•
51k
•
823
•
141
smirki/Agentic-Coding-Tessa
Viewer
•
Updated
Aug 12
•
44.1k
•
117
•
12
interstellarninja/tool-use-multiturn-reasoning
Viewer
•
Updated
Jul 27
•
14.6k
•
202
•
22
openbmb/UltraInteract_sft
Viewer
•
Updated
Apr 5, 2024
•
289k
•
596
•
124
AquilaX-AI/security_assistant_data
Viewer
•
Updated
Apr 27
•
18.3k
•
36
•
2
liked
a dataset
4 months ago
nvidia/Nemotron-Post-Training-Dataset-v1
Viewer
•
Updated
Aug 25
•
25.7M
•
11.3k
•
164
liked
a dataset
10 months ago
open-r1/OpenR1-Math-220k
Viewer
•
Updated
Feb 18
•
450k
•
13.7k
•
681