Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhuoran Yang's picture
1 6 8

Zhuoran Yang

zhuoranyang
·

AI & ML interests

reinforcement learning, game theory, AGI

Recent Activity

authored a paper about 2 months ago
A Theoretical Analysis of Deep Q-Learning
authored a paper about 2 months ago
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
authored a paper about 2 months ago
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems
View all activity

Organizations

Zhuoran Yang Research Group's profile picture Cisco Foundation AI's profile picture Center for Algorithms, Data, and Market Design at Yale's profile picture

liked 6 datasets 3 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8 • 39.9k • 102 • 4

interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Aug 5 • 51k • 823 • 141

smirki/Agentic-Coding-Tessa

Viewer • Updated Aug 12 • 44.1k • 117 • 12

interstellarninja/tool-use-multiturn-reasoning

Viewer • Updated Jul 27 • 14.6k • 202 • 22

openbmb/UltraInteract_sft

Viewer • Updated Apr 5, 2024 • 289k • 596 • 124

AquilaX-AI/security_assistant_data

Viewer • Updated Apr 27 • 18.3k • 36 • 2
liked a dataset 4 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25 • 25.7M • 11.3k • 164
liked a dataset 10 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 13.7k • 681
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs