Runzhe Zhan

rzzhan

https://runzhe.me/

Ririkoo

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

upvoted a paper 21 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

upvoted a paper about 1 month ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Organizations

Collections 2

Models 10

rzzhan/ThinMQM-8B

Text Generation • 8B • Updated Oct 28, 2025 • 4

rzzhan/ExGRPO-Llama3.1-8B-Instruct

Text Generation • 8B • Updated Oct 24, 2025 • 2

rzzhan/ExGRPO-Llama3.1-8B-Zero

Text Generation • 8B • Updated Oct 24, 2025 • 2

rzzhan/ExGRPO-Qwen2.5-Math-1.5B-Zero

Text Generation • 2B • Updated Oct 24, 2025 • 10

rzzhan/ExGRPO-Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Oct 24, 2025 • 3

rzzhan/ExGRPO-LUFFY-7B-Continual

Text Generation • 8B • Updated Oct 24, 2025 • 5 • 1

rzzhan/ExGRPO-Qwen2.5-Math-7B-Zero

Text Generation • 8B • Updated Oct 24, 2025 • 17

rzzhan/ThinMQM-7B

8B • Updated Oct 24, 2025 • 3

rzzhan/ThinMQM-32B

33B • Updated Oct 24, 2025 • 4

rzzhan/tiny-llama-stories-42m

Updated Sep 17, 2024 • 11 • 1

Datasets 1

rzzhan/ThinMQM-12k

Viewer • Updated Oct 24, 2025 • 23.9k • 14