Training GUI agents with augmented reasoning data and a tailored post-training recipe
Rui Yang PRO
Ray2333
AI & ML interests
Deep Reinforcement Learning
Recent Activity
liked a dataset about 6 hours ago
ReCAP-Agent/ReCAP-187k-SFT liked a model about 6 hours ago
ReCAP-Agent/ReCAP-32B liked a model about 6 hours ago
ReCAP-Agent/ReCAP-8B