penfever/rl__24GPU_shaped__stackexchange-overflow-sandboxes-skywork-response__exp_tas_optimal_comb__40-0 • Updated Apr 1 • 41.8k • 12
penfever/rl__24GPU_shaped__inferredbugs-sandboxes-verifier__exp_tas_optimal_comb__40-0 • Updated Mar 26 • 30.8k • 10
penfever/rl__64GPU_shaped_32b_entropy__swe_rebench_patched_oracle__syh-r2eg-askl-glm_4__40-0 • Updated Mar 26 • 8.51k • 5
penfever/rl__24GPU_shaped__nemotron-math-oracle-filtered__exp_tas_optimal_comb__40-0 • Updated Mar 26 • 22.4k • 11
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k-reward1 • Updated Mar 26 • 5.24k • 10
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k • Updated Mar 26 • 9.36k • 4
penfever/rl__24GPU_shaped_entropy__swe_rebench_patched_oracle__100k_wd0__Qwen3-8B__20-0 • Updated Mar 26 • 9.97k • 8
penfever/rl__24GPU_shaped__selfinstruct-naive-sandboxes-2-verified__exp_tas_optimal_comb__40-0 • Updated Mar 26 • 30.2k • 6
penfever/rl__24GPU_shaped_entropy__nemotron-math-oracle-filtered__100k_wd0 • Updated Mar 25 • 6.16k • 6
penfever/rl__24GPU_shaped__exp_rpt_pymethods2test-large__GLM-4_7-swesmith-san • Updated Mar 23 • 21.8k • 3
penfever/rl__24GPU_shaped__exp_rpt_pymethods2test-large__exp_tas_optimal_comb • Updated Mar 23 • 47.2k • 6
penfever/rl__48GPU_shaped_32b__swe_rebench_patched_oracle__Qwen3-32B • Updated Mar 22 • 38.2k • 4
penfever/rl__24GPU_base__code-contests-noblock__r2egym-nl2bash-stack • Updated Mar 19 • 48.1k • 8
penfever/rl__24GPU_shaped__nemotron-code-oracle-filtered__r2egym-nl2bash-stack • Updated Mar 18 • 8.92k • 4
penfever/rl__24GPU_shaped__stackexchange-tezos-sandboxes-skywork-response__r2egym-nl2bash-stack • Updated Mar 18 • 42.9k • 5
penfever/rl__24GPU_shaped__swe_rebench_patched_oracle__r2egym-nl2bash-stack • Updated Mar 17 • 18k • 4
penfever/rl__24GPU_base__mix_h2_language_balanced__r2egym-nl2bash-stack • Updated Mar 13 • 40.9k • 4
penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__qwen3base-GLM-4_7-sw • Updated Mar 12 • 36.7k • 4
penfever/rl__24GPU_base__exp_rpt_curriculum-hard__r2egym-nl2bash-stack • Updated Mar 12 • 7.17k • 4
penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__Qwen3-8B-Base • Updated Mar 12 • 34.9k • 4
penfever/rl__40GPU_base_32b__exp_rpt_codeelo-v2__sft_GLM-4-7-swesmith • Updated Mar 10 • 331 • 113