penfever/rl__24GPU_shaped__inferredbugs-sandboxes-verifier__exp_tas_optimal_comb__40-0 Viewer • Updated about 10 hours ago • 30.8k
penfever/rl__64GPU_shaped_32b_entropy__swe_rebench_patched_oracle__syh-r2eg-askl-glm_4__40-0 Viewer • Updated about 13 hours ago • 8.51k
penfever/rl__24GPU_shaped__nemotron-math-oracle-filtered__exp_tas_optimal_comb__40-0 Viewer • Updated about 13 hours ago • 22.4k
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k-reward1 Viewer • Updated about 13 hours ago • 5.24k
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k Viewer • Updated about 14 hours ago • 9.36k
penfever/rl__24GPU_shaped_entropy__swe_rebench_patched_oracle__100k_wd0__Qwen3-8B__20-0 Viewer • Updated about 14 hours ago • 9.97k
penfever/rl__24GPU_shaped__selfinstruct-naive-sandboxes-2-verified__exp_tas_optimal_comb__40-0 Viewer • Updated about 14 hours ago • 30.2k
penfever/rl__24GPU_shaped_entropy__nemotron-math-oracle-filtered__100k_wd0 Viewer • Updated 1 day ago • 6.16k • 7
penfever/stackexchange-tezos-sandboxes__Kimi-2.5-smaxeps-32k Viewer • Updated 3 days ago • 8.62k • 11
penfever/rl__24GPU_shaped__exp_rpt_pymethods2test-large__GLM-4_7-swesmith-san Viewer • Updated 4 days ago • 21.8k • 12
penfever/rl__24GPU_shaped__exp_rpt_pymethods2test-large__exp_tas_optimal_comb Viewer • Updated 4 days ago • 47.2k • 11
penfever/rl__48GPU_shaped_32b__swe_rebench_patched_oracle__Qwen3-32B Viewer • Updated 5 days ago • 38.2k • 10
penfever/rl__24GPU_base__code-contests-noblock__r2egym-nl2bash-stack Viewer • Updated 8 days ago • 48.1k • 10
penfever/rl__24GPU_shaped__nemotron-code-oracle-filtered__r2egym-nl2bash-stack Viewer • Updated 8 days ago • 8.92k • 9
penfever/rl__24GPU_shaped__stackexchange-tezos-sandboxes-skywork-response__r2egym-nl2bash-stack Viewer • Updated 9 days ago • 42.9k • 29
penfever/rl__24GPU_shaped__swe_rebench_patched_oracle__r2egym-nl2bash-stack Viewer • Updated 10 days ago • 18k • 32
penfever/rl__24GPU_base__mix_h2_language_balanced__r2egym-nl2bash-stack Viewer • Updated 14 days ago • 40.9k • 47
penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__qwen3base-GLM-4_7-sw Viewer • Updated 15 days ago • 36.7k • 29
penfever/rl__24GPU_base__exp_rpt_curriculum-hard__r2egym-nl2bash-stack Viewer • Updated 15 days ago • 7.17k • 23
penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__Qwen3-8B-Base Viewer • Updated 15 days ago • 34.9k • 28
penfever/rl__40GPU_base_32b__exp_rpt_codeelo-v2__sft_GLM-4-7-swesmith Viewer • Updated 17 days ago • 331 • 14
penfever/eval__openthoughts-tb-dev__r2egym-nl2bash-stack__lambda Viewer • Updated 18 days ago • 210 • 20
penfever/rl_rl-conf_24GP_base-yaml_mode-path_r2eg-nl2b-stac-bugs-fixt-agai_trai-data_exp_rpt_stac-rust Viewer • Updated 21 days ago • 26.4k • 43