DCAgent/selfinstruct-naive-sandboxes-2_10k_glm_4.7_traces_jupiter Viewer • Updated about 3 hours ago • 10.1k
DCAgent/eval-terminal-bench-2.0__OpenThinker-Agent-v1__eval_ctx32k_non_it_2x_eval_ Viewer • Updated about 5 hours ago • 979
DCAgent/exp_rpt_nemotron-csharp_10k_glm_4.7_traces_jupiter Viewer • Updated about 5 hours ago • 11.8k
DCAgent/exp_rpt_nemotron-bash-withtests-gpt5mini_glm_4.7_traces_jupiter Viewer • Updated about 8 hours ago • 10.7k
DCAgent/exp_rpt_nemotron-bash-withtests_glm_4.7_traces_jupiter Viewer • Updated about 12 hours ago • 10.4k
DCAgent/code-contests-sandboxes-with-tests_10k_glm_4.7_traces_jupiter Viewer • Updated about 13 hours ago • 8.19k
DCAgent/eval-swebench-verified-random-100-folders__exp-psu-swesmith-31K__eval_ctx32k_non9933e620 Viewer • Updated about 16 hours ago • 4.51k
DCAgent/exp_rpt_stack-go-v3-test_10k_glm_4.7_traces_jupiter Viewer • Updated about 18 hours ago • 14.3k
DCAgent/eval-swebench-verified-random-100-folders__exp-psu-swesmith-10K__eval_ctx32k_nona9ab762c Viewer • Updated about 20 hours ago • 5.35k