DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_16concurrency_openhands_eval_c_terminal-bench-2.0 Viewer • Updated about 19 hours ago • 3
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_terminal67fe5eed Viewer • Updated about 21 hours ago • 8
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_OpenThoue429c793 Viewer • Updated about 21 hours ago • 4
DCAgent/swesmith-sandboxes-with_tests-gpt-5-mini-passed_glm_4.7_traces Viewer • Updated 1 day ago • 7.17k • 14
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_0a0458a3 Viewer • Updated 1 day ago • 764 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_c267e2e6 Viewer • Updated 1 day ago • 509 • 8
DCAgent/eval-terminal-bench-2.0-claude-haiku-4-5-20251001-20260115_165217 Viewer • Updated 1 day ago • 272 • 9
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_a60f4588 Viewer • Updated 1 day ago • 371 • 13
DCAgent/eval-terminal-bench-2.0-gpt-5-mini-2025-08-07-20260115_093339 Viewer • Updated 2 days ago • 269 • 16
DCAgent/eval-terminal-bench-2.0-gemini-2.5-flash-20260114_222605 Viewer • Updated 2 days ago • 312 • 10
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-nano-2025-08-07-20260114_142654 Viewer • Updated 2 days ago • 293 • 8
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-mini-2025-08-07-20260114_222454 Viewer • Updated 3 days ago • 300 • 9
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gemini-2.5-flash-20260114_200318 Viewer • Updated 3 days ago • 339 • 9
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-mini-2025-08-07-20260114_203811 Viewer • Updated 3 days ago • 216 • 9
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_164534 Viewer • Updated 3 days ago • 195 • 11
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gemini-2.5-flash-20260114_175612 Viewer • Updated 3 days ago • 266 • 11
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocf7b91126 Viewer • Updated 3 days ago • 305 • 9
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_152435 Viewer • Updated 3 days ago • 198 • 14
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_swebench-verified-random-100-folders Viewer • Updated 3 days ago • 300 • 16
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-claude-haiku-4-5-20251001-20260114_133343 Viewer • Updated 3 days ago • 300 • 15