DCAgent/eval-SERA-8B_16concurrency_eval_ctx32k_terminal-bench-2.0 Viewer • Updated about 3 hours ago • 165
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc9b2dd9fa Viewer • Updated about 5 hours ago • 265 • 15
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-986b6fefd Viewer • Updated about 8 hours ago • 552 • 19
DCAgent/eval-glm46-swegym-tasks-maxeps-131k_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated about 10 hours ago • 817 • 4
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc7bd19372 Viewer • Updated about 11 hours ago • 541 • 11
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-93b7ec80c Viewer • Updated 1 day ago • 390 • 16
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_terminal-bench-2.0 Viewer • Updated 1 day ago • 261 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-99a1741f7 Viewer • Updated 1 day ago • 415 • 10