Surprisal Guided Selection
Training at test-time for kernel optimization
Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation
Paper • 2602.07670 • Published 6 days ago • 1
Jarrodbarnes/KernelBench-RLVR-120b
Text Generation • 117B • Updated 3 days ago • 32 • 1
OpenSec: Incident Response Agent Calibration
OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks.
OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence
Paper • 2601.21083 • Published 16 days ago • 1
Jarrodbarnes/opensec-seeds
Viewer • Updated 4 days ago • 380 • 130 • 1
Jarrodbarnes/opensec-gdpo-4b
Text Generation • 4B • Updated 1 day ago • 80 • 1
RL OpenSec Environment 🔐
Sleeping