Hacker News

Favorites Setup
Comment by iLoveOncall | original | Senior SWE-Bench: open-source benchmark that assesses agents as senior engineers
[−]iLoveOncall · 2026-07-02 Thu 11:28 UTC · link
My colleagues will thank me for speaking non-stop right next to them surely.