Hacker News
Favorites
Setup
☰
Home
Favorites
Setup
Comment by wwind123 |
original
|
Senior SWE-Bench: open-source benchmark that assesses agents as senior engineers
[−]
wwind123
· 2026-07-02 Thu 07:34 UTC ·
link
fave
Who knows. Maybe Mythos 5 already found a hole in SHA256, so this won't be too hard. :)