Hacker News

Favorites Setup
Comment by Eridrus | original | Senior SWE-Bench: open-source benchmark that assesses agents as senior engineers
[−]Eridrus · 2026-07-02 Thu 07:01 UTC · link
See, this goes back to the, all software engineers besides me are wrong, because I see this list and do not think it is anywhere close to a sufficient list for good quality software. The thing about all these criteria is that sometimes they are important, sometimes they are not.

This "standard" exists for the sake of code analysis vendors to be able to have some sort of shared taxonomy, but also provide a fig leaf of standardization to their products.