Comment by esafak 10 hours ago

For the tasks in SWE-Bench Pro, they obtained a distribution of agent turns, summarized as the box plot. The box likely spans the interquartile range, while the whiskers cover some other range (commonly the minimum and maximum, or the most extreme points within 1.5 times the interquartile range of the box). You'd have to read their report to be sure. https://en.wikipedia.org/wiki/Box_plot
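
To make that concrete, here is a rough sketch of how the numbers behind a box plot are usually computed. The turn counts are made up for illustration (they are not from the SWE-Bench Pro report), the function name `box_plot_summary` is hypothetical, and the whisker rule is the common Tukey 1.5 × IQR convention, which may not be what their plot uses.

```python
import statistics


def box_plot_summary(values):
    """Return the five numbers a Tukey-style box plot draws, plus outliers."""
    data = sorted(values)
    # Quartiles with linear interpolation over the observed data.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    # Assumed convention: whiskers reach the most extreme points
    # that still lie within 1.5 * IQR of the box edges.
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    inside = [x for x in data if lower_fence <= x <= upper_fence]
    outliers = [x for x in data if x < lower_fence or x > upper_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": min(inside),
        "whisker_high": max(inside),
        "outliers": outliers,
    }


# Hypothetical agent turn counts for eleven tasks.
turns = [12, 15, 18, 20, 22, 25, 30, 34, 40, 55, 120]
print(box_plot_summary(turns))
```

With this made-up data the box runs from 19 to 37 turns, the whiskers from 12 to 55, and the 120-turn task is drawn as a separate outlier point. A plot that uses min/max whiskers instead would extend the upper whisker all the way to 120.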