Comment by CuriouslyC
Comment by CuriouslyC 5 days ago
The technique people use is to capture PR diffs from public repos and extract the tests then use that to see if agents can reconstruct the patch that satisfies the tests.
Comment by CuriouslyC 5 days ago
The technique people use is to capture PR diffs from public repos and extract the tests then use that to see if agents can reconstruct the patch that satisfies the tests.