MatthiasPortzel 14 hours ago

The claim is that these models are training on data which include the problems and explanations. The fact that the first model trained after the public release of the questions (and crowdsourced answers) performs best is not a counter example, but is expected and supported by the claim.

jsemrau 21 hours ago

The same timing is actually suspicious. And it would not be the first time something like this happened.

iamacyborg 20 hours ago

I was noodling with Gemini 2.5 Pro a couple days ago and it was convinced Donald Trump didn’t win the 2024 election and that he conceded to Kamala Harris so I’m not entirely sure how much weight I’d put behind it.