astrange 4 days ago

That answer seems to conflict with "in the future we'd like to give users more control over the thinking time".

I've gotten mini to think harder by asking it to, but that didn't produce a better answer. Though now I've hit the usage limits for both of them, so I can't try any more…

  • qeternity 4 days ago

    I'm not convinced there isn't more going on behind the scenes, but influencing test-time compute via the prompt is a pretty universal capability.

    • whimsicalism 4 days ago

      Not in a way that uses it effectively. In practice, all of the papers using CoT compare against a weak baseline, and the benefits level off extremely quickly.

      Nobody except recent DeepMind research has shown test-time scaling like o1.

  • bratwurst3000 4 days ago

    I tell Claude not to give me the obvious answer. That increases the thinking time, and the quality of the answers is better. Hope it helps.