Comment by simonw

Comment by simonw 8 months ago

How long ago was this? I'd be surprised to see Claude 3.7 Sonnet make a mistake of this nature.

Either way, when a model starts making dumb mistakes like that these days I start a fresh conversation (to blow away all of the bad tokens in the current one), either with that model or another one.

I often switch from Claude 3.7 Sonnet to o3 or o4-mini these days. I paste in the most recent "good" version of the thing we're working on and prompt from there.

th0ma5 8 months ago

Lol, "it didn't do it... and if it did it didn't mean it... and if it meant it it surely can't mean it now." This is unserious.

Reply View 4 replies

simonw 8 months ago

A full two thirds of the comment you replied to there were me saying "when these things start to make dumb mistakes here are the steps I take to fix the problem".

Reply View | 1 reply
- th0ma5 8 months ago
  
  Not actual knowledge but adages ! Lol "here is my magic potion that I can't tell you how it differs from the other magic potion!" That is not fixing anything. That's trying to string people along and insert yourself otherwise you'd be able to point so some kind of empirical documented experience as to why one vendor is better than the other or why older models have the problem but newer ones don't, and more importantly assurances that future models or even future unannounced changes to the models you mention will even work the same as they do today. What knowledge are you actually imparting other than "I think this is a road you could try." Thanks.
  
  Reply View | 0 replies
gotimo 8 months ago

this is the rhetoric that you will see replied to effectively any negative experience with LLMs in programming.

Reply View | 1 reply
- th0ma5 8 months ago
  
  Yeah I'm starting to think they aren't aware of the shell game in their own rhetoric. If nothing can ever be wrong then nothing is right either in that worldview.
  
  Reply View | 0 replies