Comment by gtsop

Comment by gtsop 6 months ago

Honestly, i have been bitten so many times by LLM hallucinations when I work in parallel with the LLM, I wouldn't trust it autonomously running anything at all. If you have tried to use imaginary APIs, imaginary configuration and imaginary cli arguments, you know what I mean

energy123 6 months ago

> If you have tried to use imaginary APIs, imaginary configuration and imaginary cli arguments, you know what I mean

I see this comment a lot but I can't help but feel it's 4 weeks out of date. The version of o1 released on 2024-12-17 so rarely hallucinates when asked code questions of basic to medium difficulty and provided with good context and a well written prompt, in my experience. If the context window is sub-10k tokens, I have very high confidence that the output will be correct. GPT-4o and o1-mini, on the other hand, hallucinates a lot and I have learned to put low trust in the output.

Reply View 3 replies

gtsop 6 months ago

o1 is way to slow to keep up with my flow of thinking in order to be of any help in the scenario i am describing

Reply View | 2 replies
- energy123 6 months ago
  
  How are you using LLMs? With o1 I've switched to spelling out in lots of details what I want, then asking it it to one shot the full file, so with this approach the wait time has been acceptable.
  
  Reply View | 1 reply
  
  gtsop 6 months ago
  
  I'm using it to orient me when tackling something new. For instance, the other day i was making a web driver client in shell and i asked it something apong the lines of "is there an http endpoint to the webdriver to get the class name of an element?"
  These are the sort of questions i mostly do. "What is the best practice to read output from a device file in C", "Is there a cli tool to find dead typescript interface fields?"
  
  Reply View | 0 replies

meiraleal 6 months ago

I have been feeling LLM burnout and favoring code it all my self after a year of LLM assistance. When it gets things wrong it is too annoying. Like, I would get mad and start to curse it, shouting loud and in the chat.

Reply View 2 replies

gtsop 6 months ago

Exactly this. At first started verbally abusing it untill it conformed, but i quickly realised that after the context gets very long it simply discards former instructions and abusing. So i get frustrated, toxic AND don't get my job done

Reply View | 0 replies
nejsjsjsbsb 6 months ago

I mainly use it as a typing assist. If it suggests ahead what I was thinking it saves time.

Reply View | 0 replies