Comment by nullbio
Comment by nullbio 19 hours ago
Anyone else find that despite Gemini performing best on benches, it's actually still far worse than ChatGPT and Claude? It seems to hallucinate nonsense far more frequently than any of the others. Feels like Google just bench maxes all day every day. As for Mistral, hopefully OSS can eat all of their lunch soon enough.
No, I've been using Gemini for help while learning / building my onprem k8s cluster and it has been almost spotless.
Granted, this is a subject that is very well present in the training data but still.