zozbot234 a day ago

It's hard to judge from this particular question, but the K2.5 output looks at least marginally better AIUI, the only real problem with it is the snarky initial "That's very interesting" quip. Even then a British user would probably be fine with it.

logicprog a day ago

I agree. K2 was blunt, straightforward, pretty... rational? K2.5 has a much stronger slop vibe.

orbital-decay a day ago

K2 in your example is using the GPT reply template (tl;dr - terse details - conclusion, with contradictory tendencies), there's nothing unique about it. That's exactly how GPT-5.0 talked. The only model with a strong "personality" vibe was Claude 3 Opus.

  • user_7832 19 hours ago

    > The only model with a strong "personality" vibe was Claude 3 Opus.

    Did you have the chance to use 3.5 (or 3.6) Sonnet, and if yes, how did they compare?

    As a non-paying user, 3.5 era Claude was absolutely the best LLM I've ever used in terms of having a conversation. It felt like talking to a human and not a bot. Its replies were readable, even if they were several paragraphs long. I've unfortunately never found anything remotely as good.

    • orbital-decay 18 hours ago

      Pretty poorly in that regard. In 3.5 they killed Claude 3's agency, pretty much reversing their previous training policy in favor of "safety", and tangentially mentioned that they didn't want to make the model too human-like. [1] Claude 3 was the last version of Claude, and one of the very few models in general, that had a character. That doesn't mean it wasn't writing slop though, falling into annoying stereotypes is still unsolved in LLMs.

      [1] https://www.anthropic.com/research/claude-character (see the last 2 paragraphs)

Grosvenor a day ago

[flagged]

  • Grimblewald a day ago

    Disagree, i've found kimi useful in solving creative coding problems gemini, claude, chatgpt etc failed at. Or, it is far better at verifying, augmenting and adding to human reviews of resumes for positions. It catches missed detials humans and other llm's routinley miss. There is something special to K2.