Comment by crimsoneer

Comment by crimsoneer 20 hours ago

4 replies

If someone is using these models, they probably can't or won't use the existing SOTA models, so not sure how useful those comparisons actually are. "Here is a benchmark that makes us look bad from a model you can't use on a task you won't be undertaking" isn't actually helpful (and definitely not in a press release).

constantcrying 20 hours ago

Completely agree, that there are legitimate reasons to prefer comparison to e.g. deepeek models. But that doesn't change my point, we both agree that the comparisons would be extremely unfavorable.

  • Lapel2742 20 hours ago

    > that the comparisons would be extremely unfavorable.

    Why should they compare apples to oranges? Ministral3 Large costs ~1/10th of Sonnet 4.5. They clearly target different users. If you want a coding assistant you probably wouldn't choose this model for various reasons. There is place for more than only the benchmark king.

    • constantcrying 20 hours ago

      Come on. Do you just not read posts at all?

      • esafak 19 hours ago

        Which lightweight models do these compare unfavorably with?