Comment by 9cb14c1ec0 3 days ago

I don't think so. There are other knobs they can tweak to reduce load that affect quality less than quantizing, like trimming the conversation length without telling you, reducing reasoning effort, etc.

mgraczyk 3 days ago

We never do anything that reduces model intelligence like that.

  • siva7 a day ago

    You said "like that". OK, but there may be some truth to reduced model intelligence. Also, the Anthropic models that AWS deploys for Amazon's Kiro feel much dumber than those controlled entirely by Anthropic. It can't be just me.