Comment by geerlingguy Comment by geerlingguy 2 days ago 1 reply Copy Link View on Hacker News Kiki K2 was made to be optimized at 4-bit, though.
Copy Link natrys 2 days ago Collapse Comment - That's the Kimi K2 Thinking, this post seems to be talking about original Kimi K2 Instruct though, I don't think INT4 QAT (quantization aware training) version was released for this. Reply View | 0 replies
That's the Kimi K2 Thinking, this post seems to be talking about original Kimi K2 Instruct though, I don't think INT4 QAT (quantization aware training) version was released for this.