Comment by Alifatisk Comment by Alifatisk 2 days ago 3 replies Copy Link View on Hacker News You should mention that it is 4bit quant. Still very impressive!
Copy Link geerlingguy 2 days ago Next Collapse Comment - Kiki K2 was made to be optimized at 4-bit, though. Reply View | 1 reply Copy Link natrys 2 days ago Parent Collapse Comment - That's the Kimi K2 Thinking, this post seems to be talking about original Kimi K2 Instruct though, I don't think INT4 QAT (quantization aware training) version was released for this. Reply View | 0 replies
Copy Link natrys 2 days ago Parent Collapse Comment - That's the Kimi K2 Thinking, this post seems to be talking about original Kimi K2 Instruct though, I don't think INT4 QAT (quantization aware training) version was released for this. Reply View | 0 replies
Copy Link elif 2 days ago Prev Collapse Comment - I think when you say trillion parameters, it's implied that it's quantized Reply View | 0 replies
Kiki K2 was made to be optimized at 4-bit, though.