selfhoster11 19 hours ago

The full thing, 671B. It loses some intelligence at 1.5-bit quantisation, but it's acceptable. I could actually go to around 3 bits if I maxed out my RAM, but I haven't done that yet.
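
    [Editor's note] A minimal sketch of why fewer bits per weight costs accuracy, using naive symmetric round-to-nearest quantisation. This is an illustration only: the 1.5-bit quants discussed here (llama.cpp-style k-quants) use block-wise scales and non-uniform grids, not this simple scheme.

    ```python
    import numpy as np

    def quantize(w: np.ndarray, bits: int):
        # Map weights onto signed integers in [-qmax, qmax] with one global scale.
        qmax = 2 ** (bits - 1) - 1
        scale = np.abs(w).max() / qmax
        q = np.clip(np.round(w / scale), -qmax, qmax)
        return q, scale

    def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
        return q * scale

    rng = np.random.default_rng(0)
    w = rng.normal(size=4096).astype(np.float32)  # stand-in for a weight tensor
    for bits in (8, 4, 2):
        q, s = quantize(w, bits)
        err = float(np.abs(w - dequantize(q, s)).mean())
        print(f"{bits}-bit mean abs error: {err:.4f}")
    ```

    The mean reconstruction error grows as the bit width shrinks, which is the "loses some intelligence" trade-off in miniature.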

  • apitman 18 hours ago

    I've seen people say the models get more erratic at more aggressive (lower-bit) quantization levels. What's your experience been?