Comment by mikae1
Comment by mikae1 2 days ago
Or perhaps a 512GB Mac Studio. 671B Q4 of R1 runs on it.
Comment by mikae1 2 days ago
Or perhaps a 512GB Mac Studio. 671B Q4 of R1 runs on it.
> I run it all the time, token generation is pretty good.
I feel like because you didn't actually talk about prompt processing speed or token/s, you aren't really giving the whole picture here. What is the prompt processing tok/s and the generation tok/s actually like?
I wouldn’t say runs. More of a gentle stroll.