Comment by michaelt

Comment by michaelt 17 hours ago

1 reply

Certainly, nobody would buy an Apple hoping to run triple-A PC games.

But among people running LLMs outside of the data centre, Apple's unified memory together with a good-enough GPU has attracted quite a bit of attention. If you've got the cash, you can get a Mac Studio with 512GB of unified memory. So there's one workload where apple silicon gives nvidia a run for their money.

gehsty 11 hours ago

Only in the size of model it can run, not speed of token generation.