Comment by jasonjmcghee 21 hours ago
I read it similarly: this is a property specific to bfloat16, so the quants folks tend to run on local hardware don't have the same inefficiency to exploit.