Comment by jjice
I think the software they were referring to is CUDA and the developer experience around the nvidia stack.
The average consumer uses llama.cpp. So here is your list of kernels: https://github.com/ggml-org/llama.cpp/tree/master/ggml/src/g...
And here is pretty damning evidence that you're full of shit: https://github.com/ggml-org/llama.cpp/blob/master/ggml/src/g...
The ggml-hip backend references the ggml-cuda kernels. The "software" is the same (as in, it is CUDA), and yet AMD is still behind.
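For anyone unfamiliar with how one codebase can serve both vendors: HIP is deliberately API-compatible with CUDA, so a common pattern is to write the kernels once in CUDA syntax and remap the runtime calls with macros when building for AMD. A minimal sketch of that pattern (the macro header and names here are illustrative, not llama.cpp's actual build setup):

```cuda
// Hypothetical single-source kernel file. When GGML_USE_HIP is defined,
// the CUDA runtime API is remapped to HIP; otherwise it compiles as
// plain CUDA. The kernel body itself needs no changes.
#if defined(GGML_USE_HIP)
#include <hip/hip_runtime.h>
#define cudaMalloc              hipMalloc
#define cudaMemcpy              hipMemcpy
#define cudaFree                hipFree
#define cudaMemcpyHostToDevice  hipMemcpyHostToDevice
#define cudaMemcpyDeviceToHost  hipMemcpyDeviceToHost
#define cudaDeviceSynchronize   hipDeviceSynchronize
#else
#include <cuda_runtime.h>
#endif

// A trivial element-wise kernel, written once in CUDA syntax.
__global__ void vec_add(const float * a, const float * b, float * c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        c[i] = a[i] + b[i];
    }
}

void launch_vec_add(const float * a, const float * b, float * c, int n) {
    float *da, *db, *dc;
    size_t size = n * sizeof(float);
    cudaMalloc(&da, size);
    cudaMalloc(&db, size);
    cudaMalloc(&dc, size);
    cudaMemcpy(da, a, size, cudaMemcpyHostToDevice);
    cudaMemcpy(db, b, size, cudaMemcpyHostToDevice);

    // The <<<grid, block>>> launch syntax is supported by both nvcc and hipcc.
    vec_add<<<(n + 255) / 256, 256>>>(da, db, dc, n);
    cudaDeviceSynchronize();

    cudaMemcpy(c, dc, size, cudaMemcpyDeviceToHost);
    cudaFree(da);
    cudaFree(db);
    cudaFree(dc);
}
```

This is why "the software is the same" in a literal sense: the AMD backend is the CUDA code compiled through HIP, so any remaining performance gap comes from the hardware, the compiler, and the driver stack rather than from AMD having different kernels.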
???
Know any LLMs that are implemented in CUDA?