Comment by wkat4242
How so? It's rock solid for me. I use ollama but it's based on llama.cpp
It's quite fast also, probably because that card has fast HBM2 memory (it has the same memory bandwidth as a 4090). And it was really cheap as it was on deep sale as an outgoing model.
"Sometimes" as in "on some cards". You're having luck with yours, but that doesn't mean it's a good place to build a community.