Comment by Aurornis
> A note, your install script appears to still have a placeholder at the "apply patch" step. A suggestion, might be more user-friendly to fork llama.cpp and then include that as a git submodule rather than make it a "git clone and apply patch" step.
The patch doesn't actually apply to llama.cpp because argument parsing was moved to arg.cpp 8 months ago.
That doesn't matter, though, because the options to set K and V quantization were added to llama.cpp in 2023.
I don't understand why the patch exists at all, other than as an attempt to make this look novel by changing the settings through a different command line argument?
I would strongly recommend that nobody run an install.sh file from a new repo like this, especially when it's not necessary for something as simple as applying a patch file.