Comment by danielhanchen
Comment by danielhanchen 2 hours ago
Super cool! Also with `--fit on` you don't need `--ctx-size 32768` technically anymore - llama-server will auto determine the max context size!
Comment by danielhanchen 2 hours ago
Super cool! Also with `--fit on` you don't need `--ctx-size 32768` technically anymore - llama-server will auto determine the max context size!