omitting `--n-gpu-layers` means use metal on macos which isn't correct since ollama uses `num_gpu=0` to explicitly disable gpu for file types that are not implemented in metal |
||
|---|---|---|
| .. | ||
| llama.cpp | ||
| falcon.go | ||
| ggml.go | ||
| gguf.go | ||
| llama.go | ||
| llm.go | ||
| starcoder.go | ||
| utils.go | ||