Go to file

Jeffrey Morgan 6292f4b64c update `Dockerfile`		2023-07-06 16:34:44 -04:00
api	add llama.cpp go bindings	2023-07-06 16:34:44 -04:00
app	auto updater for macos	2023-07-06 00:04:06 -04:00
cmd	move prompt templates out of python bindings	2023-07-06 16:34:44 -04:00
docs	Move python docs to separate file	2023-07-01 17:54:29 -04:00
llama	fix llama.cpp build	2023-07-06 16:34:44 -04:00
python	move prompt templates out of python bindings	2023-07-06 16:34:44 -04:00
server	move prompt templates out of python bindings	2023-07-06 16:34:44 -04:00
signature	wip go engine	2023-07-06 16:34:44 -04:00
templates	move prompt templates out of python bindings	2023-07-06 16:34:44 -04:00
web	fix auto update route	2023-07-06 16:18:40 -04:00
.dockerignore	update `Dockerfile`	2023-07-06 16:34:44 -04:00
.gitignore	add templates to prompt command	2023-06-26 13:41:16 -04:00
.prettierrc.json	move .prettierrc.json to root	2023-07-02 17:34:46 -04:00
Dockerfile	update `Dockerfile`	2023-07-06 16:34:44 -04:00
go.mod	add llama.cpp go bindings	2023-07-06 16:34:44 -04:00
go.sum	add llama.cpp go bindings	2023-07-06 16:34:44 -04:00
LICENSE	`proto` -> `ollama`	2023-06-26 15:57:13 -04:00
main.go	add llama.cpp go bindings	2023-07-06 16:34:44 -04:00
models.json	format `models.json`	2023-07-02 20:33:23 -04:00
README.md	add llama.cpp go bindings	2023-07-06 16:34:44 -04:00

Ollama

An easy, fast runtime for large language models, powered by llama.cpp.

Note: this project is a work in progress. Certain models that can be run with ollama are intended for research and/or non-commercial use only.

Install

Using pip:

pip install ollama

Using docker:

docker run ollama/ollama

To run a model, use ollama run:

ollama run orca-mini-3b

You can also run models from hugging face:

ollama run huggingface.co/TheBloke/orca_mini_3B-GGML

Or directly via downloaded model files:

ollama run ~/Downloads/orca-mini-13b.ggmlv3.q4_0.bin

go generate ./...
go build .