Add Dockerfile + build workflow

2023-09-07 17:34:22 +03:00 · 2023-04-12 11:53:39 +02:00
parent 9f1e565594
commit c14201dc0f
3 changed files with 48 additions and 1 deletion
--- a/README.md
+++ b/README.md
@@ -66,6 +66,14 @@ python3 -m llama_cpp.server

 Navigate to [http://localhost:8000/docs](http://localhost:8000/docs) to see the OpenAPI documentation.

+## Docker image
+
+A Docker image is available on [GHCR](https://ghcr.io/abetlen/llama-cpp-python). To run the server with a local model directory mounted at `/models`:
+
+```bash
+docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/ggml-vicuna-7b-4bit.bin ghcr.io/abetlen/llama-cpp-python:latest
+```
+
 ## Low-level API

 The low-level API is a direct `ctypes` binding to the C API provided by `llama.cpp`.