clearer example docs and add 405b example

Alex Cheema
2024-10-01 01:37:06 +04:00
parent abca3bfa37
commit 67f789b6ac


@@ -120,7 +120,9 @@ That's it! No configuration required - exo will automatically discover the other
exo starts a ChatGPT-like WebUI (powered by [tinygrad tinychat](https://github.com/tinygrad/tinygrad/tree/master/examples/tinychat)) on http://localhost:8000
For developers, exo also starts a ChatGPT-compatible API endpoint on http://localhost:8000/v1/chat/completions. Example with curls:
For developers, exo also starts a ChatGPT-compatible API endpoint on http://localhost:8000/v1/chat/completions. Examples with curl:
#### Llama 3.1 8B:
```sh
curl http://localhost:8000/v1/chat/completions \
@@ -132,6 +134,20 @@ curl http://localhost:8000/v1/chat/completions \
}'
```
#### Llama 3.1 405B:
```sh
curl http://localhost:8000/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "llama-3.1-405b",
"messages": [{"role": "user", "content": "What is the meaning of exo?"}],
"temperature": 0.7
}'
```
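The requests in this section differ only in the model name and the prompt, so they can be wrapped in a small shell function. The following is a minimal sketch, not part of exo: `build_payload` and `chat` are hypothetical helper names, the default temperature of 0.7 is copied from the example above, and the prompt is assumed to contain no double quotes or backslashes, since no JSON escaping is done.

```sh
# Hypothetical helper (not part of exo): build the JSON request body.
# $1 = model name, $2 = user prompt, $3 = temperature (optional, defaults to 0.7).
# Assumption: the prompt has no double quotes or backslashes (no JSON escaping here).
build_payload() {
  printf '{"model": "%s", "messages": [{"role": "user", "content": "%s"}], "temperature": %s}' \
    "$1" "$2" "${3:-0.7}"
}

# Hypothetical helper (not part of exo): POST the body to the local endpoint.
# Takes the same arguments as build_payload.
chat() {
  curl -s http://localhost:8000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d "$(build_payload "$@")"
}
```

With a node running, `chat llama-3.1-405b "What is the meaning of exo?"` should send the same request as the 405B example above.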
#### Llava 1.5 7B (Vision Language Model):
```sh
curl http://localhost:8000/v1/chat/completions \
-H "Content-Type: application/json" \