This website requires JavaScript.
Explore
Help
Register
Sign In
alihan
/
llm-text-generation-inference
Watch
1
Star
0
Fork
0
You've already forked llm-text-generation-inference
mirror of
https://github.com/huggingface/text-generation-inference.git
synced
2023-08-15 01:09:35 +03:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
a072660bf51022f9e1d59b64efc5954a0e1eee45
llm-text-generation-inference
/
integration-tests
History
OlivierDehaene
73a4d65d26
feat: add cuda memory fraction (
#659
)
...
Close
#673
2023-07-24 11:43:58 +02:00
..
models
feat: add cuda memory fraction (
#659
)
2023-07-24 11:43:58 +02:00
conftest.py
feat(server): Add exllama GPTQ CUDA kernel support
#553
(
#666
)
2023-07-21 10:59:00 +02:00
pytest.ini
feat(server): Rework model loading (
#344
)
2023-06-08 14:51:52 +02:00
requirements.txt
feat(server): only compute prefill logprobs when asked (
#406
)
2023-06-02 17:12:30 +02:00