41 Commits

Author SHA1 Message Date
Mishig
5df4c7c0d7 [docs] Build docs only when doc files change (#812)
Build docs only when change happens in `docs/source`

See for example
https://github.com/huggingface/api-inference/blob/main/.github/workflows/build_documentation.yml#L3-L8
2023-08-11 07:07:53 +02:00
Merve Noyan
647ae7a7d3 Setup for doc-builder and docs for TGI (#740)
I added ToC for docs v1 & started setting up for doc-builder. cc @Narsil
@osanseviero

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: osanseviero <osanseviero@gmail.com>
Co-authored-by: Mishig <mishig.davaadorj@coloradocollege.edu>
2023-08-10 10:24:52 +02:00
Nicolas Patry
92bb56b0c1 Local gptq support. (#738)
# What does this PR do?

Redoes #719

<!--
Congratulations! You've made it this far! You're not quite done yet
though.

Once merged, your PR is going to appear in the release notes with the
title you set, so make sure it's a great title that fully reflects the
extent of your awesome contribution.

Then, please replace this with a description of the change and which
issue is fixed (if applicable). Please also include relevant motivation
and context. List any dependencies (if any) that are required for this
change.

Once you're done, someone will review your PR shortly (see the section
"Who can review?" below to tag some potential reviewers). They may
suggest changes to make the code even better. If no one reviewed your PR
after a week has passed, don't hesitate to post a new comment
@-mentioning the same persons---sometimes notifications get lost.
-->

<!-- Remove if not applicable -->

Fixes # (issue)


## Before submitting
- [ ] This PR fixes a typo or improves the docs (you can dismiss the
other checks if that's the case).
- [ ] Did you read the [contributor
guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#start-contributing-pull-requests),
      Pull Request section?
- [ ] Was this discussed/approved via a Github issue or the
[forum](https://discuss.huggingface.co/)? Please add a link
      to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes?
Here are the
[documentation
guidelines](https://github.com/huggingface/transformers/tree/main/docs),
and
[here are tips on formatting
docstrings](https://github.com/huggingface/transformers/tree/main/docs#writing-source-documentation).
- [ ] Did you write any new necessary tests?


## Who can review?

Anyone in the community is free to review the PR once the tests have
passed. Feel free to tag
members/contributors who may be interested in your PR.

<!-- Your PR will be replied to more quickly if you can figure out the
right person to tag with @


@OlivierDehaene OR @Narsil

 -->
2023-07-31 10:32:52 +02:00
Nicolas Patry
f063ebde10 chore: migrate ci region for more availability. (#581) 2023-07-12 10:01:01 +02:00
OlivierDehaene
e3e487dc71 feat(server): support trust_remote_code (#363) 2023-05-23 20:40:39 +02:00
OlivierDehaene
5f67923cac feat: add nightly load testing (#358) 2023-05-23 17:42:19 +02:00
oOraph
0a6494785c fix(ci): fix security group (#359)
# What does this PR do?
Switch security group used for ci
(open outbound rules)

Signed-off-by: Raphael <oOraph@users.noreply.github.com>
Co-authored-by: Raphael <oOraph@users.noreply.github.com>
2023-05-23 16:49:11 +02:00
OlivierDehaene
5a58226130 fix(server): fix decode token (#334)
Fixes #333

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2023-05-16 23:23:27 +02:00
OlivierDehaene
dbdc587ddd feat(integration-tests): improve comparison and health checks (#336) 2023-05-16 20:22:11 +02:00
OlivierDehaene
e71471bec9 feat: add snapshot testing (#282) 2023-05-15 23:36:30 +02:00
OlivierDehaene
66b277321d feat(ci): custom gpu runners (#328) 2023-05-15 15:53:08 +02:00
Nicolas Patry
411b0d4e1f chore(github): add templates (#264) 2023-05-02 15:43:19 +02:00
Ehsan M. Kermani
f092ba9b22 feat(server): add watermarking tests (#248) 2023-04-27 19:16:35 +02:00
Nicolas Patry
db2b4e0754 feat(router): new healthcheck that skips the queue (#244)
Co-authored-by: OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
2023-04-26 20:23:54 +02:00
Nicolas Patry
c4fb09f2ae feat(router): add tests to validation (#237) 2023-04-26 16:14:40 +02:00
OlivierDehaene
274513e6a3 fix(ci): fix sha in docker image (#212) 2023-04-20 18:50:47 +02:00
OlivierDehaene
709d8936f6 feat(router): drop requests when client closes the channel (#202) 2023-04-20 11:07:40 +02:00
OlivierDehaene
b6ee0ec7b0 feat(router): add git sha to info route (#208) 2023-04-19 21:36:59 +02:00
OlivierDehaene
7a1ba58557 fix(docker): fix docker image dependencies (#187) 2023-04-17 00:26:47 +02:00
OlivierDehaene
1bb394631d fix(docker): fix docker image (#184) 2023-04-14 17:31:13 +02:00
OlivierDehaene
01c0e368e5 fix(ci): fix cosign error (#183) 2023-04-14 12:35:26 +02:00
OlivierDehaene
53ee09c0b0 fea(dockerfile): better layer caching (#159) 2023-04-14 10:12:21 +02:00
OlivierDehaene
12e5633c4d fix(ci): fix ci permissions (#181) 2023-04-13 16:32:37 +02:00
OlivierDehaene
c1e2ea3b78 feat(ci): faster scanning (#180) 2023-04-13 16:23:47 +02:00
OlivierDehaene
13f1cd024b feat(ci): use large runners (#179) 2023-04-13 16:11:48 +02:00
OlivierDehaene
9683c37bd3 feat(ci): add Trivy and scan docker image (#178) 2023-04-13 15:43:17 +02:00
OlivierDehaene
643a39d556 feat(ci): add image signing with cosign (#175) 2023-04-13 15:26:34 +02:00
OlivierDehaene
64347b05ff fix(ci): fix CVE in github-slug-action (#174) 2023-04-13 12:43:05 +02:00
OlivierDehaene
55106ec476 fix(ci): fix sagemaker action (#148) 2023-03-29 22:27:01 +02:00
OlivierDehaene
d503e8f09d feat: aws sagemaker compatible image (#147)
The only difference is that now it pushes to
registry.internal.huggingface.tech/api-inference/community/text-generation-inference/sagemaker:...
instead of
registry.internal.huggingface.tech/api-inference/community/text-generation-inference:sagemaker-...

---------

Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2023-03-29 21:38:30 +02:00
OlivierDehaene
05e9a796cc feat(server): flash neoX (#133) 2023-03-24 14:02:14 +01:00
OlivierDehaene
23e1028822 feat(python-client): add CI (#136) 2023-03-23 18:13:04 +01:00
OlivierDehaene
603e20b5f7 feat(ci): add ci paths (#134) 2023-03-23 18:01:30 +01:00
OlivierDehaene
0b6807caa4 feat(server): fix transformers commit (#96) 2023-03-03 17:56:27 +01:00
OlivierDehaene
e3ded361b2 feat(ci): improve CI speed (#94) 2023-03-03 15:07:27 +01:00
OlivierDehaene
9af454142a feat: add distributed tracing (#62) 2023-02-13 13:02:45 +01:00
OlivierDehaene
e114d87486 feat(ci): push to AML registry (#56) 2023-02-06 14:33:56 +01:00
OlivierDehaene
20c3c5940c feat(router): refactor API and add openAPI schemas (#53) 2023-02-03 12:43:37 +01:00
OlivierDehaene
404ed7a1f6 feat(ci): Docker build and push (#46) 2023-01-31 20:14:05 +01:00
OlivierDehaene
3e2e6240b8 feat(launcher): Add integration tests (#9) 2022-12-16 11:29:36 +01:00
OlivierDehaene
32a253063d feat: Return logprobs (#8) 2022-12-15 17:03:56 +01:00