Alex Cheema
|
4f4696e09d
|
remove calls to updateTotalTokens in tiny, not sure why its there
|
2024-07-17 20:57:58 -07:00 |
|
Alex Cheema
|
d4e0a7d14b
|
add endpoint to get number of encoded tokens
|
2024-07-17 20:57:04 -07:00 |
|
Alex Cheema
|
127b8e011c
|
dont explicitly specify show_index
|
2024-07-17 20:32:33 -07:00 |
|
Alex Cheema
|
a94bdbb9bd
|
serve tinychat static
|
2024-07-17 20:30:44 -07:00 |
|
Alex Cheema
|
e82dab1d4c
|
remove tinygrad hidden files
|
2024-07-17 20:23:02 -07:00 |
|
Alex Cheema
|
c2fcee4387
|
only retail examples/tinychat from tinygrad subtree
|
2024-07-17 20:14:46 -07:00 |
|
Alex Cheema
|
e1cb840b31
|
Merge commit '0870e6bfb0d46d58e29c6d5822e8bf629d03bdcc' as 'tinychat'
|
2024-07-17 20:12:20 -07:00 |
|
Alex Cheema
|
0870e6bfb0
|
Squashed 'tinychat/' content from commit fa7e734b4
git-subtree-dir: tinychat
git-subtree-split: fa7e734b49529f3b436d329279347b541fcf6435
|
2024-07-17 20:12:20 -07:00 |
|
Alex Cheema
|
d8c40bb4f1
|
print a warning if stream task ever times out
|
2024-07-17 19:09:11 -07:00 |
|
Alex Cheema
|
1e1e11cdc6
|
check if inference_engine has tokenizer before printing with it
|
2024-07-17 19:05:39 -07:00 |
|
Alex Cheema
|
8df2f4d857
|
support for streaming from non-tail nodes on chatgpt api, addresses #20
|
2024-07-17 19:05:02 -07:00 |
|
Alex Cheema
|
8762effaf4
|
chatgpt api repsonse streaming solves #20
|
2024-07-17 19:00:18 -07:00 |
|
Alex Cheema
|
5de2ea51f5
|
default to llama-3-8b and temperature=0 if not provided
|
2024-07-17 17:12:14 -07:00 |
|
Alex Cheema
|
5c3f0e3ac6
|
faster initial node discovery
|
2024-07-17 17:11:50 -07:00 |
|
Alex Cheema
|
8a35fd83f6
|
support chatgpt api endpoint fron any node #24
|
2024-07-17 15:37:47 -07:00 |
|
Alex Cheema
|
ba7abb9896
|
fix ring topology img
|
2024-07-17 15:14:01 -07:00 |
|
Alex Cheema
|
c432871ef5
|
replace the ring topology image as it was not rendering sometimes
|
2024-07-17 15:11:09 -07:00 |
|
Alex Cheema
|
bcab97cb41
|
instructions for how to force a node to be the tail
|
2024-07-17 15:00:23 -07:00 |
|
Alex Cheema
|
eb92da2c3e
|
cleaner chatgpt api impl with async callbacks
|
2024-07-17 14:50:16 -07:00 |
|
Alex Cheema
|
7c97ef521d
|
DEBUG should be imported from exo
|
2024-07-17 14:24:55 -07:00 |
|
Alex Cheema
|
5055e3788d
|
separate prerequisities seciton / troubleshooting section in installation of readme
|
2024-07-17 11:54:09 -07:00 |
|
Alex Cheema
|
998d484384
|
match psutil platform detection might catch some edge cases
|
2024-07-17 11:48:52 -07:00 |
|
Alex Cheema
|
99d40b1df0
|
download tinygrad model log
|
2024-07-17 11:47:47 -07:00 |
|
Alex Cheema
|
35335706d6
|
Merge pull request #21 from the-alex-b/setup
Added setup.py so exo is installable
|
2024-07-17 11:14:31 -07:00 |
|
Alex
|
442c7d8c3e
|
update readme
|
2024-07-17 13:22:29 +02:00 |
|
Alex
|
879b969028
|
Removed requirements.txt
|
2024-07-17 13:22:12 +02:00 |
|
Alex
|
4ea7db854a
|
more complete python gitignore template
|
2024-07-17 13:22:04 +02:00 |
|
Alex
|
bfaeccc7d5
|
added setup py
|
2024-07-17 13:21:46 +02:00 |
|
Alex Cheema
|
12fcbc0d4c
|
switch over to psutil, more robust system detection
|
2024-07-17 00:53:46 -07:00 |
|
Alex Cheema
|
43fa45990e
|
Merge pull request #18 from andvarfolomeev/fix/requirements
fix: syntax error in requirements.txt
|
2024-07-16 23:21:05 -07:00 |
|
Andrey Varfolomeev
|
7545e0605b
|
fix: syntax error in requirements.txt
|
2024-07-17 16:12:38 +10:00 |
|
Alex Cheema
|
6b3727f023
|
fetch model if doesnt exist on tinygrad
|
2024-07-16 23:09:42 -07:00 |
|
Alex Cheema
|
b1f3204e80
|
add Jinja requirement for linux
|
2024-07-16 22:56:40 -07:00 |
|
Alex Cheema
|
365114ec57
|
add stars to readme
|
2024-07-16 22:46:15 -07:00 |
|
Alex Cheema
|
71e00745cc
|
fix tokenizer inconsistencies
|
2024-07-16 22:46:15 -07:00 |
|
Alex Cheema
|
c673b4c37b
|
clarify readme
|
2024-07-16 21:21:49 -07:00 |
|
Alex Cheema
|
7cb1ba55a4
|
Clarify readme iOS
|
2024-07-16 21:19:09 -07:00 |
|
Alex Cheema
|
c819f675cb
|
fix linux/amd gpu memory, convert to MB
|
2024-07-16 18:38:57 -07:00 |
|
Alex Cheema
|
e93a319ca4
|
typo
|
2024-07-16 18:34:13 -07:00 |
|
Alex Cheema
|
ce46f00059
|
linux device capabilities
|
2024-07-16 18:33:29 -07:00 |
|
Alex Cheema
|
dbbc7be57f
|
remove hard dependency on MLX fixes #8
|
2024-07-16 18:07:45 -07:00 |
|
Alex Cheema
|
5e8bfc8a90
|
remove deprecated main_static
|
2024-07-16 17:57:01 -07:00 |
|
Alex Cheema
|
e049f701ce
|
debugging instructions in README
|
2024-07-16 16:59:49 -07:00 |
|
Alex Cheema
|
bde1e53f5f
|
add license
|
2024-07-16 14:06:36 -07:00 |
|
Alex Cheema
|
dd8d18128c
|
add an opaque inference_state that inference engines can use to pass around small state to other devices
|
2024-07-16 13:05:20 -07:00 |
|
Alex Cheema
|
03ba31c020
|
wip state
|
2024-07-16 13:05:20 -07:00 |
|
Alex Cheema
|
ed7672e32c
|
Update README.md
|
2024-07-16 08:39:57 -07:00 |
|
Alex Cheema
|
9354081b21
|
Merge pull request #9 from mattroyer/patch-1
Fix broken links
|
2024-07-16 07:59:31 -07:00 |
|
Matt Royer
|
9324fc15d9
|
Fix broken links
|
2024-07-16 07:47:59 -06:00 |
|
Alex Cheema
|
b897fa4420
|
Typo ring memory weighted partitioning strategy
|
2024-07-16 02:42:02 -07:00 |
|