Alex Cheema
|
719e149aeb
|
test trigger workflow
|
2024-07-20 12:45:16 -07:00 |
|
Alex Cheema
|
9d939b3703
|
disable tinygrad test again, we need a smaller model or a machine with more memory otherwise we get Metal OOM
|
2024-07-20 12:42:56 -07:00 |
|
Alex Cheema
|
774e620973
|
add space between outputs in github workflow integration test
|
2024-07-20 12:40:51 -07:00 |
|
Alex Cheema
|
a2a7ca1f8b
|
cleaner node info =
|
2024-07-20 12:33:38 -07:00 |
|
Alex Cheema
|
04f2aa2a65
|
try with METAL_XCODE=1 for tinygrad metal
|
2024-07-20 12:16:54 -07:00 |
|
Alex Cheema
|
d2ed4c2a16
|
disable tinygrad infernece engine test waiting Waiting on https://github.com/tinygrad/tinygrad/issues/5549
|
2024-07-20 12:13:14 -07:00 |
|
Alex Cheema
|
115aab0d6c
|
cache tinygrad models in github workflow
|
2024-07-20 12:12:11 -07:00 |
|
Alex Cheema
|
a4cc667754
|
async model downloading fixes #30
|
2024-07-20 12:08:00 -07:00 |
|
Alex Cheema
|
e49924e1b9
|
add chatgpt-api-response-timeout-secs flag, set this to 20 mins in test
|
2024-07-20 00:27:14 -07:00 |
|
Alex Cheema
|
7dd7ccab91
|
do one request to load the model then another to check the response
|
2024-07-20 00:17:52 -07:00 |
|
Alex Cheema
|
144af10686
|
separate discovery and chatgpt api integration test
|
2024-07-20 00:13:19 -07:00 |
|
Alex Cheema
|
93df43d07f
|
redundant sh
|
2024-07-20 00:11:05 -07:00 |
|
Alex Cheema
|
bf7aa51ba1
|
rename to discovery integration test as thats all it checks
|
2024-07-20 00:09:34 -07:00 |
|
Alex Cheema
|
b9a2c0f730
|
fix tests
|
2024-07-20 00:08:45 -07:00 |
|
Alex Cheema
|
d9516d2eb4
|
insstall in workflow
|
2024-07-20 00:00:00 -07:00 |
|
Alex Cheema
|
8efd656303
|
set different api ports so they dont conlict
|
2024-07-19 23:51:48 -07:00 |
|
Alex Cheema
|
8dd17fe00b
|
integration test with discovery
|
2024-07-19 23:51:15 -07:00 |
|
Alex Cheema
|
4d962ffcaa
|
fix hardcoded path in debug_inference_engine
|
2024-07-19 23:44:42 -07:00 |
|
Alex Cheema
|
30ab126c08
|
fix test_inference_engine
|
2024-07-19 23:44:19 -07:00 |
|
Alex Cheema
|
56e5e34ec9
|
fix invalid escape sequence exo_text
|
2024-07-19 23:41:06 -07:00 |
|
Alex Cheema
|
62a2407352
|
github workflow: use python3 consistently
|
2024-07-19 23:38:25 -07:00 |
|
Alex Cheema
|
ba1916a388
|
github workflow for tests
|
2024-07-19 23:37:15 -07:00 |
|
Alex Cheema
|
10a043773f
|
check for the last file that downloads in case it fails part way through
|
2024-07-19 22:16:10 -07:00 |
|
Alex Cheema
|
1475c735c9
|
fix inference_state serialization. related: #40 #44 #45
|
2024-07-19 21:39:57 -07:00 |
|
Alex Cheema
|
e18549e9e1
|
rm print
|
2024-07-19 16:52:13 -07:00 |
|
Alex Cheema
|
0c5a927f9a
|
spacing in viz
|
2024-07-19 16:43:38 -07:00 |
|
Alex Cheema
|
9fa0cb1ab1
|
add gpu poor/rich bar in panel. fixes #33
|
2024-07-19 16:35:48 -07:00 |
|
Alex Cheema
|
5b8f127bf4
|
fix opaque broadcast
|
2024-07-19 11:37:02 -07:00 |
|
Alex Cheema
|
a342e1abd8
|
add web url and chatgpt api endpoint to panel (fixes #43), fix a rounding error in the partition to shard mapping implementation
|
2024-07-19 11:30:13 -07:00 |
|
Alex Cheema
|
8939f888eb
|
remove spammy log
|
2024-07-19 10:36:25 -07:00 |
|
Alex Cheema
|
d9484906a3
|
remove the spammy logs
|
2024-07-18 23:22:09 -07:00 |
|
Alex Cheema
|
dd09c59719
|
fix issues with chatgpt api where it would generate too long output. avoid nonlocal
|
2024-07-18 23:16:59 -07:00 |
|
Alex Cheema
|
4b592f9d45
|
exo topology visualisation that shows the topology of the network, device capabilities and the currently active node using opaque statuses. fixes #36. ready for #33
|
2024-07-18 22:50:18 -07:00 |
|
Alex Cheema
|
35177690bd
|
by default find an ephemeral node port fixes #35, more robust topology updates. both fix #15 and #14
|
2024-07-18 19:59:57 -07:00 |
|
Alex Cheema
|
54c98607ef
|
more robust grpc discovery with asyncio and proper error handling, add flops to device capabilities. fixes #23 and progress on #33
|
2024-07-18 19:06:23 -07:00 |
|
Alex Cheema
|
fa9d416955
|
rm unused imports
|
2024-07-18 16:20:01 -07:00 |
|
Alex Cheema
|
0af164f076
|
remove old PartitioningStrategy
|
2024-07-18 16:19:20 -07:00 |
|
Alex Cheema
|
1b194b43f6
|
reference the code for each feature listed in README
|
2024-07-18 16:18:01 -07:00 |
|
Alex Cheema
|
945f90f676
|
allow overriding inference_engine and separate flag for TINYGRAD_DEBUG
|
2024-07-18 16:09:37 -07:00 |
|
Alex Cheema
|
47163d22db
|
broadcast results concurrently fixes #31
|
2024-07-18 16:09:37 -07:00 |
|
Alex Cheema
|
621a5f5d7b
|
Add license badge
|
2024-07-18 16:00:34 -07:00 |
|
Alex Cheema
|
46d618abed
|
tiny fixes
|
2024-07-18 04:33:44 -07:00 |
|
Alex Cheema
|
d4f55002ea
|
sort topology by memory descending (works well for now to workaround #12
|
2024-07-18 03:44:35 -07:00 |
|
Alex Cheema
|
071b1caa0b
|
drop exo to 0.0.1 (still experimental)
|
2024-07-18 01:17:19 -07:00 |
|
Alex Cheema
|
e7dcdac24c
|
fix exo text
|
2024-07-18 01:06:18 -07:00 |
|
Alex Cheema
|
17ecf26677
|
typo bullet point
|
2024-07-17 23:41:49 -07:00 |
|
Alex Cheema
|
9958ac3917
|
Make known issues more prominent
|
2024-07-17 23:41:24 -07:00 |
|
Alex Cheema
|
72fe293729
|
exo text on start and stop
|
2024-07-17 21:59:05 -07:00 |
|
Alex Cheema
|
fbbb45c37e
|
install script
|
2024-07-17 21:55:37 -07:00 |
|
Alex Cheema
|
3778301b47
|
add alternative installation through install.sh
|
2024-07-17 21:13:59 -07:00 |
|