1562 Commits

Author SHA1 Message Date
Alex Cheema
56f783b38d Update. 2025-10-21 17:29:48 +01:00
Alex Cheema
e4238f9ef3 Merge pull request #800 from exo-explore/grpcio1.71.0
downgrade grpcio, grpcio-tools to 1.70.0
2025-03-21 15:23:32 -07:00
Alex Cheema
ad3bc6ceaa downgrade grpcio, grpcio-tools to 1.70.0 2025-03-21 15:23:11 -07:00
Alex Cheema
04d5dca18f Merge pull request #778 from exo-explore/grpcio1.71.0
upgrade grpcio and grpcio-tools to 1.71.0
2025-03-12 06:24:57 +00:00
Alex Cheema
50b6800a61 m3 ultra flops estimates based on some quick profiling 2025-03-11 22:51:23 -07:00
Alex Cheema
2857975bf3 upgrade grpcio and grpcio-tools to 1.71.0 2025-03-11 17:23:37 -07:00
Alex Cheema
854f515cf5 Merge pull request #763 from deftdawg/amdgpu
AMD/ROCm: Changes required to detect and inference on AMD GPUs
2025-03-06 16:07:05 +00:00
DeftDawg
f98d9bac53 Changes required to detect AMD GPUs 2025-03-05 22:49:29 -05:00
Alex Cheema
017bf93cf5 Merge pull request #753 from mags0ft/patch-1
remove dead links in README
2025-03-03 23:01:34 +00:00
mags0ft
013d2573e7 remove dead links in README 2025-03-02 18:37:59 +01:00
Alex Cheema
2702975762 Merge pull request #746 from exo-explore/grpcio1.70.0
downgrade grpc to 1.67.0. waiting for fix
2025-02-28 21:26:11 +00:00
Alex Cheema
30c3f58a00 downgrade grpc to 1.67.0. waiting for fix bd8f8a86e0 2025-02-28 21:25:11 +00:00
Alex Cheema
1bbbb1e1d8 Merge pull request #745 from exo-explore/grpcio1.70.0
Grpcio1.70.0
2025-02-28 21:05:41 +00:00
Alex Cheema
4081305e60 adjust grpc settings, ensure connected before sending any grpc commands 2025-02-28 20:52:12 +00:00
Alex Cheema
52a21645c6 Merge pull request #742 from samiamjidkhan/main
build fix
2025-02-28 12:29:58 +00:00
Sami Khan
63570c7b8b Merge pull request #1 from samiamjidkhan/build-fix
build fix
2025-02-28 15:47:36 +05:00
Sami Khan
971f5240bf build fix 2025-02-28 15:45:57 +05:00
Alex Cheema
36a6389af0 bump grpcio and grpcio-tools to 1.70.0 2025-02-27 01:40:04 +00:00
Alex Cheema
af734f1bf6 Merge pull request #737 from exo-explore/handlegzipdownload
handle -gzip suffix in etag for integrity check fixes #633
v0.0.15-alpha
2025-02-25 22:10:05 +00:00
Alex Cheema
ee095766d9 handle -gzip suffix in etag for integrity check fixes #633 2025-02-25 22:08:15 +00:00
Alex Cheema
a605e233ad Merge pull request #709 from exo-explore/notice
update notice in README
2025-02-18 11:43:14 +00:00
Alex Cheema
f9a1e5342b update notice in README 2025-02-18 11:41:09 +00:00
Alex Cheema
7a374a74cd Merge pull request #708 from exo-explore/notice
add notice to README
2025-02-17 22:55:44 +00:00
Alex Cheema
5a00899d73 Merge pull request #705 from cadenmackenzie/addingModelNameInputContainer
adding current model name to input container information
2025-02-17 22:55:29 +00:00
Alex Cheema
cb4bee2694 add notice to README 2025-02-17 22:54:56 +00:00
Caden MacKenzie
9078d094b9 adding current model name to input container information 2025-02-16 18:34:38 -08:00
Alex Cheema
ed70d47cfd Merge pull request #702 from exo-explore/alwayslogdownloaderror
make max_parallel_downloads configurable, increase download chunk size to 8MB
2025-02-14 21:27:12 +00:00
Alex Cheema
477e3a5e4c make max_parallel_downloads configurable, increase download chunk size to 8MB 2025-02-14 21:26:41 +00:00
Alex Cheema
be3b9ee973 Merge pull request #698 from exo-explore/alwayslogdownloaderror
always log download errors. some people e.g cant access huggingface
2025-02-13 22:56:33 +00:00
Alex Cheema
b4e6f8acad always log download errors. some people eg cant access huggingface which causes confusion 2025-02-13 22:55:09 +00:00
Alex Cheema
de99da7c75 Merge pull request #684 from divinity76/patch-1
workaround f16 cast ambiguity
2025-02-08 12:45:10 +00:00
Alex Cheema
76d1bd95f5 Merge pull request #688 from exo-explore/readmeupdate
apt-get debian noninteractive in circleci
2025-02-08 02:41:19 +00:00
Alex Cheema
928214d479 apt-get debian noninteractive in circleci 2025-02-08 02:40:51 +00:00
Alex Cheema
ce34a886c2 Merge pull request #687 from exo-explore/readmeupdate
README updates
2025-02-08 02:15:50 +00:00
Alex Cheema
d8c3aed0cc update discovery / peer networking modules 2025-02-08 02:15:13 +00:00
Alex Cheema
2c982d9295 update README to better reflect support for other devices like NVIDIA and Pi's 2025-02-08 02:13:04 +00:00
divinity76
5fe241ec61 code-breaking typo
oops
2025-02-06 19:02:02 +01:00
divinity76
05ff20fa89 workaround f16 cast ambiguity
for unknown reasons, without this, when trying to execute "Llama 3.2 1B", I get the error below. Fwiw I do not know the performance impact for this change. I can't even get exo running, but this change allows me to /get further/ (before running into a second issue with vram allocation? story for another day i suppose)


error: 
Failed to fetch completions: Error processing prompt (see logs with DEBUG>=2): Nvrtc Error 6, NVRTC_ERROR_COMPILATION
<null>(18): error: more than one user-defined conversion from "nv_bfloat16" to "half" applies:
            function "__half::__half(float)" (declared at line 214 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(short)" (declared at line 227 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned short)" (declared at line 228 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(int)" (declared at line 229 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned int)" (declared at line 230 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(long long)" (declared at line 231 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned long long)" (declared at line 232 of /usr/include/cuda_fp16.hpp)
    *((half4*)((data0+(alu0+(gidx1<<14)+(lidx0<<11)+alu1)))) = make_half4(((half)(val0)),((half)(val1)),((half)(val2)),((half)(val3)));
                                                                                 ^

<null>(18): error: more than one user-defined conversion from "nv_bfloat16" to "half" applies:
            function "__half::__half(float)" (declared at line 214 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(short)" (declared at line 227 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned short)" (declared at line 228 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(int)" (declared at line 229 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned int)" (declared at line 230 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(long long)" (declared at line 231 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned long long)" (declared at line 232 of /usr/include/cuda_fp16.hpp)
    *((half4*)((data0+(alu0+(gidx1<<14)+(lidx0<<11)+alu1)))) = make_half4(((half)(val0)),((half)(val1)),((half)(val2)),((half)(val3)));
                                                                                                ^

<null>(18): error: more than one user-defined conversion from "nv_bfloat16" to "half" applies:
            function "__half::__half(float)" (declared at line 214 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(short)" (declared at line 227 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned short)" (declared at line 228 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(int)" (declared at line 229 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned int)" (declared at line 230 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(long long)" (declared at line 231 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned long long)" (declared at line 232 of /usr/include/cuda_fp16.hpp)
    *((half4*)((data0+(alu0+(gidx1<<14)+(lidx0<<11)+alu1)))) = make_half4(((half)(val0)),((half)(val1)),((half)(val2)),((half)(val3)));
                                                                                                               ^

<null>(18): error: more than one user-defined conversion from "nv_bfloat16" to "half" applies:
            function "__half::__half(float)" (declared at line 214 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(short)" (declared at line 227 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned short)" (declared at line 228 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(int)" (declared at line 229 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned int)" (declared at line 230 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(long long)" (declared at line 231 of /usr/include/cuda_fp16.hpp)
            function "__half::__half(unsigned long long)" (declared at line 232 of /usr/include/cuda_fp16.hpp)
    *((half4*)((data0+(alu0+(gidx1<<14)+(lidx0<<11)+alu1)))) = make_half4(((half)(val0)),((half)(val1)),((half)(val2)),((half)(val3)));
                                                                                                                              ^

4 errors detected in the compilation of "<null>".
2025-02-06 18:54:15 +01:00
Alex Cheema
b5fc4bc288 Merge pull request #675 from exo-explore/rmtenacity
remove tenacity dependency, implement simple retry logic instead
2025-02-03 21:58:08 +00:00
Alex Cheema
5157d80a46 remove tenacity dependency, implement simple retry logic instead 2025-02-03 21:56:38 +00:00
Alex Cheema
75914b4de8 Merge pull request #669 from pavel-rodionov/feature-local-models
Add toggle to show only models downloaded locally
2025-02-03 21:45:27 +00:00
Rodionov Pavel
d084dbe574 Add toggle to show only models downloaded locally 2025-02-01 23:45:19 -08:00
Alex Cheema
1a77a52d71 Merge pull request #666 from exo-explore/patchmanualdiscovery
patch for manual discovery, set known_peers
2025-02-01 23:07:21 +00:00
Alex Cheema
72329ba984 patch for manual discovery, set known_peers 2025-02-01 23:06:57 +00:00
Alex Cheema
f663b0afa2 Merge pull request #665 from exo-explore/resumedownload
add model downloading section to README
2025-02-01 20:23:58 +00:00
Alex Cheema
51b5c2ca9b add model downloading section to README 2025-02-01 20:23:05 +00:00
Alex Cheema
9a1f0a85e6 Merge pull request #664 from exo-explore/resumedownload
resumable downloads with integrity checks
2025-02-01 18:34:36 +00:00
Alex Cheema
2c0d17c336 beautiful download 2025-02-01 17:29:19 +00:00
Alex Cheema
7034ee0fcb resumable downloads with integrity checks 2025-02-01 13:22:51 +00:00
Alex Cheema
7a75fb09b2 Merge pull request #660 from exo-explore/robustdownload
cleanup tmp files on failed download
2025-01-30 20:25:15 +00:00