TPI-LLM: A High-Performance Tensor Parallelism Inference System for Edge LLM Services.
Updated 2024-10-04 22:25:48 +03:00
Pydantic model and dataclasses.dataclass generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.
Updated 2024-03-18 14:54:37 +03:00
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Updated 2024-01-23 02:05:47 +03:00
A Test Runner in python, for Human Readable HTML Reports
Updated 2023-08-06 14:45:57 +03:00
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Updated 2023-06-17 01:18:36 +03:00
This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
Updated 2022-03-01 18:58:40 +03:00