NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
Updated 2025-01-05 18:58:13 +03:00
Logs uptime, mV, battery mA, battery temperature, Pi mW, and battery % from the GeeekPi UPSv5 (EP-0136) board to CSV, and optionally graphs the data
Updated 2025-01-02 02:37:29 +03:00
Postgres to SQLite3 data copier
Updated 2024-08-19 00:02:28 +03:00
Explore and analyse your Home Assistant data
Updated 2024-05-04 21:29:07 +03:00
Pydantic model and dataclasses.dataclass generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.
Updated 2024-03-18 14:54:37 +03:00
Download market data from Yahoo! Finance's API
Updated 2024-01-29 09:38:57 +03:00
Code and dataset for the paper "LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities".
Updated 2023-07-18 10:10:51 +03:00
Interact with your documents using the power of GPT, 100% privately, with no data leaks
Updated 2023-05-25 00:03:05 +03:00
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, vid
Updated 2023-05-24 23:26:06 +03:00
Postgres CLI with autocompletion and syntax highlighting
Updated 2022-11-27 03:23:34 +03:00
A Python library for crawling public data from Tefas.
Updated 2022-11-22 22:50:41 +03:00
SQLAlchemy dialect for rqlite, the lightweight, distributed database built on SQLite.
Updated 2022-10-30 02:38:34 +03:00
This is my local Apache Airflow development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
Updated 2022-03-01 18:58:40 +03:00
ConTEXT Explorer is an open web-based system for exploring and visualizing concepts (combinations of co-occurring words and phrases) over time in text documents.
Updated 2022-02-20 22:06:42 +03:00
A reusable Django model field for storing ad-hoc JSON data
Updated 2022-02-12 21:18:11 +03:00