Optimizing inference proxy for LLMs
llm
large-language-models
openai
prompt-engineering
agent
agents
proxy-server
genai
llm-inference
agentic-ai
optimization
agentic-framework
agentic-workflow
api-gateway
chain-of-thought
llmapi
mixture-of-experts
moa
monte-carlo-tree-search
openai-api
Updated 2025-05-28 09:39:38 +03:00
Updated 2025-05-11 21:09:47 +03:00
Updated 2025-05-11 19:55:31 +03:00
"MiniRAG: Making RAG Simpler with Small and Free Language Models"
Updated 2025-05-11 03:54:59 +03:00
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
large-language-models
llms
rag
retrieval-augmented-generation
long-video-understanding
multi-modal-llms
Updated 2025-05-11 03:54:36 +03:00
I made my AI think harder by making it argue with itself repeatedly. It works stupidly well.
Updated 2025-04-29 22:56:02 +03:00
Updated 2025-04-14 00:13:29 +03:00
A zero-configuration tool for automatically exposing FastAPI endpoints as Model Context Protocol (MCP) tools.
Updated 2025-04-13 23:32:12 +03:00
Get your documents ready for gen AI
ai
markdown
html
pdf
pptx
docx
xlsx
pdf-to-text
tables
convert
document-parser
document-parsing
documents
pdf-converter
pdf-to-json
Updated 2025-03-19 18:18:10 +03:00
Make websites accessible for AI agents
Updated 2025-02-18 01:18:20 +03:00
Updated 2025-02-09 22:36:01 +03:00
Updated 2025-01-27 02:30:25 +03:00