gitea-ailhan-registry

alihan/ OSWorld

OSWorld: A real computer environment for multimodal agents to evaluate open-ended computer tasks

cli llm natural-language-processing artificial-intelligence language-model reinforcement-learning gui multimodal agent benchmark code-generation large-action-model rpa vlm

Updated 2024-04-29 12:26:03 +03:00

alihan/ llava-docker

Docker image for LLaVA: Large Language and Vision Assistant

docker ai llm chatbot docker-image chatgpt gpt-4 llama llama-2 llama2 llava instruction-tuning runpod vision-language-model visual-language-learning foundation-models multimodal

Updated 2023-10-23 00:48:12 +03:00