OSWorld: A real computer environment for multimodal agents to evaluate open-ended computer tasks
cli
llm
natural-language-processing
artificial-intelligence
language-model
reinforcement-learning
gui
multimodal
agent
benchmark
code-generation
large-action-model
rpa
vlm
Updated 2024-04-29 12:26:03 +03:00
Docker image for LLaVA: Large Language and Vision Assistant
docker
ai
llm
chatbot
docker-image
chatgpt
gpt-4
llama
llama-2
llama2
llava
instruction-tuning
runpod
vision-language-model
visual-language-learning
foundation-models
multimodal
Updated 2023-10-23 00:48:12 +03:00