diff --git a/README.md b/README.md index 01bffcf..1f366f4 100644 --- a/README.md +++ b/README.md @@ -134,24 +134,27 @@ Tell your computer what to do, and it gets it done. ## Roadmap -- [ ] Improve memory management -- [ ] Enhance planning capabilities -- [ ] Improve self-correction -- [ ] Fine-tune the model for better performance +### Agent +- [ ] Improve agent memory (summarize, compress, RAG, etc.) +- [ ] Enhance planning capabilities (load website specific context) +- [ ] Reduce token consumption (system prompt, DOM state) + +### DOM Extraction +- [ ] Improve extraction for datepickers, dropdowns, special elements +- [ ] Improve state representation for UI elements + +### Rerunning tasks +- [ ] LLM as fallback +- [ ] Make it easy to define workfows templates where LLM fills in the details +- [ ] Return playwright script from the agent + +### Datasets - [ ] Create datasets for complex tasks -- [ ] Sandbox browser-use for specific websites -- [ ] Implement deterministic script rerun with LLM fallback -- [ ] Cloud-hosted version -- [ ] Add stop/pause functionality -- [ ] Improve authentication handling -- [ ] Reduce token consumption -- [ ] Implement long-term memory -- [ ] Handle repetitive tasks reliably -- [ ] Third-party integrations (Slack, etc.) -- [ ] Include more interactive elements -- [ ] Human-in-the-loop execution - [ ] Benchmark various models against each other -- [ ] Let the user record a workflow and browser-use will execute it +- [ ] Fine-tuning models for specific tasks + +### User Experience +- [ ] Human-in-the-loop execution - [ ] Improve the generated GIF quality - [ ] Create various demos for tutorial execution, job application, QA testing, social media, etc.