Agentic Browser
Beats OpenAI Operator on WebVoyager
Three-agent orchestration loop on Playwright that handles multimodal inputs, downloads and parses PDFs, interacts with embedded video, and manages credentials securely through HashiCorp Vault. One-shot task completion where prior agents required multiple attempts.
- Three-agent loop: planner, actor, verifier
- Multimodal inputs — text, screenshots, embedded video
- HashiCorp Vault credential handling for real-world tasks
- Structured recovery on tool failure; no silent fallbacks
headline metric
WebVoyager (vs Operator 87)
stack
- Playwright
- Python
- HashiCorp Vault
- Custom tool schema