All agents share the same operational model: start a task, optionally poll for
status, and fetch the final result. Use our SDK helpers like
startAndWait()
for a simple blocking workflow, or the async pattern for full control.Which agent should I use?
Browser-Use
Open-source, fast, and efficient framework for browser automation with
strong defaults and great performance. Interacts with the dom using
Playwright.
Claude Computer Use
Anthropic’s Claude with full computer-use capabilities for robust, reliable
real‑world interaction. Has wide array of actions and computer tools that it
can choose from.
OpenAI CUA
OpenAI’s Computer‑Using Agent (Operator tech) for task execution across the
web UI. Usually slower but can be more reliable/accurate.
Gemini Computer Use
Google’s Gemini with computer-use for fast agentic tasks. Has more broad
tools that can incorporate multiple actions into a single step.
HyperAgent
Our open-source, Playwright‑powered agent framework designed for control and
extensibility.
Stagehand
Run Stagehand on Hyperbrowser-managed browsers. Attach via CDP, reuse
stealth sessions, and monitor tasks live without self-hosting Chrome.
Quickstart
The simplest way to run any agent is to call itsstartAndWait() method from our SDKs. Here’s a representative example using Browser‑Use:
Best practices
Write clear, specific tasks
Write clear, specific tasks
Be explicit about the goal and constraints. Prefer “go to example.com, open
pricing, extract Enterprise monthly price” to vague prompts.
Tune maxSteps and failure budgets
Tune maxSteps and failure budgets
Simple tasks typically succeed within 10–20 steps; complex multi‑page flows
may need 50+. Monitor failures and adjust
maxSteps and maxFailures.Reuse sessions for multi‑step workflows
Reuse sessions for multi‑step workflows
Create a session once, pass
sessionId to successive tasks, and set
keepBrowserOpen: true where you need continuity.Bring your own LLM keys when desired
Bring your own LLM keys when desired
Set
useCustomApiKeys: true and provider your own API Keys to pass calls to
your own organization.For best results with AI Agents, it is highly recommended to keep the default
screen configuration of 1280 x 720. The models can behave poorly with larger
screen sizes.