Overview

Hyperbrowser lets you run powerful, AI-driven browser agents in managed cloud sessions. Whether you prefer open-source frameworks or cutting‑edge model-native agents, you can start tasks with a single API call and watch them execute live.

All agents share the same operational model: start a task, optionally poll for status, and fetch the final result. Use our SDK helpers like startAndWait() for a simple blocking workflow, or the async pattern for full control.

Which agent should I use?

Browser-Use

Open-source, fast, and efficient framework for browser automation with strong defaults and great performance. Interacts with the dom using Playwright.

Claude Computer Use

Anthropic’s Claude with full computer-use capabilities for robust, reliable real‑world interaction. Has wide array of actions and computer tools that it can choose from.

OpenAI CUA

OpenAI’s Computer‑Using Agent (Operator tech) for task execution across the web UI. Usually slower but can be more reliable/accurate.

Gemini Computer Use

Google’s Gemini with computer-use for fast agentic tasks. Has more broad tools that can incorporate multiple actions into a single step.

HyperAgent

Our open-source, Playwright‑powered agent framework designed for control and extensibility.

Stagehand

Run Stagehand on Hyperbrowser-managed browsers. Attach via CDP, reuse stealth sessions, and monitor tasks live without self-hosting Chrome.

Quickstart

The simplest way to run any agent is to call its startAndWait() method from our SDKs. Here’s a representative example using Browser‑Use:

import { Hyperbrowser } from "@hyperbrowser/sdk";
import { config } from "dotenv";

config();

const client = new Hyperbrowser({
  apiKey: process.env.HYPERBROWSER_API_KEY,
});

async function main() {
  const result = await client.agents.browserUse.startAndWait({
    task: "Go to Hacker News and tell me the title of the top post",
    llm: "gemini-2.0-flash",
    maxSteps: 20,
  });

  console.log(`Output:\n${result.data?.finalResult}`);
}

main().catch((err) => {
  console.error(`Error: ${err.message}`);
});

Switch the agent family by swapping the SDK path, e.g. client.agents.claudeComputerUse.startAndWait(...), client.agents.cua.startAndWait(...), client.agents.geminiComputerUse.startAndWait(...), or client.agents.hyperAgent.startAndWait(...).

Best practices

Write clear, specific tasks

Be explicit about the goal and constraints. Prefer “go to example.com, open pricing, extract Enterprise monthly price” to vague prompts.

Tune maxSteps and failure budgets

Simple tasks typically succeed within 10–20 steps; complex multi‑page flows may need 50+. Monitor failures and adjust maxSteps and maxFailures.

Reuse sessions for multi‑step workflows

Create a session once, pass sessionId to successive tasks, and set keepBrowserOpen: true where you need continuity.

Bring your own LLM keys when desired

Set useCustomApiKeys: true and provider your own API Keys to pass calls to your own organization.

For best results with AI Agents, it is highly recommended to keep the default screen configuration of 1280 x 720. The models can behave poorly with larger screen sizes.

Get Started

Browser Sessions

Session Configuration

Scraping

Agents

Web API

Integrations

Which agent should I use?

Browser-Use

Claude Computer Use

OpenAI CUA

Gemini Computer Use

HyperAgent

Stagehand

Quickstart

Best practices

Explore agents

Get Started

Browser Sessions

Session Configuration

Scraping

Agents

Web API

Integrations

​Which agent should I use?

Browser-Use

Claude Computer Use

OpenAI CUA

Gemini Computer Use

HyperAgent

Stagehand

​Quickstart

​Best practices

​Explore agents

Which agent should I use?

Quickstart

Best practices

Explore agents