> ## Documentation Index
>
> Fetch the complete documentation index at: https://hyperbrowser.ai/docs/llms.txt
>
> Use this file to discover all available pages before exploring further.

# Overview

> Use AI agents to easily automate browser tasks and create agentic workflows

Hyperbrowser lets you run AI-driven browser agents in managed cloud sessions. Whether you prefer open-source frameworks or model-native computer-use agents, you can start a task with a single API call and watch it execute live.

All agents share the same operational model: start a task, optionally poll for status, and fetch the final result. Use SDK helpers like `startAndWait()` for a simple blocking workflow, or the async pattern for full control.

## Which agent should I use?

* **Browser‑Use**: Open-source, fast, and efficient framework for browser automation with strong defaults and great performance. Interacts with the DOM using Playwright.
* **Claude Computer Use**: Anthropic’s Claude with full computer-use capabilities for robust, reliable real‑world interaction. Has a wide array of actions and computer tools to choose from.
* **OpenAI CUA**: OpenAI’s Computer‑Using Agent (Operator tech) for task execution across web UIs. Usually slower, but can be more reliable and accurate.
* **Gemini Computer Use**: Google’s Gemini with computer use for fast agentic tasks. Has broader tools that can combine multiple actions into a single step.
* **HyperAgent**: Our open-source, Playwright‑powered agent framework designed for control and extensibility.
* **Stagehand**: Run Stagehand on Hyperbrowser-managed browsers. Attach via CDP, reuse stealth sessions, and monitor tasks live without self-hosting Chrome.

## Quickstart

The simplest way to run any agent is to call its `startAndWait()` method from our SDKs.
Here’s a representative example using Browser‑Use:

```typescript Node.js
import { Hyperbrowser } from "@hyperbrowser/sdk";
import { config } from "dotenv";

// Load HYPERBROWSER_API_KEY from a local .env file.
config();

const client = new Hyperbrowser({
  apiKey: process.env.HYPERBROWSER_API_KEY,
});

async function main() {
  // Start the task and block until the agent finishes.
  const result = await client.agents.browserUse.startAndWait({
    task: "Go to Hacker News and tell me the title of the top post",
    llm: "gemini-2.0-flash",
    maxSteps: 20,
  });

  console.log(`Output:\n${result.data?.finalResult}`);
}

main().catch((err) => {
  console.error(`Error: ${err.message}`);
});
```

```python Python
import os

from dotenv import load_dotenv
from hyperbrowser import Hyperbrowser
from hyperbrowser.models import StartBrowserUseTaskParams

# Load HYPERBROWSER_API_KEY from a local .env file.
load_dotenv()

client = Hyperbrowser(api_key=os.getenv("HYPERBROWSER_API_KEY"))

# Start the task and block until the agent finishes.
result = client.agents.browser_use.start_and_wait(
    params=StartBrowserUseTaskParams(
        task="Go to Hacker News and tell me the title of the top post",
        llm="gemini-2.0-flash",
        max_steps=20,
    )
)

print(f"Output:\n{result.data.final_result}")
```

Switch the agent family by swapping the SDK path, e.g. `client.agents.claudeComputerUse.startAndWait(...)`, `client.agents.cua.startAndWait(...)`, `client.agents.geminiComputerUse.startAndWait(...)`, or `client.agents.hyperAgent.startAndWait(...)`.

## Best practices

* Be explicit about the goal and constraints. Prefer “go to example.com, open pricing, extract Enterprise monthly price” to vague prompts.
* Simple tasks typically succeed within 10–20 steps; complex multi‑page flows may need 50+. Monitor failures and adjust `maxSteps` and `maxFailures`.
* Create a session once, pass `sessionId` to successive tasks, and set `keepBrowserOpen: true` where you need continuity.
* Set `useCustomApiKeys: true` and provide your own API keys to route calls through your own organization.

For best results with AI agents, we highly recommend keeping the default screen configuration of 1280 x 720.
The models can behave poorly with larger screen sizes.

## Explore agents

* [Browser‑Use](/agents/browser-use)
* [Claude Computer Use](/agents/claude-computer-use)
* [OpenAI CUA](/agents/openai-cua)
* [Gemini Computer Use](/agents/gemini-computer-use)
* [HyperAgent](/agents/hyperagent)
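
## Async pattern sketch

The overview describes the shared operational model as: start a task, optionally poll for status, and fetch the final result. The following is a minimal, SDK-agnostic sketch of that loop, not the SDK's own implementation. The `start`, `get_status`, and `get_result` callables, the `run_task` helper, and the status names in `TERMINAL_STATUSES` are all hypothetical placeholders; substitute whatever start, status, and fetch calls and status values the API reference documents for your agent.

```python
import time
from typing import Any, Callable

# Assumed terminal status names for this sketch; check the API reference.
TERMINAL_STATUSES = {"completed", "failed", "stopped"}


def run_task(
    start: Callable[[], str],
    get_status: Callable[[str], str],
    get_result: Callable[[str], Any],
    poll_interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Start a task, poll until it reaches a terminal status, then fetch the result."""
    job_id = start()
    deadline = time.monotonic() + timeout
    while True:
        status = get_status(job_id)
        if status in TERMINAL_STATUSES:
            # The result is fetched for any terminal status, including failures,
            # so the caller can inspect what happened.
            return get_result(job_id)
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {job_id} still {status!r} after {timeout}s")
        sleep(poll_interval)


if __name__ == "__main__":
    # Stub callables stand in for real SDK calls so the sketch runs offline.
    fake_statuses = iter(["pending", "running", "completed"])
    outcome = run_task(
        start=lambda: "job-123",
        get_status=lambda job_id: next(fake_statuses),
        get_result=lambda job_id: {"job_id": job_id, "final_result": "stubbed"},
        poll_interval=0.0,
    )
    print(outcome)
```

Passing the three calls in as functions keeps the loop independent of any one agent family, so the same helper could wrap any of the agents listed above. For most cases the blocking `startAndWait()` helper remains the simpler choice; a hand-written loop like this is mainly useful when you want your own timeout, logging, or cancellation behavior.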