Agent Browser Skill (Core)
Purpose
Provide an advanced, production-ready playbook for using agent-browser to automate web tasks via CLI and structured commands.
Best fit
You need deterministic automation for AI agents.
You want compact snapshots with refs and JSON output.
You prefer a fast CLI with Node.js fallback.
Not a fit
You require a full SDK or custom JS integration.
You must stream large uploads or complex media workflows.
Quick orientation
Read references/agent-browser-overview.md for install, architecture, and core concepts.
Read references/agent-browser-command-map.md for command categories and flags.
Read references/agent-browser-safety.md for high-risk controls and safe mode rules.
Read references/agent-browser-workflows.md for recommended AI workflows.
Read references/agent-browser-troubleshooting.md for common issues and fixes.
Required inputs
Installed agent-browser CLI and browser runtime.
Target URLs and workflow steps.
Session or profile strategy if authentication is required.
Expected output
A clear command sequence and operational guardrails for automation.
Operational notes
Snapshot early, act via refs, then snapshot again after DOM changes.
Use --json for machine parsing and scripting.
Use waits and load-state checks before actions.
Close tabs or sessions when done to release resources.
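The snapshot, act, snapshot rhythm described in these notes can be sketched as a small plan builder. This is a minimal illustration that only constructs command lines and does not run them. The executable name, the subcommand names (open, snapshot, click, close), and the @e2 ref syntax are assumptions for illustration; confirm them against references/agent-browser-command-map.md before use.

```python
import shlex

CLI = "agent-browser"  # assumed executable name; adjust to your install


def build_plan(url, actions):
    """Build argv lists for a snapshot -> act -> snapshot loop.

    `actions` is a list of argv tails such as ["click", "@e2"]. The
    subcommand names and the @e2 ref syntax are assumptions, not a
    confirmed description of the CLI.
    """
    plan = [
        [CLI, "open", url],
        [CLI, "snapshot", "--json"],  # snapshot early to obtain refs
    ]
    for action in actions:
        plan.append([CLI, *action])
        # Refs may be stale once the DOM changes, so snapshot again.
        plan.append([CLI, "snapshot", "--json"])
    plan.append([CLI, "close"])  # release the browser when done
    return plan


if __name__ == "__main__":
    for argv in build_plan("https://example.com", [["click", "@e2"]]):
        print(shlex.join(argv))
```

Keeping the plan as data makes it easy to review or log a sequence before anything touches a browser.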
Safe mode defaults
Do not use eval, --allow-file-access, custom --executable-path, or arbitrary --args without explicit approval.
Avoid network route, set credentials, and cookie/storage mutations unless the task requires them.
Allowlist domains and block localhost or private network targets.
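One way to enforce these defaults is a pre-flight check that runs before any command is issued. The sketch below is an assumption-laden illustration, not part of agent-browser: ALLOWED_DOMAINS, is_allowed_target, and flags_needing_approval are hypothetical names, and the check inspects only the URL text, so it does not catch a public hostname that resolves to a private address.

```python
import ipaddress
from urllib.parse import urlsplit

ALLOWED_DOMAINS = {"example.com"}  # hypothetical allowlist for illustration
BLOCKED_TOKENS = {"eval", "--allow-file-access", "--executable-path", "--args"}


def is_allowed_target(url, allowed=ALLOWED_DOMAINS):
    """Return True only for http(s) URLs on an allowlisted domain."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    if parts.scheme not in ("http", "https") or not host:
        return False
    if host == "localhost" or host.endswith(".localhost"):
        return False
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        ip = None  # not an IP literal, treat as a hostname
    if ip is not None and (ip.is_private or ip.is_loopback or ip.is_link_local):
        return False
    # Accept the domain itself or any subdomain of it.
    return any(host == d or host.endswith("." + d) for d in allowed)


def flags_needing_approval(argv):
    """Return the tokens in argv that safe mode says need explicit approval."""
    return [tok for tok in argv if tok.split("=", 1)[0] in BLOCKED_TOKENS]


if __name__ == "__main__":
    print(is_allowed_target("https://app.example.com/login"))
    print(is_allowed_target("http://192.168.1.5/admin"))
    print(flags_needing_approval(["agent-browser", "open", "--allow-file-access"]))
```

A caller would refuse to run any command whose target fails the first check or whose argv returns a non-empty list from the second, unless approval has been recorded.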
Security notes
Treat tokens and credentials as secrets.
Avoid --allow-file-access unless explicitly required.
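Treating tokens as secrets includes keeping them out of logs and transcripts. As a minimal sketch, a wrapper could redact obvious key=value secrets before a command line is written anywhere. The redact helper and the list of key names below are hypothetical, and the pattern is deliberately simple, so it will miss secrets that appear in other forms.

```python
import re

# Hypothetical key names; extend to match the secrets your workflow handles.
_SECRET_PATTERN = re.compile(
    r"(?i)\b(token|password|secret|api[_-]?key)=([^\s&]+)"
)


def redact(text):
    """Replace the value of key=value secrets with *** before logging."""
    return _SECRET_PATTERN.sub(lambda m: f"{m.group(1)}=***", text)


if __name__ == "__main__":
    print(redact("open https://example.com/?token=abc123&x=1"))
```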