Agent Browser Skill (Core)
Purpose
Provide an advanced, production-ready playbook for using agent-browser to automate web tasks via CLI and structured commands.
Best fit
- You need deterministic automation for AI agents.
- You want compact snapshots with refs and JSON output.
- You prefer a fast CLI with Node.js fallback.
Not a fit
- You require a full SDK or custom JS integration.
- You must stream large uploads or complex media workflows.
Quick orientation
- Read
references/agent-browser-overview.mdfor install, architecture, and core concepts. - Read
references/agent-browser-command-map.mdfor command categories and flags. - Read
references/agent-browser-safety.mdfor high-risk controls and safe mode rules. - Read
references/agent-browser-workflows.mdfor recommended AI workflows. - Read
references/agent-browser-troubleshooting.mdfor common issues and fixes.
Required inputs
- Installed agent-browser CLI and browser runtime.
- Target URLs and workflow steps.
- Session or profile strategy if authentication is required.
Expected output
- A clear command sequence and operational guardrails for automation.
Operational notes
- Snapshot early, act via refs, then snapshot again after DOM changes.
- Use
--jsonfor machine parsing and scripting. - Use waits and load-state checks before actions.
- Close tabs or sessions when done to release resources.
Safe mode defaults
- Do not use
eval,--allow-file-access, custom--executable-path, or arbitrary--argswithout explicit approval. - Avoid
network route,set credentials, and cookie/storage mutations unless the task requires it. - Allowlist domains and block localhost or private network targets.
Security notes
- Treat tokens and credentials as secrets.
- Avoid
--allow-file-accessunless explicitly required.