BrowserWing Executor API
Overview
BrowserWing Executor provides comprehensive browser automation capabilities through HTTP APIs. You can control browser navigation, interact with page elements, extract data, and analyze page structure.
Configuration
API Base URL: The BrowserWing Executor API address is configurable via environment variable.
Environment Variable:
BROWSERWING_EXECUTOR_URLDefault Value:
http://127.0.0.1:8080How to get the URL: Read from environment variable
$BROWSERWING_EXECUTOR_URL, if not set, use defaulthttp://127.0.0.1:8080
Base URL Format: ${BROWSERWING_EXECUTOR_URL}/api/v1/executor or http://127.0.0.1:8080/api/v1/executor (if env var not set)
Authentication: Use X-BrowserWing-Key: <api-key> header or Authorization: Bearer <token> if required.
Important: Always construct the API URL by reading the environment variable first. In shell commands, use: ${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}
Core Capabilities
Page Navigation: Navigate to URLs, go back/forward, reload
Element Interaction: Click, type, select, hover on page elements
Data Extraction: Extract text, attributes, values from elements
Accessibility Analysis: Get accessibility snapshot to understand page structure
Advanced Operations: Screenshot, JavaScript execution, keyboard input
Batch Processing: Execute multiple operations in sequence
API Endpoints
1. Discover Available Commands
IMPORTANT: Always call this endpoint first to see all available commands and their parameters.
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X GET "${EXECUTOR_URL}/api/v1/executor/help"
Response: Returns complete list of all commands with parameters, examples, and usage guidelines.
Query specific command:
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X GET "${EXECUTOR_URL}/api/v1/executor/help?command=extract"
2. Get Accessibility Snapshot
CRITICAL: Always call this after navigation to understand page structure and get element RefIDs.
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X GET "${EXECUTOR_URL}/api/v1/executor/snapshot"
Response Example:
{
"success": true,
"snapshot_text": "Clickable Elements:\n @e1 Login (role: button)\n @e2 Sign Up (role: link)\n\nInput Elements:\n @e3 Email (role: textbox) [placeholder: [email protected]]\n @e4 Password (role: textbox)"
}
Use Cases:
Understand what interactive elements are on the page
Get element RefIDs (@e1, @e2, etc.) for precise identification
See element labels, roles, and attributes
The accessibility tree is cleaner than raw DOM and better for LLMs
RefIDs are stable references that work reliably across page changes
3. Common Operations
Note: All examples below use EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}" to read the API address from environment variable, with http://127.0.0.1:8080 as fallback default.
Navigate to URL
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/navigate" \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com"}'
Click Element
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/click" \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e1"}'
Identifier formats:
RefID (Recommended):
@e1,@e2(from snapshot)CSS Selector:
#button-id,.class-nameXPath:
//button[@type='submit']Text:
Login(text content)
Type Text
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/type" \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "[email protected]"}'
Extract Data
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/extract" \
-H 'Content-Type: application/json' \
-d '{
"selector": ".product-item",
"fields": ["text", "href"],
"multiple": true
}'
Wait for Element
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/wait" \
-H 'Content-Type: application/json' \
-d '{"identifier": ".loading", "state": "hidden", "timeout": 10}'
Batch Operations
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/batch" \
-H 'Content-Type: application/json' \
-d '{
"operations": [
{"type": "navigate", "params": {"url": "https://example.com"}, "stop_on_error": true},
{"type": "click", "params": {"identifier": "@e1"}, "stop_on_error": true},
{"type": "type", "params": {"identifier": "@e3", "text": "query"}, "stop_on_error": true}
]
}'
Instructions
Step-by-step workflow:
Get API URL: First, read the API base URL from environment variable
$BROWSERWING_EXECUTOR_URL. If not set, use defaulthttp://127.0.0.1:8080. In shell commands, use:EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"Discover commands: Call
GET /helpto see all available operations and their parameters (do this first if unsure).Navigate: Use
POST /navigateto open the target webpage.Analyze page: Call
GET /snapshotto understand page structure and get element RefIDs.Interact: Use element RefIDs (like
@e1,@e2) or CSS selectors to:- Click elements:
POST /click - Input text:
POST /type - Select options:
POST /select - Wait for elements:
POST /wait
- Click elements:
Extract data: Use
POST /extractto get information from the page.Present results: Format and show extracted data to the user.
Complete Example
User Request: "Search for 'laptop' on example.com and get the first 5 results"
Your Actions:
- Navigate to search page:
curl -X POST 'http://127.0.0.1:18085/api/v1/executor/navigate' \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com/search"}'
- Get page structure to find search input:
curl -X GET 'http://127.0.0.1:18085/api/v1/executor/snapshot'
Response shows: @e3 Search (role: textbox) [placeholder: Search...]
- Type search query:
curl -X POST 'http://127.0.0.1:18085/api/v1/executor/type' \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "laptop"}'
- Press Enter to submit:
curl -X POST 'http://127.0.0.1:18085/api/v1/executor/press-key' \
-H 'Content-Type: application/json' \
-d '{"key": "Enter"}'
- Wait for results to load:
curl -X POST 'http://127.0.0.1:18085/api/v1/executor/wait' \
-H 'Content-Type: application/json' \
-d '{"identifier": ".search-results", "state": "visible", "timeout": 10}'
- Extract search results:
curl -X POST 'http://127.0.0.1:18085/api/v1/executor/extract' \
-H 'Content-Type: application/json' \
-d '{
"selector": ".result-item",
"fields": ["text", "href"],
"multiple": true
}'
- Present the extracted data:
Found 15 results for 'laptop':
1. Gaming Laptop - $1299 (https://...)
2. Business Laptop - $899 (https://...)
...
Key Commands Reference
Navigation
POST /navigate- Navigate to URLPOST /go-back- Go back in historyPOST /go-forward- Go forward in historyPOST /reload- Reload current page
Element Interaction
POST /click- Click element (supports: RefID@e1, CSS selector, XPath, text content)POST /type- Type text into input (supports: RefID@e3, CSS selector, XPath)POST /select- Select dropdown optionPOST /hover- Hover over elementPOST /wait- Wait for element state (visible, hidden, enabled)POST /press-key- Press keyboard key (Enter, Tab, Ctrl+S, etc.)
Data Extraction
POST /extract- Extract data from elements (supports multiple elements, custom fields)POST /get-text- Get element text contentPOST /get-value- Get input element valueGET /page-info- Get page URL and titleGET /page-text- Get all page textGET /page-content- Get full HTML
Page Analysis
GET /snapshot- Get accessibility snapshot (⭐ ALWAYS call after navigation)GET /clickable-elements- Get all clickable elementsGET /input-elements- Get all input elements
Advanced
POST /screenshot- Take page screenshot (base64 encoded)POST /evaluate- Execute JavaScript codePOST /batch- Execute multiple operations in sequencePOST /scroll-to-bottom- Scroll to page bottomPOST /resize- Resize browser windowPOST /tabs- Manage browser tabs (list, new, switch, close)POST /fill-form- Intelligently fill multiple form fields at once
Debug & Monitoring
GET /console-messages- Get browser console messages (logs, warnings, errors)GET /network-requests- Get network requests made by the pagePOST /handle-dialog- Configure JavaScript dialog (alert, confirm, prompt) handlingPOST /file-upload- Upload files to input elementsPOST /drag- Drag and drop elementsPOST /close-page- Close the current page/tab
Element Identification
You can identify elements using:
RefID (Recommended):
@e1,@e2,@e3- Most reliable method - stable across page changes
- Get RefIDs from
/snapshotendpoint - Valid for 5 minutes after snapshot
- Example:
"identifier": "@e1" - Works with multi-strategy fallback for robustness
CSS Selector:
#id,.class,button[type="submit"]- Standard CSS selectors
- Example:
"identifier": "#login-button"
XPath:
//button[@id='login'],//a[contains(text(), 'Submit')]- XPath expressions for complex queries
- Example:
"identifier": "//button[@id='login']"
Text Content:
Login,Sign Up,Submit- Searches buttons and links with matching text
- Example:
"identifier": "Login"
ARIA Label: Elements with
aria-labelattribute- Automatically searched
Guidelines
Before starting:
Get API URL first: Read from
$BROWSERWING_EXECUTOR_URLenvironment variable, or usehttp://127.0.0.1:8080as defaultCall
GET /helpif you're unsure about available commands or their parametersEnsure browser is started (if not, it will auto-start on first operation)
During automation:
Always call
/snapshotafter navigation to get page structure and RefIDsPrefer RefIDs (like
@e1) over CSS selectors for reliability and stabilityRe-snapshot after page changes to get updated RefIDs
Use
/waitfor dynamic content that loads asynchronouslyCheck element states before interaction (visible, enabled)
Use
/batchfor multiple sequential operations to improve efficiency
Error handling:
If operation fails, check element identifier and try different format
For timeout errors, increase timeout value
If element not found, call
/snapshotagain to refresh page structureExplain errors clearly to user with suggested solutions
Data extraction:
Use
fieldsparameter to specify what to extract:["text", "href", "src"]Set
multiple: trueto extract from multiple elementsFormat extracted data in a readable way for user
Complete Workflow Example
Scenario: User wants to login to a website
User: "Please log in to example.com with username 'john' and password 'secret123'"
Your Actions:
Step 1: Navigate to login page
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/navigate" \
-H 'Content-Type: application/json' \
-d '{"url": "https://example.com/login"}'
Step 2: Get page structure
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X GET "${EXECUTOR_URL}/api/v1/executor/snapshot"
Response:
Clickable Elements:
@e1 Login (role: button)
Input Elements:
@e2 Username (role: textbox)
@e3 Password (role: textbox)
Step 3: Enter username
POST http://127.0.0.1:18085/api/v1/executor/type
{"identifier": "@e2", "text": "john"}
Step 4: Enter password
POST http://127.0.0.1:18085/api/v1/executor/type
{"identifier": "@e3", "text": "secret123"}
Step 5: Click login button
POST http://127.0.0.1:18085/api/v1/executor/click
{"identifier": "@e1"}
Step 6: Wait for login success (optional)
POST http://127.0.0.1:18085/api/v1/executor/wait
{"identifier": ".welcome-message", "state": "visible", "timeout": 10}
Step 7: Inform user
"Successfully logged in to example.com!"
Batch Operation Example
Scenario: Fill out a form with multiple fields
Instead of making 5 separate API calls, use one batch operation:
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
curl -X POST "${EXECUTOR_URL}/api/v1/executor/batch" \
-H 'Content-Type: application/json' \
-d '{
"operations": [
{
"type": "navigate",
"params": {"url": "https://example.com/form"},
"stop_on_error": true
},
{
"type": "type",
"params": {"identifier": "#name", "text": "John Doe"},
"stop_on_error": true
},
{
"type": "type",
"params": {"identifier": "#email", "text": "[email protected]"},
"stop_on_error": true
},
{
"type": "select",
"params": {"identifier": "#country", "value": "United States"},
"stop_on_error": true
},
{
"type": "click",
"params": {"identifier": "#submit"},
"stop_on_error": true
}
]
}'
Best Practices
Discovery first: If unsure, call
/helpor/help?command=<name>to learn about commandsStructure first: Always call
/snapshotafter navigation to understand the pageUse accessibility indices: They're more reliable than CSS selectors (elements might have dynamic classes)
Wait for dynamic content: Use
/waitbefore interacting with elements that load asynchronouslyBatch when possible: Use
/batchfor multiple sequential operationsHandle errors gracefully: Provide clear explanations and suggestions when operations fail
Verify results: After operations, check if desired outcome was achieved
Common Scenarios
Form Filling
Navigate to form page
Get accessibility snapshot to find input elements and their RefIDs
Use
/typefor each field:@e1,@e2, etc.Use
/selectfor dropdownsClick submit button using its RefID
Data Scraping
Navigate to target page
Wait for content to load with
/waitUse
/extractwith CSS selector andmultiple: trueSpecify fields to extract:
["text", "href", "src"]
Search Operations
Navigate to search page
Get accessibility snapshot to locate search input
Type search query into input
Press Enter or click search button
Wait for results
Extract results data
Login Automation
Navigate to login page
Get accessibility snapshot to find RefIDs
Type username:
@e2Type password:
@e3Click login button:
@e1Wait for success indicator
Important Notes
Browser must be running (it will auto-start on first operation if needed)
Operations are executed on the currently active browser tab
Accessibility snapshot updates after each navigation and click operation
All timeouts are in seconds
Use
wait_visible: true(default) for reliable element interactionAPI address: Always read from
$BROWSERWING_EXECUTOR_URLenvironment variable, fallback tohttp://127.0.0.1:8080if not setAuthentication required: use
X-BrowserWing-Keyheader or JWT token if configured
Troubleshooting
Element not found:
Call
/snapshotto see available elementsTry different identifier format (accessibility index, CSS selector, text)
Check if page has finished loading
Timeout errors:
Increase timeout value in request
Check if element actually appears on page
Use
/waitwith appropriate state before interaction
Extraction returns empty:
Verify CSS selector matches target elements
Check if content has loaded (use
/waitfirst)Try different extraction fields or type
Quick Reference
EXECUTOR_URL="${BROWSERWING_EXECUTOR_URL:-http://127.0.0.1:8080}"
# Discover commands
curl -X GET "${EXECUTOR_URL}/api/v1/executor/help"
# Navigate
curl -X POST "${EXECUTOR_URL}/api/v1/executor/navigate" \
-H 'Content-Type: application/json' \
-d '{"url": "..."}'
# Get page structure
curl -X GET "${EXECUTOR_URL}/api/v1/executor/snapshot"
# Click element
curl -X POST "${EXECUTOR_URL}/api/v1/executor/click" \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e1"}'
# Type text
curl -X POST "${EXECUTOR_URL}/api/v1/executor/type" \
-H 'Content-Type: application/json' \
-d '{"identifier": "@e3", "text": "..."}'
# Extract data
curl -X POST "${EXECUTOR_URL}/api/v1/executor/extract" \
-H 'Content-Type: application/json' \
-d '{"selector": "...", "fields": [...], "multiple": true}'
Response Format
All operations return:
{
"success": true,
"message": "Operation description",
"timestamp": "2026-01-15T10:30:00Z",
"data": {
// Operation-specific data
}
}
Error response:
{
"error": "error.operationFailed",
"detail": "Detailed error message"
}