Browser Tool

The browser tool provides headless browser automation via Playwright, enabling agents to navigate websites, interact with pages, capture screenshots, and extract accessibility snapshots.

Architecture

The browser tool runs Playwright's Chromium engine in headless mode. Sensitive operations (screenshots, form interactions) are gated through the security supervisor before a result is returned.

Agent → browser tool call
  → Launch headless Chromium
  → Navigate to URL / Perform action
  → Security gate (screenshots, form data)
  → Return result (screenshot base64, snapshot text, or action confirmation)
  → Cleanup browser context
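
The flow above can be sketched with Playwright's Python sync API. This is a minimal illustration under stated assumptions, not the tool's actual implementation: `capture_page`, `security_gate`, and `encode_screenshot` are hypothetical names, and the gate shown here is a stub that always approves.

```python
import base64
from typing import Callable


def encode_screenshot(image_bytes: bytes) -> str:
    """Encode raw screenshot bytes as base64 text for the tool result."""
    return base64.b64encode(image_bytes).decode("ascii")


def security_gate(action: str, payload: bytes) -> bool:
    """Hypothetical stand-in for the security supervisor; always approves."""
    return True


def capture_page(url: str, gate: Callable[[str, bytes], bool] = security_gate) -> str:
    """Launch headless Chromium, navigate, gate the screenshot, then clean up."""
    # Imported lazily so the helpers above work without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        context = browser.new_context()
        try:
            page = context.new_page()
            page.set_default_timeout(30_000)  # Playwright timeouts are in milliseconds
            page.goto(url)
            png = page.screenshot(full_page=True, type="png")
            if not gate("screenshot", png):
                raise PermissionError("screenshot blocked by security gate")
            return encode_screenshot(png)
        finally:
            # Cleanup runs whether the action succeeded or failed.
            context.close()
            browser.close()
```

Closing the context in a `finally` block mirrors the last step of the flow: cleanup happens even when navigation times out or the gate rejects the result.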

Available Actions

Action     | Description                    | Parameters
navigate   | Navigate to a URL              | url
click      | Click an element               | target (selector or description)
type       | Type text into an input        | target, text
screenshot | Capture page screenshot        | fullPage (boolean), format (png/jpeg)
snapshot   | Capture accessibility snapshot | target (optional element selector)
evaluate   | Execute JavaScript on page     | code (function string)
wait       | Wait for text or time          | time (seconds) or text
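
As an illustration of how a caller might check a tool call against the parameters listed above, here is a small validation sketch. The schema shape and the `validate_call` helper are hypothetical. Treating the `screenshot` parameters as optional, and requiring exactly one of `time` or `text` for `wait`, are assumptions that the source does not state.

```python
# Parameter names come from the actions list; required/optional split is assumed.
ACTIONS = {
    "navigate":   {"required": {"url"}, "optional": set()},
    "click":      {"required": {"target"}, "optional": set()},
    "type":       {"required": {"target", "text"}, "optional": set()},
    "screenshot": {"required": set(), "optional": {"fullPage", "format"}},
    "snapshot":   {"required": set(), "optional": {"target"}},
    "evaluate":   {"required": {"code"}, "optional": set()},
    "wait":       {"required": set(), "optional": {"time", "text"}},
}


def validate_call(action: str, params: dict) -> list[str]:
    """Return a list of problems with a tool call; an empty list means valid."""
    spec = ACTIONS.get(action)
    if spec is None:
        return [f"unknown action: {action}"]
    problems = []
    given = set(params)
    for name in sorted(spec["required"] - given):
        problems.append(f"missing parameter: {name}")
    for name in sorted(given - spec["required"] - spec["optional"]):
        problems.append(f"unexpected parameter: {name}")
    # Assumed rule: wait takes either a duration or a text to wait for, not both.
    if action == "wait" and len(given & {"time", "text"}) != 1:
        problems.append("wait needs exactly one of: time, text")
    if action == "screenshot" and params.get("format", "png") not in ("png", "jpeg"):
        problems.append("format must be png or jpeg")
    return problems
```

For example, `validate_call("type", {"target": "#q"})` reports the missing `text` parameter, while `validate_call("navigate", {"url": "https://example.com"})` returns an empty list.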

Security

The browser tool triggers the security supervisor for:

  • Screenshots — may capture sensitive UI content (credentials, personal data)
  • Snapshots — accessibility tree may include sensitive form labels or data
  • Evaluate — arbitrary JavaScript execution (RCE-equivalent, requires explicit policy approval)

Each action goes through:

  1. Policy validation (regex allow/deny rules)
  2. LLM security supervisor review (for sensitive actions)
  3. Human approval if needed
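
The three stages above could be chained roughly as follows. This is a sketch under assumptions, not the supervisor's real interface: the rule patterns, the `"action:argument"` subject format, the allow-over-deny precedence, and the `"approve"` / `"deny"` / `"escalate"` verdicts are all hypothetical.

```python
import re
from typing import Callable

# Hypothetical rules, written as regexes over "action:argument" strings.
DENY_RULES = [re.compile(r"^navigate:file://"), re.compile(r"^evaluate:")]
ALLOW_RULES = [re.compile(r"^evaluate:\(\) => document\.title$")]
SENSITIVE_ACTIONS = {"screenshot", "snapshot", "evaluate"}


def policy_check(action: str, argument: str) -> bool:
    """Stage 1: an explicit allow rule wins; otherwise any deny rule blocks."""
    subject = f"{action}:{argument}"
    if any(rule.search(subject) for rule in ALLOW_RULES):
        return True
    return not any(rule.search(subject) for rule in DENY_RULES)


def gate(
    action: str,
    argument: str,
    supervisor: Callable[[str, str], str],
    ask_human: Callable[[str, str], bool],
) -> str:
    """Run policy validation, supervisor review, and human approval in order."""
    if not policy_check(action, argument):
        return "denied"
    if action in SENSITIVE_ACTIONS:
        # Stage 2: the supervisor returns "approve", "deny", or "escalate".
        verdict = supervisor(action, argument)
        if verdict == "deny":
            return "denied"
        # Stage 3: a human is asked only when the supervisor escalates.
        if verdict == "escalate" and not ask_human(action, argument):
            return "denied"
    return "approved"
```

With these example rules, `evaluate` is denied by default and passes only for the one explicitly allowed snippet, which matches the requirement that arbitrary JavaScript needs explicit policy approval.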

Configuration

The browser tool uses a 30-second default timeout for all operations. No additional configuration is required — it uses the system's Playwright installation.

Example Usage

Agent: I need to check the CortexPrism GitHub releases page.

Tool call: browser.navigate("https://github.com/CortexPrism/cortex/releases")
Result: Navigated to https://github.com/CortexPrism/cortex/releases

Tool call: browser.snapshot()
Result: [Accessibility tree showing release list with versions and dates]

Agent: The latest release is v0.53.0 from June 24, 2026...

Capabilities

The browser tool requires these capabilities from the CAP_BROWSER capability group:

  • network:fetch — for page navigation
  • computer:screenshot — screenshot capture
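
A host could verify these capabilities before enabling the tool. The check below is only a sketch: the source does not describe how capabilities are granted, so the `missing_capabilities` helper and the set-of-strings representation are assumptions.

```python
# Capability names are taken from the list above.
CAP_BROWSER = frozenset({"network:fetch", "computer:screenshot"})


def missing_capabilities(granted: set[str]) -> list[str]:
    """Return the CAP_BROWSER capabilities absent from the granted set, sorted."""
    return sorted(CAP_BROWSER - set(granted))
```

An empty result means the agent holds everything in the group; otherwise the list names what still has to be granted.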

See Also