AI Agents

Explorbot uses specialized AI agents that each handle a specific part of the testing workflow. This separation keeps each agent focused and cost-efficient.

Agent Overview

flowchart LR
    A[Navigator] -- "goes to page" --> B[Researcher]
    B -- "analyzes UI" --> C[Planner]
    C -- "suggests tests" --> D[Tester]
    D -- "runs tests" --> A
    E[Pilot] -.->|supervises| D

Navigator Agent

Purpose: Handles all browser interactions — clicks, form fills, navigation.

What it does:

Executes CodeceptJS commands in the browser
Tries multiple locator strategies when selectors fail
Automatically resolves failed interactions without stopping
Remembers what worked (and what didn't) for next time

Why you'll love it:

No more ElementNotFound exceptions killing your test runs
Self-healing when your UI changes
Learns optimal selectors for your specific app

Commands that use Navigator:

/navigate <target>
I.click(), I.fillField(), I.amOnPage(), etc.
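
The fallback behavior described above can be pictured roughly as follows. This is a minimal sketch, not Explorbot's implementation: `rankLocators`, `tryLocators`, and `LocatorMemory` are hypothetical names, and the `attempt` callback stands in for a browser call such as I.click(locator). It is synchronous here only to keep the example small; real browser calls are asynchronous.

```typescript
// Hypothetical sketch: none of these names come from Explorbot's API.
type Locator = string;

// Success/failure counts per locator, standing in for "remembers what worked".
type LocatorMemory = Map<Locator, { ok: number; failed: number }>;

// Order candidates so locators with the best track record are tried first.
function rankLocators(candidates: Locator[], memory: LocatorMemory): Locator[] {
  const score = (l: Locator): number => {
    const m = memory.get(l);
    return m ? m.ok - m.failed : 0;
  };
  // Copy before sorting so the caller's array is left untouched.
  return candidates.slice().sort((a, b) => score(b) - score(a));
}

// Try each locator in turn, record the outcome, and stop at the first success.
function tryLocators(
  candidates: Locator[],
  memory: LocatorMemory,
  attempt: (locator: Locator) => boolean,
): Locator | null {
  for (const locator of rankLocators(candidates, memory)) {
    const entry = memory.get(locator) || { ok: 0, failed: 0 };
    const succeeded = attempt(locator);
    if (succeeded) entry.ok += 1;
    else entry.failed += 1;
    memory.set(locator, entry);
    if (succeeded) return locator;
  }
  return null; // every strategy failed; the caller decides what to do next
}
```

Because outcomes are written back into the memory map, a locator that worked once is tried first on the next call.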

Researcher Agent

Purpose: Analyzes pages to understand what's actually there.

What it does:

Discovers all interactive UI elements
Expands hidden content (accordions, dropdowns, modals)
Maps navigation paths and form structures
Extracts structured data from tables and lists
Filters out irrelevant elements (cookie banners, ads)

Why you'll love it:

Discovers UI elements you forgot existed
Gives you a complete picture of what's testable
Documents forms with all their validation rules
Configurable filtering to focus on what matters

Commands that use Researcher:

explorbot research /path (CLI)
/research [path] (TUI)
/research --deep — expand hidden elements
/research --screenshot — use vision model

See Researcher Agent for detailed configuration and usage.
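
As a rough illustration of the filtering step, the sketch below drops elements that match an exclusion list like the excludeSelectors option shown in the configuration section. The `UiElement` shape and `filterElements` function are hypothetical, and only plain ".class" selectors are handled; how Explorbot actually matches selectors is not described here.

```typescript
// Hypothetical element shape; Explorbot's real research output is not shown in this document.
interface UiElement {
  tag: string;
  classes: string[];
  label: string;
}

// Keep only elements that match none of the excluded class selectors.
// Only simple ".class" selectors are handled; anything else is ignored in this sketch.
function filterElements(elements: UiElement[], excludeSelectors: string[]): UiElement[] {
  const excludedClasses = excludeSelectors
    .filter((selector) => selector.startsWith('.'))
    .map((selector) => selector.slice(1));
  return elements.filter(
    (el) => !el.classes.some((cls) => excludedClasses.includes(cls)),
  );
}
```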

Planner Agent

Purpose: Generates test scenarios from research findings.

What it does:

Creates business-focused test scenarios
Assigns priority levels (critical/important/high/normal/low)
Generates expected outcomes for verification
Balances positive and negative test cases
Avoids duplicating existing scenarios
Cycles through planning styles (normal, psycho, curious) for comprehensive coverage

Why you'll love it:

Creates tests that matter, not just "click stuff"
Prioritizes by risk (critical flows first)
Different styles ensure broad coverage over multiple iterations
Fully customizable — add your own styles and page-specific rules

Commands that use Planner:

/plan [--focus <feature>]
/explore

See Planner Agent for detailed documentation on planning styles, customization, and configuration.
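
To make the style rotation and prioritization concrete, here is a minimal sketch. The style names and priority levels come from the list above; the round-robin rule, the assumption that priorities rank in the order listed, the title-based duplicate check, and the names `styleForIteration` and `mergeScenarios` are all illustrative guesses rather than Explorbot's actual logic.

```typescript
// Style names and priority levels are taken from the text; everything else is hypothetical.
const STYLES = ['normal', 'psycho', 'curious'] as const;
type Style = (typeof STYLES)[number];

// Assumed to rank from most to least urgent in the order listed.
const PRIORITIES = ['critical', 'important', 'high', 'normal', 'low'] as const;
type Priority = (typeof PRIORITIES)[number];

interface Scenario {
  title: string;
  priority: Priority;
}

// Round-robin: iteration 0 uses "normal", 1 uses "psycho", 2 uses "curious", then repeat.
function styleForIteration(iteration: number): Style {
  return STYLES[iteration % STYLES.length];
}

// Drop proposed scenarios whose title already exists, then order by priority.
function mergeScenarios(existing: Scenario[], proposed: Scenario[]): Scenario[] {
  const seen = new Set(existing.map((s) => s.title.toLowerCase()));
  const fresh = proposed.filter((s) => {
    const key = s.title.toLowerCase();
    if (seen.has(key)) return false;
    seen.add(key);
    return true;
  });
  return existing
    .concat(fresh)
    .sort((a, b) => PRIORITIES.indexOf(a.priority) - PRIORITIES.indexOf(b.priority));
}
```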

Tester Agent

Purpose: Executes the planned scenarios.

What it does:

Runs test scenarios step by step
Adapts when things don't go as expected
Tracks state changes during execution
Documents actual results vs. expected
Uses research context for smart decisions

Why you'll love it:

Handles unexpected modals and popups
Recovers from minor failures automatically
Produces detailed execution logs

Commands that use Tester:

/test [scenario]
/explore

Pilot Agent

Purpose: Supervises Tester and intervenes when tests get stuck.

What it does:

Maintains a separate conversation to track test progress over time
Detects stuck patterns (loops, repeated failures, no page changes)
Decides what context Tester needs (HTML, ARIA, UI map)
Asks the user for help when automated recovery fails

Why you'll love it:

Catches when Tester is spinning its wheels on the same failure
Requests user input before giving up on a test
Can use smarter models without token cost explosion (only sees tool summaries, not raw HTML)

When Pilot intervenes:

Actions succeed but the page doesn't change (wrong element)
Same action repeated multiple times (loop)
Same locator keeps failing (needs an alternative approach)
Only research/context calls, no action tools (not progressing)
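
The four intervention triggers above could be checked along these lines. This is a sketch under stated assumptions: the `ToolSummary` fields, the tool names in `CONTEXT_TOOLS`, the default window of five steps, and the order in which patterns are tested are all hypothetical, not taken from Explorbot.

```typescript
// Hypothetical summary of one Tester tool call; field names are assumptions.
interface ToolSummary {
  tool: string; // e.g. "click", "fillField", "research"
  locator: string;
  succeeded: boolean;
  pageChanged: boolean;
}

type StuckReason = 'no-page-change' | 'loop' | 'failing-locator' | 'no-actions' | null;

// Tools assumed to gather context rather than act on the page.
const CONTEXT_TOOLS: string[] = ['research', 'context'];

// Inspect the most recent steps and report the first stuck pattern found.
function detectStuck(steps: ToolSummary[], windowSize = 5): StuckReason {
  const recent = steps.slice(-windowSize);
  if (recent.length < windowSize) return null; // not enough history yet

  const actions = recent.filter((s) => !CONTEXT_TOOLS.includes(s.tool));
  if (actions.length === 0) return 'no-actions';
  if (actions.length < 2) return null; // a single action is not a pattern

  const first = actions[0];
  const sameCall = actions.every(
    (s) => s.tool === first.tool && s.locator === first.locator,
  );
  if (sameCall && actions.every((s) => !s.succeeded)) return 'failing-locator';
  if (sameCall) return 'loop';
  if (actions.every((s) => s.succeeded && !s.pageChanged)) return 'no-page-change';
  return null;
}
```

Note that the detector only needs per-step summaries, never page HTML, which is consistent with the kind of input Pilot is described as working from.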

Captain Agent (coming soon)

Purpose: Orchestrates the whole testing session.

What it does:

Coordinates all agents intelligently
Responds to user commands in real-time
Adjusts strategy based on discoveries
Manages conversation context efficiently

Per-Agent Model Configuration

You can optimize costs by using different models for different agents:

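// Assumes the Groq provider from the Vercel AI SDK; substitute your own provider import.
import { groq } from '@ai-sdk/groq';

export default {
  ai: {
    // Default text and vision models
    model: groq('gpt-oss-20b'),
    visionModel: groq('llama-scout-4'),
    // Per-agent settings
    agents: {
      navigator: { model: groq('gpt-oss-20b') },
      researcher: {
        model: groq('gpt-oss-20b'),
        // Elements matching these selectors are filtered out of research
        excludeSelectors: ['.cookie-banner'],
      },
      planner: { model: groq('gpt-oss-20b') },
      tester: { model: groq('gpt-oss-20b'), progressCheckInterval: 5 },
      pilot: { stepsToReview: 5 },
    },
  },
};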

Typical optimization:

Navigator needs fast responses for real-time interaction
Researcher benefits from vision capabilities
Planner can use a slightly larger model for better test design
Tester needs tool use for execution
Pilot can use smarter models — it only processes tool summaries, not HTML/ARIA
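
In the sample configuration, the pilot entry sets no model of its own. One plausible reading, and it is only an assumption here, is that an agent without a model falls back to the top-level one. The sketch below shows that lookup rule with models reduced to plain string identifiers; `AiConfig`, `AgentOptions`, and `modelFor` are hypothetical names.

```typescript
// Hypothetical shapes: a model is just a string id here, not a real provider object.
interface AgentOptions {
  model?: string;
  [option: string]: unknown;
}

interface AiConfig {
  model: string;
  visionModel?: string;
  agents?: Record<string, AgentOptions>;
}

// Assumed rule: an agent without its own model uses the top-level default.
function modelFor(agent: string, ai: AiConfig): string {
  return ai.agents?.[agent]?.model ?? ai.model;
}
```

Under this rule, raising the Planner or Pilot model means adding a single model entry for that agent while every other agent keeps the cheaper default.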

How Agents Communicate

Agents share context through:

State Manager — Tracks current page, URL, navigation history
Research Results — Structured page analysis available to Planner and Tester
Experience Files — Learned patterns shared across sessions. Injected as a compact table of contents (file tags + section headings) rather than full bodies; agents pull individual sections on demand via the learn_experience tool.
Knowledge Files — Domain knowledge you provide

Each agent maintains minimal context to keep costs down. They request specific information when needed rather than carrying full conversation history.
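
The table-of-contents approach to experience files can be sketched as below. The in-memory `ExperienceFile` shape and both function names are hypothetical; the on-disk format and the real signature of the learn_experience tool are not described in this document.

```typescript
// Hypothetical in-memory model of an experience file.
interface ExperienceFile {
  tag: string; // short identifier for the file
  sections: Record<string, string>; // heading -> full section body
}

// Build the compact listing that goes into the prompt: tags and headings only.
function tableOfContents(files: ExperienceFile[]): string {
  return files
    .map((file) => {
      const headings = Object.keys(file.sections).map((h) => `  - ${h}`);
      return [file.tag].concat(headings).join('\n');
    })
    .join('\n');
}

// Fetch one section body on demand, roughly what a tool like learn_experience might return.
function readSection(
  files: ExperienceFile[],
  tag: string,
  heading: string,
): string | undefined {
  return files.find((file) => file.tag === tag)?.sections[heading];
}
```

The prompt cost then grows with the number of headings rather than the size of the bodies, and a body is paid for only when an agent asks for it.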

Pilot-Tester relationship: Pilot maintains a separate conversation from Tester. Tester's conversation contains heavy HTML/ARIA context. Pilot only sees tool execution summaries (what succeeded, what failed, what changed). This lets Pilot run on a more expensive model without a matching explosion in token cost.
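
A tool execution summary of the kind Pilot receives might be produced like this. The `ToolResult` fields and the output format are assumptions made for illustration; the point is only that the heavy html payload never appears in the summary line.

```typescript
// Hypothetical record of a tool call as the Tester might hold it.
interface ToolResult {
  tool: string;
  args: string;
  succeeded: boolean;
  urlBefore: string;
  urlAfter: string;
  html: string; // heavy payload that stays in the Tester's conversation
}

// Reduce a result to one short line: what ran, whether it worked, what changed.
function summarize(result: ToolResult): string {
  const status = result.succeeded ? 'ok' : 'failed';
  const change =
    result.urlBefore === result.urlAfter
      ? 'no navigation'
      : `navigated to ${result.urlAfter}`;
  return `${result.tool}(${result.args}): ${status}, ${change}`;
}
```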
