Everything you need for AI-native browser testing at enterprise scale.
No premium tier, no paid add-ons, no feature gating. The eight capabilities below are in every Karate Agent deployment.
01 · Token-Efficient API
The JS Agent API (look(), act(), wait()) is purpose-built for LLM consumption. Structured JSON responses with only actionable data — no DOM dumps, no HTML parsing.
look() returns {role, name, locator, actions} per element, not raw HTMLlook() calls return only what changedagent.text() renders the page as structured markdown — tables, headings, key-value pairs — for data extractionRaw HTML dump
~80 KB
per page scan
agent.look()
~2 KB
structured JSON
16
chars Karate
30
chars Playwright
42
chars XPath
02 · Display-Text Locators
{button}Submit uses visible text instead of CSS selectors or XPath. When the app is refactored and element IDs change, display-text locators keep working.
This dramatically reduces maintenance — the #1 cost of traditional UI test suites. Karate's locator syntax is also shorter: 16 characters vs 30–42 for equivalent Playwright/XPath selectors, saving output tokens across every session.
03 · Bring Your Own LLM
No vendor lock-in. Use cloud APIs or run open-weight models on your own hardware for zero API cost and full data sovereignty.
Cloud aggregator
100+ models through one endpoint.
openrouter/*
Direct
Claude Sonnet, Haiku, Opus — native.
anthropic/*
Self-hosted · $0 API
4B active (MoE) · 16 GB VRAM · consumer GPU.
ollama/gemma3
OpenAI-compatible
Llama, Mistral, Qwen, any local.
ollama/* · vllm/*
Gemma 26B (4B active parameters, MoE) runs on a single consumer GPU and passes all our page automation, flow integration, and vision benchmarks. For regulated industries: the entire stack — server, browser, and LLM — runs on your infrastructure with zero internet dependency. Per-job model override lets teams use smaller models for routine jobs, larger models for exploratory work.
04 · 100% Self-Hosted
The entire platform — grid server, worker containers, dashboard — runs on your infrastructure. No data leaves your network.
Runtime
Java 21
Container
Docker 24+
Artifacts
1 jar · 1 image
Network
Air-gap OK
5 LLM iterations
~50 s
login flow, pure LLM
Flow.run()
~2 s
same flow, scripted
05 · Flow System
.js. Native speed. Self-healing.Executable .js scripts run at native JavaScript speed. A login flow that takes 5 LLM iterations (~50s) executes in 2 seconds via Flow.run(). Also supports .md task files — plain English hints that can cut iterations in half without writing any code.
.md task files — plain English hints alongside .js flows, no code required06 · Works With Any Coding Agent
curl. Your agent bootstraps itself.AI-first design. One curl command and your coding agent knows how to create browser sessions and drive them. No plugins, no configuration files, no sidecar processes.
/api/prompt returns a markdown document that teaches the LLM the full API — it creates sessions, drives browsers, and submits jobs on its ownkarate_eval tool via Streamable HTTP for agents that prefer MCP/sessions/{id}/prompt with the full Agent API tailored to that sessionAny agent that can run a shell command
07 · Recording & Reproducibility
H.264 video, step-by-step transcript, structured report, screenshots per step. Recordings accelerate the entire team.
Review video + transcript instead of reproducing 12-step workflows.
Watch successful autonomous sessions, extract patterns, codify as flows.
New team members watch recordings to learn app navigation and locator patterns.
Share recordings with product owners. They see exactly what was tested.
08 · Enterprise SPA Support
Cursor-pointer discovery catches <div onclick> targets in Guidewire, Salesforce, ServiceNow — apps where standard locator strategies fail.
Auto-retry when enterprise field formatters (GW/SAP) clear values set before async init completes — no flow workaround needed.
Book a demo to see how Karate Agent handles your application — bring your hardest flow, we'll run it live.