Gateway overhead (under 1 ms P95): measured at the agent on commodity Linux x86-64 hardware (4 vCPU, 8 GiB RAM) with a synthetic 1 KiB prompt and 4 KiB response
Reported number excludes upstream LLM latency..
Public methodology for the performance claims published on the architecture, product, and pricing pages. Scope, test setup, what is measured, what is not, and known limitations.
Last reviewed June 2026
Performance numbers in security marketing routinely conflate best-case microbenchmarks with end-to-end behaviour. Procurement teams need to know which figure applies in their setting.
Every published number on the public site falls into one of three buckets: gateway overhead, scan-run cost, scan-run wall time. Each bucket has its own measurement protocol, repeatability requirement, and review cadence.
Reported number excludes upstream LLM latency..
Excludes time spent in a customer human-review queue..
Excludes infrastructure cost, support cost, and amortised fixed costs..
.
.
.
NIST AI 600-1 MEASURE 1.1 (define metrics) and MEASURE 1.3 (track over time); ISO/IEC 42001 A.9.4 (performance monitoring).
Average overhead is dominated by cache hits and underweights the worst-case path. P95 is what determines whether the gateway is acceptable on a customer-facing endpoint at peak load.
No. The figure is judge-execution cost at scale (Batch API, cache enabled). Human review is a separate workflow with its own SLA and cost model.
Scoped walkthrough of the Methodology / Performance surface against your environment. No credit card.