low severityLakera Guard API

API requests to https://api.lakera.ai/v2/guard experience sudden increases in response time (e.g. >200ms P99) when sending multiple concurrent requests or long prompts, causing agent timeouts/delays.

Root cause

No specific root cause documented for named issue. Latency scales with content length (#tokens) and #detectors in policy due to processing time. Potential quota/concurrency limits in Community plan (10k req/mo). Network latency from global but not local deployment ([Lakera API docs](https://docs.lakera.ai/docs/api/guard), [pricing](https://www.eesel.ai/blog/lakera-pricing)).

Lakera GuardAPI latencyload spikeconcurrencyrate limit

Citations

https://docs.lakera.ai/docs/api/guard