Lakera Review

lakera
tldr

TL;DR

Lakera is one of the few AI-native security platforms that combines workforce AI security, AI agent security, and AI red-teaming under one roof. The Gandalf prompt injection challenge — and the more recent Gandalf: Agent Breaker environment — give the company unusual depth in adversarial research that translates back into the product. For buyers who want runtime defense and red-teaming from the same vendor, Lakera is the strongest single answer in the category.

Coverage is broader than a pure DLP and the API-first architecture is a fit for organizations with engineering teams building custom LLM apps and AI agents.

Score: 8.5 / 10

How This Review Was Conducted

This review is based on vendor demos, public documentation, customer feedback, and framework analysis. Lab testing is currently pending.


This review is currently based on:

Lab testing has not yet been completed because access is still pending.

The scoring rubric

Every reviewed product receives a score from 1 to 10 on each of seven dimensions. The dimensions and their weights are:

Dimension

Coverage breadth

Detection accuracy

Deployment friction

Policy & control depth

Framework alignment

Pricing transparency

Support & documentation

Weight

20%

20%

15%

15%

10%

10%

10%

Score

9

9

7

8

9

5

9

What it measures

Workforce (apps + browsers, shadow AI discovery), AI agents, custom LLM apps. Broad model surface coverage.

Mature prompt injection and jailbreak detection backed by Gandalf-driven adversarial research; sub-50ms runtime latency per vendor.

API-first; cloud-native enterprise integrations are documented. Pure SaaS rollouts are fast; custom LLM app integration requires engineering work.

Context-aware data protection and granular policy primitives across the runtime, agent, and red-team modules.

Strong OWASP LLM Top 10 coverage; reasonable NIST AI RMF mapping.

Quote-based for enterprise; some self-serve tiers exist for individual modules.

Public documentation is among the deepest in the category; the Gandalf community is a credibility multiplier.

Coverage breadth

Weight
20%

Score

9

What it measures

Workforce (apps + browsers, shadow AI discovery), AI agents, custom LLM apps. Broad model surface coverage.

Detection accuracy

Weight
20%

Score

9

What it measures

Mature prompt injection and jailbreak detection backed by Gandalf-driven adversarial research; sub-50ms runtime latency per vendor.

Deployment friction

Weight
15%

Score

7

What it measures

API-first; cloud-native enterprise integrations are documented. Pure SaaS rollouts are fast; custom LLM app integration requires engineering work.

Policy & control depth

Weight
15%

Score

8

What it measures

Context-aware data protection and granular policy primitives across the runtime, agent, and red-team modules.

Framework alignment

Weight
10%

Score

9

What it measures

Strong OWASP LLM Top 10 coverage; reasonable NIST AI RMF mapping.

Pricing transparency

Weight
10%

Score

5

What it measures

Quote-based for enterprise; some self-serve tiers exist for individual modules.

Support & documentation

Weight
10%

Score

9

What it
measures

Public documentation is among the deepest in the category; the Gandalf community is a credibility multiplier.

What it does well

Runtime + red-teaming in one platform.

Most competitors do one or the other. Lakera does both, & the feedback loop between adversarial research and detection improvements shows up in the product.

Shadow AI discovery integrated with the workforce module.

Discovery is not a separate SKU. New AI tools surfacing in the org show up in the same console used to enforce runtime policy.

Sub-50ms runtime latency.

Per vendor; appropriate for in-line enforcement on customer-facing LLM apps.

Gandalf adversarial research.

The original Gandalf prompt injection challenge and the newer Gandalf: Agent Breaker show a rare public commitment to adversarial testing and continuous product learning.

OWASP-aligned coverage.

Maps cleanly onto OWASP LLM01 (prompt injection), LLM07 (system prompt leakage), LLM02 (sensitive information disclosure), & elements of the new owasp Top 10 Applications 2026.

Where it falls short

This site distinguishes between two testing tracks. Both are honest about their depth. The lab program operates under the following commitments:

Pricing not fully transparent

Enterprise pricing is quote-based.

Engineering integration required for custom LLM apps.

Less of a weakness, more a category constraint but buyers expecting a zero-deployment workforce DLP should not be running through Lakera’s full stack on day one.

Open questions

Published ISO 42001 mapping, named-CSM tier thresholds, & customer-attested benchmark numbers are not publicly available at the depth a top-tier review would prefer.

Best fit

Mid-to-large enterprises with engineering teams building or operating custom LLM applications and AI agents, plus a workforce that uses ChatGPT, Claude, Gemini, Perplexity, and embedded SaaS AI. Buyers who want runtime defense and red-teaming from one vendor.

Poor fit

Small and mid-market organizations whose primary need is “stop employees pasting secrets into ChatGPT and Claude” — Lakera will work, but a focused tool like AILeakShield will get to working policy faster.

Pricing transparency

Mixed. Self-serve tiers exist for some modules; enterprise is quote-based. Improving on this would lift the score.

Alternatives

HiddenLayer is the closest enterprise alternative on AI lifecycle and supply chain. Lasso Security is a closer match on guardrails-layer GenAI security. Witness AI is the network-layer alternative.

What We Would Test in the Lab

If Lakera grants lab access, we would run the following scenarios. This list serves both as transparency about how a Lab Tested review of Lakera would be scored, and as a public roadmap that pressures vendors toward participation:

Financial

The standard 150-prompt sensitive data set across the workforce module to evaluate detection accuracy in the inspected path.

Prompt injection

The standard 150-prompt sensitive data set across the workforce module to evaluate detection accuracy in the inspected path.

Agent & MCP inspection

Verify policy enforcement on a representative agent runtime with at least one MCP tool integration.

Policy enforcement

Block, warn, redact, allow behaviors against the configured workforce and runtime policies.

Audit logging

Verify what is logged, what is not, & retention behavior across workforce, agent, and red-team modules.

SSO integration

The standard 150-prompt sensitive data set across the workforce module to evaluate detection accuracy in the inspected path.

Latency

Measure runtime latency on standard prompt sizes against the vendor's sub-50ms claim

Adoption considerations

Lakera’s three-module architecture (Workforce, Agent, Red Teaming) lets buyers start narrow and expand. The most common adoption pattern we have seen in customer references is to start with the AI Red Teaming module — adversarial testing of an existing LLM application before launch — and add Workforce and Agent modules as the AI portfolio grows.

The advantage of this sequence is that the red-teaming engagement produces evidence usable in NIST AI RMF Measure activities and ISO 42001 lifecycle artifacts; that evidence is a wedge for the broader rollout.

For organizations that prefer a workforce-first deployment, Lakera’s shadow AI discovery is integrated into the same console as the runtime policy, which avoids the “two consoles for one problem” pattern common with stitched-together discovery and enforcement vendors.

Integration footprint

API-first means engineering work for custom LLM applications. The integration is well-documented; references describe a few engineer-weeks for a typical deployment on a single application, with subsequent applications adopting the same pattern in days.

The MCP and agent integrations are more complex and depend on the
agent framework in use; LangChain, LangGraph, and direct MCP integrations are documented.

Lakera Review

Total cost of ownership

TCO depends on which modules are licensed and the seat count for the workforce module. Buyers comparing TCO with broader-platform competitors should normalize for the value of the red-teaming module many enterprises pay an external red-team firm for periodic engagements

and the in-platform red-teaming module replaces a portion of that spend. We saw one customer reference attribute roughly 15% of total Lakera value to the displaced external red-team budget.

question

Open questions we plan to follow up on

FAQ

Does Lakera replace a traditional DLP?
No. Lakera covers AI-specific detection and runtime policy enforcement. Traditional DLP for non-AI data flows still has a role.Yes. All reviews, comparisons, and ranked lists are free and require no login. Two long-form guides — the ISO 42001 readiness checklist and the EU AI Act roadmap — are emailed in PDF form to subscribers.
Gandalf is Lakera’s public prompt injection challenge — a series of levels in which a player attempts to extract a secret from an LLM under increasing defenses. Gandalf: Agent Breaker is the agent-focused successor. Both are credibility signals for Lakera’s adversarial research.
The AI Agent Security module is built specifically for agent runtime, including tool-use scenarios and MCP-style integrations. See our guide on the OWASP Top 10 for Agentic Applications 2026 for the threat model.
Lakera’s runtime and logging primitives map onto Article 16 obligations around quality management and logging, but a buyer pursuing high-risk system compliance should request the vendor’s EU AI Act mapping document directly.