HITL Provider

Llama Guard

Open-source safety classifier from Meta with strong docs and ecosystem integration but limited native API readiness and self-hosted security concerns.

Trust score: 53
Stale · Not verified
Our Verdict

Prompt injection bypass vulnerabilities have been reported (no data breaches).

Trust Breakdown

Trust score: 53 (Caution)
AGENT: 20 (Autonomous workflow delegation)
TRUST: 90 (Transparency & verification)
INTEROP: 60 (Protocol compatibility breadth)
SECURE: 10 (Security controls & audit trail)
DOCS: 85 (Documentation completeness)
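
The page does not state how the composite is derived. As a quick sanity check, the five sub-scores listed above average to exactly 53, which matches the composite trust score. The sketch below assumes an unweighted mean; the equal weighting and the helper name `composite` are assumptions for illustration, not the site's documented method.

```python
from statistics import mean

# Sub-scores as listed in the Trust Breakdown.
sub_scores = {
    "AGENT": 20,    # Autonomous workflow delegation
    "TRUST": 90,    # Transparency & verification
    "INTEROP": 60,  # Protocol compatibility breadth
    "SECURE": 10,   # Security controls & audit trail
    "DOCS": 85,     # Documentation completeness
}


def composite(scores: dict[str, int]) -> int:
    """Hypothetical composite: unweighted mean of the sub-scores, rounded.

    ASSUMPTION: equal weights. The source page does not confirm this formula.
    """
    return round(mean(scores.values()))


print(composite(sub_scores))  # (20 + 90 + 60 + 10 + 85) / 5 = 53
```

If the site applies different weights, the same helper could take a weight per dimension; the match at 53 is consistent with equal weighting but does not prove it.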

Fit Assessment

Best for

Agent System

Protocol Support

MCP
A2A
A2H
REST API
Agent-callable

Capabilities

Transaction capable
ACP support
Audit trace

Pricing

Free

Workflow Fit

Agent System
