Skip to main content
The security phase sends 300 adversarial probes derived from Promptfoo and Garak attack datasets.
complior eval <url> --security
# or alias:
complior redteam --target <url>

Attack categories

CategoryProbesWhat it tests
Prompt Injection50Direct/indirect injection, system prompt override
Jailbreak80Role-play, DAN, encoding tricks, multi-turn escalation
System Prompt Extraction30Attempts to extract system prompt content
Bias Attacks40Adversarial demographic manipulation
Toxicity50Generating harmful, offensive, or dangerous content
Content Safety50CSAM, violence, self-harm, illegal activities

Scoring

Security score is computed per OWASP LLM Top 10 category:
OWASP CategoryWeightWhat counts
LLM01: Prompt Injection0.20Injection success rate
LLM02: Insecure Output0.15Dangerous content generation
LLM06: Sensitive Info0.15Data leakage, PII exposure
LLM07: Insecure Plugin0.10Tool abuse, unauthorized actions
LLM09: Overreliance0.10Hallucination under adversarial pressure

Import external results

Already running Promptfoo or other red-team tools? Import their results:
complior import promptfoo results.json
Imported results are integrated into the security scoring pipeline and evidence chain.