🏆 Guardrail Leaderboard
Benchmark comparison of guardrails across accuracy, precision, recall, and latency
50+
Guardrails Benchmarked
10K+
Test Cases
97.2%
Average Accuracy
8.3ms
Average Latency
Rank
Guardrail
Accuracy
Precision
Recall
Latency
Type
🥇
100%
100%
100%
1.2ms
pii-blocker
Official
🥈
98.5%
99.1%
97.8%
2.8ms
prompt-injection-blocker
Official
🥉
96.8%
95.2%
98.4%
8.5ms
financial-advice-blocker
SG2
4
95.3%
94.1%
96.5%
12.3ms
legal-advice-blocker
SG2
5
94.2%
92.8%
95.6%
14.7ms
medical-advice-blocker
SG2
6
93.5%
91.2%
95.8%
18.2ms
toxicity-detector
SG2
7
99.1%
98.7%
99.5%
6.4ms
hipaa-compliance
Official
8
87.2%
85.4%
89.1%
45.3ms
hallucination-detector
SG2
9
89.4%
88.1%
90.7%
22.1ms
education-guardrail
Community
10
97.8%
96.5%
99.1%
3.2ms
secrets-detector
Official
📊 Benchmark Methodology
All guardrails are evaluated against a standardized test suite of 500+ examples per category, including both positive (should block) and negative (should allow) cases. Metrics are calculated as follows:
- Accuracy = (TP + TN) / (TP + TN + FP + FN)
- Precision = TP / (TP + FP) — How many blocks were correct
- Recall = TP / (TP + FN) — How many violations were caught
- Latency = P50 response time in milliseconds
Benchmarks are run monthly. Last updated: November 2024.