Leaderboard - GuardrailHub

50+

Guardrails Benchmarked

10K+

Test Cases

97.2%

Average Accuracy

8.3ms

Average Latency

Rank Guardrail Accuracy Precision Recall Latency Type

🥇

🔐

pii-blocker

ethicalzen

100% 100% 100% 1.2ms

Official

🥈

🛡️

prompt-injection-blocker

ethicalzen

98.5% 99.1% 97.8% 2.8ms

Official

🥉

💰

financial-advice-blocker

ethicalzen

96.8% 95.2% 98.4% 8.5ms

SG2

⚖️

legal-advice-blocker

ethicalzen

95.3% 94.1% 96.5% 12.3ms

SG2

🏥

medical-advice-blocker

ethicalzen

94.2% 92.8% 95.6% 14.7ms

SG2

⚠️

toxicity-detector

ethicalzen

93.5% 91.2% 95.8% 18.2ms

SG2

📋

hipaa-compliance

ethicalzen

99.1% 98.7% 99.5% 6.4ms

Official

🎯

hallucination-detector

ethicalzen

87.2% 85.4% 89.1% 45.3ms

SG2

🎓

education-guardrail

community

89.4% 88.1% 90.7% 22.1ms

Community

🔒

secrets-detector

ethicalzen

97.8% 96.5% 99.1% 3.2ms

Official

📊 Benchmark Methodology

All guardrails are evaluated against a standardized test suite of 500+ examples per category, including both positive (should block) and negative (should allow) cases. Metrics are calculated as follows:

Accuracy = (TP + TN) / (TP + TN + FP + FN)
Precision = TP / (TP + FP) — How many blocks were correct
Recall = TP / (TP + FN) — How many violations were caught
Latency = P50 response time in milliseconds

Benchmarks are run monthly. Last updated: November 2024.

🏆 Guardrail Leaderboard

📊 Benchmark Methodology