Vectara

Categories

Blog - page 2

All posts

Guardian Agents Benchmark
Agentic

Guardian Agents Benchmark

We built a platform-agnostic benchmark with ~900 real-world scenarios across 6 domains to measure agent robustness, surface tool-calling failures, and validate guardian agents that prevent costly / unsafe behavior.

Vishal NaikChenyu Xu
Vishal Naik,Chenyu Xu
Before you go...

Connect with
our Community!