Add EvalView — regression testing for AI agents
Adds EvalView — regression testing for AI agents.
EvalView snapshots agent behavior (tool calls, parameters, output) as a baseline, then diffs each new run against that baseline after every change. This catches silent regressions before they reach production.
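To make the snapshot-and-diff idea concrete, here is a minimal sketch in plain Python. It does not use EvalView's actual API, which this description does not show. The names `Snapshot` and `diff_snapshots` are hypothetical and exist only for illustration, and the exact-match comparison of output is an assumption, not necessarily how EvalView compares runs.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any


@dataclass
class Snapshot:
    """Hypothetical record of one agent run: ordered tool calls plus final output."""
    tool_calls: list[tuple[str, dict[str, Any]]] = field(default_factory=list)
    output: str = ""


def diff_snapshots(baseline: Snapshot, current: Snapshot) -> list[str]:
    """Return human-readable differences; an empty list means nothing changed."""
    changes: list[str] = []
    base_names = [name for name, _ in baseline.tool_calls]
    curr_names = [name for name, _ in current.tool_calls]

    if base_names != curr_names:
        # The agent called different tools, or called them in a different order.
        changes.append(f"tool sequence changed: {base_names} -> {curr_names}")
    else:
        # Same tools in the same order: compare the parameters call by call.
        pairs = zip(baseline.tool_calls, current.tool_calls)
        for i, ((name, base_params), (_, curr_params)) in enumerate(pairs):
            if base_params != curr_params:
                changes.append(
                    f"parameters changed for call {i} ({name}): "
                    f"{base_params} -> {curr_params}"
                )

    # Assumption: exact string match. Real agent output often needs fuzzier checks.
    if baseline.output != current.output:
        changes.append("output changed")
    return changes


# Illustrative data only: a baseline run and a run after a prompt change.
baseline = Snapshot([("search", {"query": "refund policy"})], "Refunds take 5 days.")
current = Snapshot([("search", {"query": "refund"})], "Refunds take 5 days.")

for change in diff_snapshots(baseline, current):
    print(change)
```

In this sketch the final output is identical in both runs, so only the changed tool parameter is reported. That is the kind of silent regression an output-only check would miss.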
pip install evalview · Apache 2.0