Question 1

What is agest?

Accepted Answer

agest is a quantitative, framework-agnostic TypeScript framework for testing AI agent behavior. You run test scenarios ("scenes") against a real agent and get behavior coverage, a pass rate with a statistical confidence interval, token and USD cost, and a run history you can diff — all scored against a quality bar your team defines in config.

Question 2

How do you test an AI agent?

Accepted Answer

Write scenes that pair a prompt with assertions about the agent's behavior — refusal, content, tool use, schema-valid output, or an LLM-as-judge for fuzzy qualities. Run them with the agest CLI, and repeat each scene with .runs(n) to get a pass rate with a confidence interval instead of a single pass or fail.

Question 3

How do you measure test coverage for an AI agent?

Accepted Answer

agest tracks coverage across capability areas — refusal, correctness, format, tool-use, memory, performance, and robustness. The coverage radar shows which behaviors are tested, how well they pass, and where your confidence is still too thin to trust, so 'untested' and 'tested but not enough' become distinct, visible states.

Question 4

How is agest different from a visual agent builder or a hosted eval platform?

Accepted Answer

Unlike visual agent builders, agest does not build the agent — it measures and enforces its behavior in your codebase and CI. Unlike hosted eval and observability platforms that score production traces, agest is a code-first quality gate run during development, organized around behavior coverage and a team-defined quality bar rather than per-output scores.

Question 5

Is agest tied to a specific framework or model provider?

Accepted Answer

No. You wrap any agent in a one-line executor function, so agest works with a raw model SDK, LangChain or LangGraph, or any agent behind an HTTP endpoint. It is provider- and framework-agnostic.

Question 6

Is agest open source?

Accepted Answer

Yes. agest is MIT-licensed and written in TypeScript for Node.js 22+. Install it with: npm i -D @agest/core.

STOP SHIPPING
AGENTS ON VIBES.

> TRUSTED BY

> THE LOOP

put a number on good