"Battle-test your AI agent's knowledge before you ship. Ship with data, not vibes."
AgentReady is a lightweight, Netlify-deployed harness that stress-tests the knowledge base and agent harness (skills/rules/tools/system prompts) you built around an LLM.
- Upload your agent's knowledge base (docs, runbooks, READMEs, etc.)
- Upload your agent harness (optional) + describe the agent's mission
- Get a readiness score and a prioritized list of blind spots before users find them
- Generates stress-test probes tailored to your mission and domain (incidents, edge cases, adversarial prompts, cross-domain questions)
- Evaluates knowledge + harness coverage by running probes and forcing the model to be explicit about confidence + missing context
- Returns a readiness score + top gaps (what's missing, whether it's a knowledge vs harness issue, why it matters, and how to fix it)
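To make the output concrete, here is a rough picture of what a readiness report could look like. All field names below are illustrative assumptions, not the app's actual schema:

```javascript
// Hypothetical shape of an AgentReady report (illustrative, not the exact schema).
const exampleReport = {
  readinessScore: 72,                                    // 0-100 overall
  categoryScores: { incidents: 80, edgeCases: 65, adversarial: 70 },
  topGaps: [
    {
      probe: "What is the rollback procedure after a failed deploy?",
      type: "knowledge",                                 // "knowledge" vs "harness" issue
      whyItMatters: "The agent will improvise without a documented rollback.",
      suggestedFix: "Add a rollback runbook to the knowledge base.",
    },
  ],
};

// One plausible way the overall score could relate to category scores:
// the rounded mean (an assumption, not necessarily the app's formula).
const scores = Object.values(exampleReport.categoryScores);
const mean = Math.round(scores.reduce((a, b) => a + b, 0) / scores.length);
console.log(mean); // 72
```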
- Netlify Functions (serverless)
- Anthropic Claude API (`@anthropic-ai/sdk`)
- Vanilla JS (no framework)
- Neo-brutalist CSS
```bash
git clone https://github.com/rajnavakoti/agent-ready
cd agent-ready
npm install

# Run Netlify dev server (functions + static site)
ANTHROPIC_API_KEY=your-key npm run dev
```

Then open the local URL printed by Netlify (defaults to http://localhost:8888).
- Create a new Netlify site from this repo
- Set the `ANTHROPIC_API_KEY` environment variable
- Deploy

`netlify.toml` already sets:

```toml
publish = "src"
functions = "netlify/functions"
```
The app is a 3-function pipeline:
- `netlify/functions/generate-probes.mjs`: given mission + knowledge base + harness, generates targeted probes (JSON)
- `netlify/functions/execute-probes.mjs`: runs each probe against the provided context and returns answers + confidence + explicit gaps
- `netlify/functions/analyze-gaps.mjs`: aggregates results into category scores, an overall readiness score, and the top blind spots
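As a sketch of the aggregation step, the kind of logic `analyze-gaps` performs can be written as a pure function: group probe results by category, average confidence into a 0-100 score, and surface the lowest-confidence probes as gaps. This is an illustration of the idea, not the repo's actual code; the input shape (`category`, `confidence`, `probe`) is an assumption:

```javascript
// Illustrative aggregation (not the repo's actual implementation):
// results = [{ category, confidence (0-1), probe }, ...]
function analyzeGaps(results) {
  // Group probe results by category.
  const byCategory = {};
  for (const r of results) {
    (byCategory[r.category] ??= []).push(r);
  }
  // Score each category as the mean confidence, scaled to 0-100.
  const categoryScores = Object.fromEntries(
    Object.entries(byCategory).map(([cat, rs]) => [
      cat,
      Math.round((100 * rs.reduce((s, r) => s + r.confidence, 0)) / rs.length),
    ])
  );
  // Overall readiness: mean of category scores.
  const overall = Math.round(
    Object.values(categoryScores).reduce((a, b) => a + b, 0) /
      Object.keys(categoryScores).length
  );
  // Top gaps: the probes the model was least confident about.
  const topGaps = [...results]
    .sort((a, b) => a.confidence - b.confidence)
    .slice(0, 3);
  return { overall, categoryScores, topGaps };
}
```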
On the frontend, `src/js/app.js` orchestrates the pipeline, calling in order:

- `/.netlify/functions/generate-probes`
- `/.netlify/functions/execute-probes`
- `/.netlify/functions/analyze-gaps`
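The orchestration amounts to chaining three POSTs, feeding each function's output into the next. A minimal sketch (the payload field names and `post` helper are assumptions about `app.js`, not its actual code):

```javascript
// Illustrative pipeline chaining; the endpoint paths match the functions above,
// but payload/response field names are assumptions, not the actual schema.
async function runPipeline(post, { mission, knowledgeBase, harness }) {
  const { probes } = await post("/.netlify/functions/generate-probes", {
    mission, knowledgeBase, harness,
  });
  const { results } = await post("/.netlify/functions/execute-probes", {
    probes, knowledgeBase, harness,
  });
  return post("/.netlify/functions/analyze-gaps", { results, mission });
}

// In the browser, `post` would wrap fetch:
const post = (url, body) =>
  fetch(url, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  }).then((r) => r.json());
```

Passing `post` in as a parameter keeps the chaining logic testable without a running server.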
AgentReady uses 17+ public datasets across 8 domains to ground scenario simulation, including:
- Customer support
- IT incidents
- Healthcare
- Legal
- Finance / fraud
- E-commerce
- Real estate
- Code / bugs
This project is inspired by DDC-style evaluation methodology: https://arxiv.org/abs/2603.14057
MIT (see LICENSE).