Name: Ibras AI Agent
Author: volfadar

Question 1

How do I install ibras-ai-agent?

Accepted Answer

In Claude Code, run `/plugin marketplace add volfadar/ibras-ai-agent`, then `/plugin install mastra-evals@ibras-ai-agent`. The plugin requires a TypeScript runtime (Bun or `npx tsx`) and a Mastra project with the `@mastra/core` and `@mastra/evals` packages installed.
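Before installing, it can help to confirm that the project declares both packages. The following is a minimal sketch, not part of the plugin: the function name `missingMastraPackages` and the `PackageJson` type are made up for illustration, and it only inspects a parsed `package.json`, not what is actually installed on disk.

```typescript
// Hypothetical pre-install check; the plugin does not ship this helper.
const REQUIRED_PACKAGES = ["@mastra/core", "@mastra/evals"];

// Only the two manifest fields this check needs.
interface PackageJson {
  dependencies?: Record<string, string>;
  devDependencies?: Record<string, string>;
}

// Returns the required packages that the manifest does not declare
// in either dependencies or devDependencies.
function missingMastraPackages(pkg: PackageJson): string[] {
  const declared = { ...pkg.dependencies, ...pkg.devDependencies };
  return REQUIRED_PACKAGES.filter((name) => !(name in declared));
}
```

Pass it the parsed contents of your `package.json`. An empty array means both packages are declared.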

Question 2

What does the mastra-evals plugin do?

Accepted Answer

It profiles your Mastra agent and auto-generates a complete evaluation system: code-based scorers, LLM judges, golden datasets, offline experiments, CI regression tests, and online sampling. All of these are tested against `@mastra/core` 1.45.

Question 3

Can I add custom plugins to the marketplace?

Accepted Answer

Yes. Follow the structure under `plugins/<name>/` and submit the plugin as described in PUBLISHING.md. Each plugin lives in its own directory and can be versioned independently within the marketplace.

Question 4

What are the system requirements?

Accepted Answer

You need Claude Code and a TypeScript runtime (Bun or `npx tsx`). The mastra-evals plugin additionally needs a Mastra project with the `@mastra/core` and `@mastra/evals` packages installed. The plugins themselves use only Node.js built-ins.

Question 5

How do I run CI regression tests?

Accepted Answer

The generated `runEvals` function integrates into your CI/CD pipeline. It compares current agent performance against your golden dataset baseline and fails builds if regressions exceed thresholds.
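The comparison step can be pictured as a simple gate. This is a sketch under assumptions, not the generated code: the `ScoreMap` shape, the `findRegressions` name, and the `maxDrop` parameter are all hypothetical, and the actual output of the generated `runEvals` may look different.

```typescript
// Hypothetical shapes; the real generated runEvals output may differ.
type ScoreMap = Record<string, number>; // scorer name -> mean score in [0, 1]

interface Regression {
  scorer: string;
  baseline: number;
  current: number;
  drop: number;
}

// Lists every scorer whose mean score fell by more than maxDrop
// relative to the baseline recorded on the golden dataset.
function findRegressions(baseline: ScoreMap, current: ScoreMap, maxDrop: number): Regression[] {
  const regressions: Regression[] = [];
  for (const [scorer, base] of Object.entries(baseline)) {
    const now = current[scorer] ?? 0; // a scorer missing from the current run counts as 0
    const drop = base - now;
    if (drop > maxDrop) {
      regressions.push({ scorer, baseline: base, current: now, drop });
    }
  }
  return regressions;
}
```

In a CI job, a non-empty result would typically be turned into a non-zero exit code so that the build fails.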

Question 6

What is online sampling?

Accepted Answer

Online sampling is the real-time evaluation of agent outputs in production. The plugin generates hooks that continuously sample live agent responses and score them against your evaluation criteria, surfacing drift or performance degradation.
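As a rough illustration of the idea, the sketch below scores a random fraction of responses and tracks a rolling mean. It is an assumption-laden example rather than the hooks the plugin generates: `createOnlineSampler`, `Scorer`, `rate`, and `windowSize` are hypothetical names.

```typescript
// Hypothetical sampler; the hooks generated by the plugin may look different.
type Scorer = (input: string, output: string) => number; // score in [0, 1]

function createOnlineSampler(
  rate: number, // fraction of responses to score, between 0 and 1
  scorer: Scorer,
  windowSize: number = 100, // how many recent scores to keep
  random: () => number = Math.random, // injectable for deterministic tests
) {
  const scores: number[] = [];
  return {
    // Call once per live response; the response is scored only when sampled.
    observe(input: string, output: string): void {
      if (random() >= rate) return;
      scores.push(scorer(input, output));
      if (scores.length > windowSize) scores.shift();
    },
    // Mean of the recent sampled scores, or null if nothing was sampled yet.
    rollingMean(): number | null {
      if (scores.length === 0) return null;
      return scores.reduce((sum, s) => sum + s, 0) / scores.length;
    },
  };
}
```

A falling rolling mean over time is one simple signal of drift. Keeping the sampling rate low limits the cost of scoring in production.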

Question 7

Can I use ibras-ai-agent for non-Mastra AI agents?

Accepted Answer

It is currently optimized for Mastra projects. The marketplace structure can support other plugins in the future, but mastra-evals specifically requires `@mastra/core` and `@mastra/evals`.

Question 8

What is a golden dataset in this context?

Accepted Answer

A golden dataset is a curated set of representative agent inputs and their expected outputs. The plugin auto-generates one based on your agent's behavior, then uses it for offline experiments and regression testing to ensure consistency.
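To make this concrete, here is a minimal sketch of a golden dataset and an offline run over it. Everything in it is illustrative: the `GoldenCase` shape, the two sample cases, the `exactMatch` scorer, and `runOffline` are hypothetical, and the format the plugin generates may differ.

```typescript
// Hypothetical record shape; the plugin's generated dataset format may differ.
interface GoldenCase {
  id: string;
  input: string;
  expected: string;
}

// Made-up sample cases, for illustration only.
const goldenDataset: GoldenCase[] = [
  { id: "greeting", input: "Say hello", expected: "Hello!" },
  { id: "sum", input: "What is 2 + 2?", expected: "4" },
];

// Code-based scorer: 1 when the trimmed, lowercased texts are equal, else 0.
function exactMatch(expected: string, actual: string): number {
  const normalize = (s: string) => s.trim().toLowerCase();
  return normalize(expected) === normalize(actual) ? 1 : 0;
}

// Runs every case through the agent and returns the mean score.
async function runOffline(
  agent: (input: string) => Promise<string>,
  cases: GoldenCase[],
): Promise<number> {
  if (cases.length === 0) return 0;
  let total = 0;
  for (const c of cases) {
    total += exactMatch(c.expected, await agent(c.input));
  }
  return total / cases.length;
}
```

The mean score from a run like this is the kind of baseline a later regression test can compare against. Exact match is the simplest possible scorer, and open-ended outputs usually need a looser comparison or an LLM judge.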
