Question 1

What is agent observability?

Accepted Answer

Agent observability is the practice of capturing every agent run as structured trace data (each LLM call, tool invocation, and decision) so you can see exactly what an agent did, how well it performed, and what it cost, across its full lifecycle from pilot to production.
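
To make "structured trace data" concrete, here is a minimal Python sketch of what one captured run might look like. The `Span` and `AgentRun` classes, their field names, and the sample values are all hypothetical illustrations, not Prefactor's actual schema or SDK.

```python
from dataclasses import dataclass, field


@dataclass
class Span:
    """One recorded step in a run (hypothetical shape, not Prefactor's schema)."""
    kind: str            # "llm_call", "tool_call", or "decision"
    name: str
    duration_ms: float
    cost_usd: float = 0.0
    attributes: dict = field(default_factory=dict)


@dataclass
class AgentRun:
    """All spans captured for a single agent run."""
    run_id: str
    agent: str
    spans: list = field(default_factory=list)

    def total_cost_usd(self) -> float:
        # What the run cost: the sum over every recorded span.
        return sum(span.cost_usd for span in self.spans)

    def count(self, kind: str) -> int:
        # How many steps of a given kind the agent took.
        return sum(1 for span in self.spans if span.kind == kind)


# Illustrative values only.
run = AgentRun(
    run_id="run-001",
    agent="support-bot",
    spans=[
        Span("llm_call", "plan", duration_ms=820.0, cost_usd=0.25),
        Span("tool_call", "lookup_order", duration_ms=140.0, attributes={"order_id": "A17"}),
        Span("decision", "escalate_to_human", duration_ms=1.0),
        Span("llm_call", "draft_reply", duration_ms=640.0, cost_usd=0.5),
    ],
)
```

With every step stored as a record like this, questions such as "what did this run cost?" or "how many tool calls did it make?" become simple queries over the spans.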

Question 2

What is agent evaluation?

Accepted Answer

Agent evaluation is continuously scoring agent outputs for quality, groundedness, and cost on real production traffic, so you catch drift and regressions before your users do, and can show an agent is improving over time.
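
One simple way to picture "catching drift" is to compare recent scores against an earlier baseline. The sketch below assumes quality scores between 0 and 1 arriving in time order; the function name, window sizes, and tolerance are hypothetical choices for illustration, not how Prefactor computes drift.

```python
from statistics import mean


def detect_regression(scores, baseline_size=5, window_size=5, tolerance=0.05):
    """Return True when the recent mean score drops below the baseline mean
    by more than `tolerance`. Returns False when there is too little data."""
    if len(scores) < baseline_size + window_size:
        return False
    baseline = mean(scores[:baseline_size])   # earliest scores
    recent = mean(scores[-window_size:])      # latest scores
    return (baseline - recent) > tolerance


stable = [0.9] * 10
drifting = [0.9] * 5 + [0.7] * 5
```

Here `detect_regression(stable)` is `False` and `detect_regression(drifting)` is `True`. A production system would likely use a more robust statistic, but the idea is the same: score continuously and compare against where you started.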

Question 3

Is Prefactor only for enterprises?

Accepted Answer

Prefactor is built for enterprises moving AI agents from pilot to production: organisations where evaluation, visibility, and accountability are required to scale. That said, these challenges aren't unique to large organisations. Prefactor works for teams of all sizes, from startups shipping their first production agents to government agencies.

Question 4

How does Prefactor enforce policy at runtime?

Accepted Answer

Runtime enforcement means applying guardrails directly at the agent execution layer: blocking risky actions, detecting PII in outputs, and routing high-risk operations for human approval before they execute. Unlike static rules, it adapts as agents operate.
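
As a rough illustration of the three guardrail behaviours named above (block, detect PII, route for approval), here is a small Python sketch. The action names, rule sets, and the email-only PII pattern are assumptions made for the example. It uses fixed rules for brevity, so it does not show the adaptive behaviour described above and is not Prefactor's implementation.

```python
import re
from dataclasses import dataclass

# Hypothetical rule sets for illustration.
BLOCKED_ACTIONS = {"delete_database"}
APPROVAL_ACTIONS = {"send_payment", "send_external_email"}

# Deliberately simple pattern: real PII detection covers far more than emails.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


@dataclass
class Decision:
    verdict: str  # "allow", "block", or "needs_approval"
    reason: str


def check_action(action: str) -> Decision:
    """Decide what happens to an action before it executes."""
    if action in BLOCKED_ACTIONS:
        return Decision("block", f"{action} is never permitted")
    if action in APPROVAL_ACTIONS:
        return Decision("needs_approval", f"{action} requires human sign-off")
    return Decision("allow", "no rule matched")


def redact_pii(output: str) -> tuple[str, bool]:
    """Replace email addresses in an output; report whether any were found."""
    redacted, hits = EMAIL.subn("[REDACTED_EMAIL]", output)
    return redacted, hits > 0
```

The key design point is that the check sits in the execution path: the action does not run until `check_action` has returned a verdict.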

Question 5

How is Prefactor different from AI security tools?

Accepted Answer

AI security tools focus on threats such as prompt injection, data leakage, model misuse, and compromised tooling. Prefactor focuses on whether agents are performing as intended: operating within scope, producing accurate, acceptable outcomes, and following the right human or policy controls in production.

Question 6

Why do AI agents need identity and scoped access?

Accepted Answer

AI agents need their own identity and scoped access so each action can be tied to a specific agent, task, and user context. That enables least privilege, traceability, token revocation, and safer delegation than static shared credentials.
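
A minimal sketch of the idea, assuming a hypothetical token that carries the agent, task, and user context alongside a narrow set of scopes. The class, field names, and scope strings are invented for illustration and do not reflect any particular identity provider or Prefactor API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentToken:
    """Hypothetical per-agent credential tying an action to its context."""
    token_id: str
    agent_id: str
    task_id: str
    on_behalf_of: str      # the user whose request started the task
    scopes: frozenset      # least privilege: only what this task needs


revoked_token_ids = set()


def authorize(token: AgentToken, required_scope: str) -> bool:
    """Allow an action only if the token is live and carries the scope."""
    if token.token_id in revoked_token_ids:
        return False
    return required_scope in token.scopes


token = AgentToken(
    token_id="tok-1",
    agent_id="support-bot",
    task_id="task-42",
    on_behalf_of="user-7",
    scopes=frozenset({"tickets:read"}),
)
```

With this shape, `authorize(token, "tickets:read")` succeeds, `authorize(token, "tickets:delete")` fails, and adding `"tok-1"` to `revoked_token_ids` cuts off that one agent without touching anyone else's credentials, which is what a static shared credential cannot offer.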

Question 7

How does Prefactor evaluate AI agents in production?

Accepted Answer

Prefactor assesses outcome quality, cost efficiency, and scope adherence across AI agents, then can block actions, route them for approval, or record them for audit when policy thresholds are crossed.

Question 8

How does Prefactor define agent risk?

Accepted Answer

Risk is broken into two halves. The action profile is what your agent is permitted to do: create, read, update, or delete data, trigger financial transactions, send external communications. The data profile is the categories of sensitive data flowing through it, classified from public through to secret. Together they tell you how much damage an agent could do, and how much of that surface area it is actually using.
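
The two halves can be sketched as a small data structure. This is an illustrative Python model only: the class names, the intermediate classification levels between public and secret, and the way "surface area in use" is computed are assumptions, not Prefactor's definitions.

```python
from dataclasses import dataclass
from enum import IntEnum


class Classification(IntEnum):
    # Only "public" and "secret" come from the text; the middle levels are assumed.
    PUBLIC = 0
    INTERNAL = 1
    CONFIDENTIAL = 2
    SECRET = 3


@dataclass
class RiskProfile:
    permitted_actions: set   # action profile: what the agent may do
    data_categories: dict    # data profile: category -> Classification

    def highest_classification(self) -> Classification:
        # The most sensitive data the agent can touch.
        return max(self.data_categories.values(), default=Classification.PUBLIC)

    def surface_used(self, invoked_actions: set) -> float:
        # Fraction of the permitted actions the agent has actually invoked.
        if not self.permitted_actions:
            return 0.0
        used = invoked_actions & self.permitted_actions
        return len(used) / len(self.permitted_actions)


profile = RiskProfile(
    permitted_actions={"read", "update", "financial_transaction", "external_communication"},
    data_categories={
        "contact": Classification.INTERNAL,
        "financial_records": Classification.CONFIDENTIAL,
    },
)
```

In this example the agent could touch confidential data and take four kinds of action, but `profile.surface_used({"read"})` shows it is using only a quarter of that surface.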

Question 9

What types of data does Prefactor look for?

Accepted Answer

Prefactor looks for seventeen categories in total, including standard PII (names, contact, location, behavioural), financial records, credentials, confidential business data, and the GDPR Article 9 special categories: health, biometric, genetic, racial or ethnic origin, religious belief, political opinion, sex life or sexual orientation, and trade union membership.

Question 10

What is the difference between what an agent can do and what it is actually doing?

Accepted Answer

The first is the design: the permissions and data access an engineer declared when they built the agent. The second is the reality: what the agent has actually invoked in production. Most teams only see the first. Prefactor shows you both, side by side, so you can see where an agent has drifted from what it was designed to do.
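
The comparison comes down to a set difference between what was declared and what was observed. A minimal sketch, with a hypothetical function name and made-up action names:

```python
def compare_design_to_reality(declared: set, observed: set) -> dict:
    """Split actions into those used as designed, those never used,
    and those invoked without ever being declared (drift)."""
    return {
        "in_use": sorted(declared & observed),
        "unused": sorted(declared - observed),
        "undeclared": sorted(observed - declared),
    }


declared = {"read", "update", "external_communication"}
observed = {"read", "delete"}
report = compare_design_to_reality(declared, observed)
```

In this example `report["undeclared"]` is `["delete"]`, the drift worth investigating, while `report["unused"]` lists permissions that could probably be removed.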

Question 11

How does an engineer declare risk on an agent?

Accepted Answer

Risk is declared in the schema, not sniffed from payloads at runtime. Each span type in your agent has a data risk definition: which categories of data flow through its inputs and outputs, what classification level they are at, and what actions it is allowed to take. That makes the risk profile auditable, version-controlled, and reviewable by a human before it ever runs.
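
To show what "declared in the schema" could look like, here is a hedged Python sketch of a per-span-type risk definition plus a validator that a reviewer or CI job might run. The schema layout, key names, level names, and span type are all hypothetical and do not represent Prefactor's actual schema format.

```python
# Assumed vocabulary for the example.
ALLOWED_LEVELS = ("public", "internal", "confidential", "secret")
ALLOWED_ACTIONS = {
    "create", "read", "update", "delete",
    "financial_transaction", "external_communication",
}

# A declaration that lives in version control next to the agent code.
SPAN_SCHEMA = {
    "lookup_customer": {
        "inputs": {"contact": "internal"},
        "outputs": {"contact": "confidential", "financial_records": "confidential"},
        "actions": ["read"],
    },
}


def validate_span_type(name: str, definition: dict) -> list:
    """Return a list of problems with a declaration; empty means valid."""
    errors = []
    for direction in ("inputs", "outputs"):
        for category, level in definition.get(direction, {}).items():
            if level not in ALLOWED_LEVELS:
                errors.append(f"{name}.{direction}.{category}: unknown level {level!r}")
    for action in definition.get("actions", []):
        if action not in ALLOWED_ACTIONS:
            errors.append(f"{name}.actions: unknown action {action!r}")
    return errors
```

Because the declaration is plain data, it can be diffed in a pull request and rejected before deployment if `validate_span_type` reports anything.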
