Vidimus is an agent assurance platform for European enterprises. It lets you approve, test, monitor, and audit AI agents before and after production, and produces a regulator-grade evidence pack that names every obligation, every test, and the audit trail behind the verdict.

Where is our data hosted?

In the EU. The production database, object storage, and authentication run in the European Union (Supabase, Frankfurt), and default language-model processing uses an EU-based provider. Tenant data does not leave the bloc.

How does Vidimus actually test an agent?

For each agent it synthesises an adversarial test plan from the regulation corpus, sends those probes to your agent through HTTP, A2A, or MCP adapters, observes the tool calls the agent actually makes, and grades every probe with an independent judge model. Runs are reproducible and fully cited.

Is Vidimus a substitute for legal advice?

No. Vidimus produces the evidence and indicative verdicts your compliance, risk, and legal teams rely on. Outputs require human review before being used for a regulatory decision.

EU AI ActEU AI Pact signatory

Agent assurance for European enterprises

Vidimus is the control plane CIOs, Chief AI Officers, and CISOs use to approve, test, monitor, and audit AI agents before — and after — production deployment. Built EU-first: data stays in-region, controls map to the regulation, and every decision lands in an immutable audit trail.

Approve / Agent registry

Agent registry

Agents under governance

▲ 6 this quarter

Awaiting sign‑off

2 high‑risk

Live · monitored

all signals green

Evidence packs

Annex IV mapped

Registered agents regulated workloads

Risk: all ▾

Agent	Risk class	Status	Last review
CT Claims Triage Copilot Insurance · claims	High	In review	2 days ago
KY KYC Onboarding Agent Banking · onboarding	Limited	Approved	5 days ago
PI Patient Intake Assistant Healthcare · admissions	High	Monitoring	1 day ago
UW Underwriting Assistant Insurance · pricing	Limited	Approved	1 week ago
FR Fraud Signals Monitor Banking · risk	Minimal	Monitoring	3 days ago

Signed & sealed

Annex IV evidence pack

Built for regulated industriesBankingInsuranceHealthcarePublic sector

The platform

One platform from proposal to audit

Four surfaces that share one data model, so a probe failure in production can be traced back to the obligation it exercises and forward to the reviewer who signed the agent off.

Approve

A structured intake captures the agent’s purpose, model, data sources, tools, and human-oversight points. Vidimus classifies the agent against the EU AI Act risk tiers, evaluates a control checklist drawn from the EU AI Act, plus internal-security and vendor-risk baselines, and routes the file to the right reviewer with an immutable sign-off trail.

Test

For each agent intake we synthesise an adversarial test plan from the regulation corpus — prohibition probes, transparency probes, oversight probes, jailbreak resistance, tool overreach. Every probe is graded by an independent judge model. Runs are reproducible: prompt, response, verdict, and reasoning are all retained.

Monitor

Verdicts, drift signals, and configuration changes feed a live dashboard tied to the agents already in production. Regressions, new probe failures, and reviewer interventions surface in one place — not buried in separate observability stacks.

Audit

Every approval, override, and test run is appended to a tamper-evident log. The evidence pack is generated on demand: risk classification, applicable obligations, probe results, control verdicts, document evidence, and the full chronological trail. Mapped to AI Act Annex IV and your internal control catalogue.

Monitor / Live signals

Runtime assurance — Claims Triage Copilot

Behavioural drift rolling 24h

● live

Observed driftApproved baselineWithin tolerance

Signals

today

Compliance check passed

GDPR data‑handling · 14:02

Drift approaching tolerance

Output length +12% · 13:41

Abuse attempt blocked

Prompt‑injection pattern · 11:18

Regression suite green

312 / 312 scenarios · 09:30

99.98%

Uptime · 30d

Policy breaches

Abuse attempts blocked

142ms

Median check latency

The path

How it works

The same path every agent takes. Idempotent at each step so a paused or interrupted flow resumes without losing state.

01
Declare the agent
Intake captures purpose, data sources, model provider, tools, customer-facing scope, and oversight design. Validated with Zod at the boundary so the downstream risk derivation is honest.
02
Vidimus builds the regulatory test plan
A pattern bank × corpus synthesiser produces obligation-specific probes against the EU AI Act. A critic model filters the loose and the unwinnable so only defensible probes reach your agent.
03
Run the probes, collect the evidence
Probes are sent to your agent through HTTP, A2A, or MCP adapters. Each turn is graded; tool calls are observed; uploaded documentation is verified against the obligations it claims to address.
04
Hand a regulator the pack
Export a versioned, content-addressed evidence pack. Append-only audit trail, citations to verbatim regulation text, and reviewer decisions with reasons. Re-issuable, replayable, and tied to a specific intake fingerprint.

Compliance

Built to hold up to a regulator

Compliance is the substrate, not a feature. The architectural choices that matter to a supervisor are load-bearing, not optional flags.

EU data residency

Postgres, storage, and the model providers we default to are EU-region. Tenant data does not leave the bloc.

Tenant isolation in the database

Every table is scoped by org and enforced through Postgres row-level security — not just application checks.

Append-only audit trail

Approval decisions, control overrides, evidence-pack exports, and deployment events are immutable with actor, time, and reason.

Citable, replayable evidence

Probes quote the regulation verbatim; plans are content-addressed; uploaded documents are verified passage-by-passage against the obligations they cover.

Audit / Evidence pack

Evidence pack — Claims Triage Copilot

EU AI Act · Annex IV technical documentation

PACK #A‑1042 · GENERATED 31 MAY 2026 · SHA‑256 VERIFIED

System description & intended purpose

§1 · 4 docs

Risk classification & control checklist

§2 · sign‑off trail

Evaluation harness & regression results

§3 · 312 scenarios

Post‑market monitoring plan & logs

§4 · 30‑day window

Human‑oversight & incident procedures

§5 · mapped

Internal control catalogue cross‑reference

§6 · 28 controls

Coverage

100%

All Annex IV sections mapped to internal controls.

Attestation

Signed & sealed

by L. Okafor · Head of Assurance

0x8287d37…95ffd0

FAQ

Frequently asked questions

What is Vidimus?: Vidimus is an agent assurance platform for European enterprises. It lets you approve, test, monitor, and audit AI agents before and after production, and produces a regulator-grade evidence pack that names every obligation, every test, and the audit trail behind the verdict.
Which regulations does Vidimus cover?: Vidimus's automated assurance currently covers the EU AI Act, plus internal-security and vendor-risk baselines. GDPR and DORA coverage is on the roadmap.
Where is our data hosted?: In the EU. The production database, object storage, and authentication run in the European Union (Supabase, Frankfurt), and default language-model processing uses an EU-based provider. Tenant data does not leave the bloc.
How does Vidimus actually test an agent?: For each agent it synthesises an adversarial test plan from the regulation corpus, sends those probes to your agent through HTTP, A2A, or MCP adapters, observes the tool calls the agent actually makes, and grades every probe with an independent judge model. Runs are reproducible and fully cited.
Is Vidimus a substitute for legal advice?: No. Vidimus produces the evidence and indicative verdicts your compliance, risk, and legal teams rely on. Outputs require human review before being used for a regulatory decision.

Run a pilot on one agent

Bring one agent through the full path — intake, probe synthesis, run, and pack. Pilots take about two weeks and end with an evidence pack we walk through with your risk and compliance leads.