Production AI Agents for Teams Shipping Software in Regulated Domains

Drop-in AI agents, QualityProfit baked in.

Production-grade AI agents for teams shipping software with Claude Code, Cursor or Copilot — each with QualityProfit's quality and observability layer wired in from day one. Built for insurance, financial services and government, where privacy, security and audit trails aren't optional.

Pre-order opens Q3 2026. Need a custom build instead? Paul takes a few Studio engagements per year.

15+
Years
QA & test automation
14
Clients
Enterprise & government
8+
AI Agents
QualityProfit + NVM monorepo
1
SaaS
Solo-built, shipping
Ambassador
Cypress.io
Who this is for

Built for software teams where auditors care.

The agent catalog itself is stack-agnostic — it runs alongside whatever you ship. But we built it for one kind of customer: software teams in regulated domains where AI-assisted code shipping needs to be auditable, traceable and defensible. Insurance, financial services, government. The places where "we'll fix it next sprint" isn't an acceptable answer to a regulator.

Insurance & financial services

DORA is here. So is the auditor.

Insurers, banks, payment platforms, asset managers. DORA, GDPR, internal audit, third-party ICT risk — your AI-augmented delivery pipeline has to be defensible to people who don't write code. The catalog gives you agents that gate, classify and log every change, with QualityProfit translating it all into financial-impact language for the people who sign your budget.

Government & public sector

Auditable AI in delivery, by default.

Ministries, public-service implementers, government IT bodies. Algoritmeregister, AVG, BIO, NPR 5326, EU AI Act. The agents run inside your repo and your CI — not on a third-party cloud you don't control. Your AI-assisted delivery gets an evidence trail that survives an inspector and a change of administration, with privacy and data residency answered by architecture, not by paperwork.

Engineering & QA leadership

Velocity is up. Quality signal isn't.

CTOs, VPs of Engineering, Heads of QA in regulated organisations. Your team shipped fast with AI coding agents — review queues are a swamp and your CFO is asking what you're getting for the cost. The catalog gives you agents that gate, route and report, with QualityProfit's executive dashboard underneath.

Not for you if…

We don't sell to everyone.

You're shopping for a generic AI-agent vendor with no regulatory story. You want a one-off Cypress audit. You're a pure consumer-tech consumer-internet team where "move fast, break things" is still the operating model. The catalog isn't for you — and we'd rather tell you that now than discover it in week 6.

The problem

AI ships fast. Audit trails don't ship with it.

Your team adopted Claude Code, Cursor or Copilot. Velocity went up. But now you have review queues full of plausible-looking code no one really checked, tests that pass without exercising anything, and a creeping sense that your audit story is built on screenshots and hope.

In regulated industries, that gap isn't a productivity issue — it's a risk issue. DORA, the EU AI Act, NIS2, internal audit, your auditor's auditor: all of them are about to ask how AI-assisted shipping is controlled. Most engineering teams don't have a defensible answer.

We close that gap. Test architectures designed for AI extension. Custom agents that gate every push with evidence attached. A Language-First approach that makes specs, scenarios and tests legible to humans and LLMs — and to the auditor reading them next quarter.

Agent catalog · Pre-order Q3 2026

Production-ready AI agents — with audit trails included.

Drop-in agents for teams using Claude Code, Cursor, Copilot or similar — each with QualityProfit's quality and observability layer baked in by default. Distilled from real client work in insurance, financial services and government. Pre-order opens Q3 2026.

Compliance & risk · Shipping first

Pre-order · Q3 2026 First to ship

risk-classifier

Classifies every PR by business risk — auth, payments, PII, data migrations, external APIs — using project-specific rules + AST analysis. Routes to specialist reviewers and tunes gate strictness automatically.

QP integration: every classification logged to QualityProfit's Quality Cost Ledger with a confidence-weighted financial impact estimate.
Get notified →
Pre-order · Q3 2026 First to ship

compliance-guard

Checks code and tests against control frameworks — ISO/IEC 27001, GDPR, SOC2, DORA, NIS2, EU AI Act, WCAG. Flags violations as PR comments with the exact control IDs, so audit prep stops being archaeology.

QP integration: compliance evidence becomes a QualityProfit audit-ready report — defensible to inspectors, exportable for DPAs.
Get notified →

Reliability & operations

Pre-order · Q3 2026

incident-replay

Takes a production incident report, generates regression test(s) that would have caught it, and opens the PR. Closes the loop from outage to evidence — no more "we'll write the test next sprint."

QP integration: cost-of-incident logged to QP and attributed to the regressing change. Post-mortems gain a financial axis automatically.
Get notified →
Pre-order · Q3 2026

perf-regression-detector

Runs benchmarks per commit — Web Vitals, p95 latencies, DB query plans. Fails the gate on regressions outside per-route tolerance bands. Tuned to your SLOs, not generic thresholds.

QP integration: regressions tagged with cost impact in the Quality Cost Ledger; performance becomes a line item, not a gut feeling.
Get notified →

Quality & ROI

Pre-order · Q3 2026

roi-tracker

Tags every gate run with a "value-prevented" estimate (cost-of-bug-in-prod × confidence). Aggregates monthly into an executive PDF your CFO will actually read.

QP integration: the canonical QualityProfit on-ramp — outputs become the Quality Cost Ledger live, no double-bookkeeping.
Get notified →
Pre-order · Q3 2026

flaky-quarantine

Detects flaky tests using rolling pass/fail history. Auto-quarantines after N flakes, opens a debugging issue with a templated checklist, removes the flake from the gate so the rest of CI stays green.

QP integration: flake-cost surfaced in QualityProfit — visible to engineering leaders, not buried under CI logs no one reads.
Get notified →

Orchestration

Pre-order · Q3 2026

api-contract-guard

Detects breaking API changes between commits — REST, GraphQL, gRPC. Compares OpenAPI / SDL against the last green build, warns on backward-incompatible deltas, suggests a deprecation path.

QP integration: contract drift surfaced in QualityProfit as quality-cost events; downstream-team impact estimated automatically.
Get notified →
Pre-order · Q4 2026

agent-conductor

Orchestrates the open-source 8-agent template + your proprietary agents. Decides which run when, balances load, batches model calls, dedupes runs. The second-order agent that makes a 6-agent stack tractable.

QP integration: per-agent activity, false-positive rate and cost reported to QualityProfit for executive visibility on the agent layer itself.
Get notified →
How it works. Annual licenses bundle the agent + a QualityProfit subscription. Final pricing announced with the first cohort in Q3 2026 — early-pre-order signups get pilot pricing, roadmap influence, and named-customer status on the first case study. compliance-guard and risk-classifier ship first; the rest follow through Q3–Q4 2026. Email Paul to pre-order →
Selected Work

What this looks like in practice.

From government rollouts to solo SaaS — with audit trails attached.

RvO · NL Government2024 — 2025

Language-First in gov.

Cypress + Playwright architecture

Test architecture across multiple government departments where specifications, scenarios and tests share one continuous human-readable layer. Presented at CypressConf 2024 — "Beyond the Battle: Empowering Test Automation with a Language-First Approach."

Cypress · Playwright · TypeScript · Lerna · Artillery · Gherkin · Blueriq · GitLab · SonarQube
VGZ · Insurance2022 — 2024

Architect for the long run.

Test Automation Architect

Cypress + Lit Elements test architecture with Cucumber traceability, integrated into Azure DevOps. Built to outlive me — handed back to the team.

Cypress · Lit Elements · Cucumber · Azure DevOps
Ministry of Foreign Affairs · Government2021 / 2022 — 2024

Global rollout, audited.

QA Architect & Test Manager

Test management for a worldwide rollout under ISO 25010 / TMap discipline; earlier engagement covered Cypress / Angular / Docker on Azure DevOps.

ISO 25010 · TMap · Cypress · Angular · Docker · Azure DevOps
Evides · National Utility2024 — present

Quality Framework rollout.

Quality Assurance Manager

TMMi-aligned Quality Framework on top of ISO/IEC 25010, embedded in delivery pipelines for a national utility.

TMMi · ISO/IEC 25010 · Quality Framework
QualityProfit · Solo SaaS2024 — present

Solo SaaS, four agents.

Founder · Full-stack with Claude Code

A customer-deployed dashboard that turns Jira / Azure DevOps / GitHub / GitLab signals into financial ROI for QA. Four in-repo subagents: release-reviewer, deploy-monitor, onboarding-smoke-tester, requirements-guard.

Python · FastAPI · Pydantic · React · Cypress · Docker · Caddy · Stripe · Claude Code
New Orange Digital Agency2024 — present

Their AI test stack, productized.

Architecture · Framework · Claude Code skill

Built an AI-augmented Playwright architecture, framework and reusable Claude Code skill for a Dutch digital agency — designed to drop into any current or future client engagement, not built for a single project. Codifies project structure, Page Object pattern, AC/TS traceability and per-feature coverage matrix. Proof that productized AI-testing assets work at agency scale — the same thesis the catalog rests on.

Playwright · TypeScript · Next.js 16 · Turborepo · Tailwind v4 · Claude Code · Cursor · Copilot
— Core product · Pilot live

Your teams prevent millions in losses. Now prove it.

QualityProfit makes the invisible costs of software delivery visible, measurable and actionable. Connect Jira, Azure DevOps, GitHub or GitLab — get a Quality Cost Ledger, a Correlation Engine that traces causal chains, and executive dashboards that translate quality into financial impact. Built for the conversations regulated organisations are about to have with their auditors. The panel on the right mirrors the sample 250-engineer projection from qualityprofit.io.

  • release-reviewer — gates pushes on risk patterns
  • deploy-monitor — verifies container digests on the VPS
  • onboarding-smoke-tester — walks the wizard end-to-end via the real API
  • requirements-guard — reconciles spec against live code
Join the pilot at qualityprofit.io →
Custom engagements · ~10% of the work

Need something the catalog doesn't cover?

Paul takes a small number of custom Studio engagements per year for clients in insurance, financial services and government whose agent need doesn't fit a productized offering — or whose procurement reality requires a fixed-scope contract. Each is bundled with QualityProfit and starts with a free 30-minute scoping call. Standard DPA, contract templates, and DPIA support available — prices in EUR, USD/GBP equivalents on request.

Sprint · 6w · €25–35K Embed · 12w · €60–95K Train · cohort · €15–25K Care · monthly · €3–5K
Book a 30-min scoping call →
Tech stack

What I bring into your repo.

Pragmatic, opinionated, and chosen for AI extension — not novelty.

AI / Agents
Claude Code · Custom subagents · Hooks · Prompt engineering · AGENTS.md / SKILL.md · Cursor · GitHub Copilot · Windsurf
Testing
Cypress.io · Playwright · Jest · Cucumber / Gherkin · Postman · Artillery · JMeter · axe-core · TestNG · Selenium
Frontend
TypeScript · React · Next.js · Vue · Angular · Lit · Tailwind · Turborepo · Lerna
Backend
Python · FastAPI · Pydantic · Java · Hibernate / JPA · Node · REST · GraphQL
DevOps
Docker · GitHub Actions · GitLab CI · Azure DevOps · TeamCity · Jenkins · Caddy · SonarQube
Quality
TMMi · ISO/IEC 25010 · NPR 5326 · TMap · SHEQC Grooming · OTAP · CI/CD · Page Object pattern
Integrations
Jira · GitHub · GitLab · Azure DevOps · Blueriq · Sitecore · Stripe · AWS Cognito
Career timeline

15+ years across enterprise & government.

A selection — earlier roles span ING, SBB, Ministry of Foreign Affairs, ZLM, KPN and lecturing at The Hague University of Applied Sciences.

2024 — 2025
RvO (NL Government)
Quality Assurance Manager
2024 — present
Evides
Quality Assurance Manager
2024 — present
QualityProfit
Founder · Solo SaaS
2024 — present
New Orange Digital Agency
AI test stack productized
2022 — 2024
VGZ
Test Automation Architect
2022 — 2024
Ministry of Foreign Affairs
Test Manager
2022 — 2023
Aon
Quality Automation Architect
2021
Ministry of Foreign Affairs
QA Architect
2021
CZ
Test Automation Specialist
2020
Harlem Next · Nederlandse Transplantatie Stichting
Test Automation Specialist
2019 — 2020
Aon
Quality Assurance Manager
2018 — 2019
ING
Test Automation Specialist
Let's talk

Got an AI rollout that's outpacing your audit trail?

Book a free 30-minute call. I'll listen, ask sharp questions, and tell you honestly whether I'm the right person for the job — or whether you'd be better served by someone else.