Production AI Agents for IT Teams Shipping Software

Drop-in AI agents, QualityProfit baked in.

Production-grade AI agents for any IT team shipping software with Claude Code, Cursor or Copilot — each with QualityProfit's quality and observability layer wired in from day one. Pre-order opens Q3 2026. Need a custom build instead? Paul also takes a few Studio engagements per year.

15+
Years
QA & test automation
14
Clients
Enterprise & government
8+
AI Agents
QualityProfit + NVM monorepo
1
SaaS
Solo-built, shipping
Ambassador
Cypress.io
Who this is for

For every IT team shipping software with AI.

If your team uses Claude Code, Cursor, Copilot or anything else that writes code for you — these agents are for you. Industry-agnostic. Stack-agnostic. The agent layer rides alongside whatever you ship; QualityProfit makes the work visible to the people who pay the bill.

Engineering leaders

Velocity is up. Quality signal isn't.

CTOs, VPs of Engineering, Heads of Platform. Your team shipped fast with AI coding agents — now PR review queues are a swamp and your CFO is asking what you're getting for the cost. The catalog gives you agents that gate, route and report, with QualityProfit's executive dashboard underneath.

Developers & architects

Tests that read like specs, agents that catch what reviews miss.

You write code with Claude Code or Cursor. We build the agents that ride alongside — risk-classifying PRs, catching contract drift, quarantining flakes, replaying incidents into regression tests. Drop them into your repo, keep your velocity, lose the silent regressions.

QA & AI specialists

Become the gatekeeper of AI-generated code.

QA leads, test architects, AI specialists, in-house agent builders. Stop being the bottleneck on AI-augmented PRs. The agents do the first-pass triage so you can do the high-value review. QualityProfit makes the work visible upward and the cost visible to leadership.

The Problem

AI ships fast. Tests need to keep up.

Your team adopted Claude Code, Cursor or Copilot. Velocity went up. But review queues are now full of plausible-looking code no one really checked, tests that pass without exercising anything, and a creeping sense that quality is slipping behind speed.

I help engineering teams close that gap — with test architectures designed for AI extension, custom agents that gate every push, and a Language-First approach that makes specs, scenarios and tests legible to humans and LLMs.

Agent catalog · Pre-order Q3 2026

Production-ready AI agents for IT teams shipping software.

Drop-in agents for any team using Claude Code, Cursor, Copilot or similar — each with QualityProfit's quality and observability layer baked in by default. Distilled from real client work, not built from assumption. Pre-order opens Q3 2026.

Compliance & risk

Pre-order · Q3 2026

risk-classifier

Classifies every PR by business risk — auth, payments, PII, data migrations, external APIs — using project-specific rules + AST analysis. Routes to specialist reviewers and tunes gate strictness automatically.

QP integration: every classification logged to QualityProfit's Quality Cost Ledger with a confidence-weighted financial impact estimate.
Get notified →
Pre-order · Q3 2026

compliance-guard

Checks code and tests against control frameworks — ISO/IEC 27001, GDPR, SOC2, DORA, NIS2, WCAG. Flags violations as PR comments with the exact control IDs, so audit prep stops being archaeology.

QP integration: compliance evidence becomes a QualityProfit audit-ready report — defensible to inspectors, exportable for DPAs.
Get notified →

Reliability & operations

Pre-order · Q3 2026

incident-replay

Takes a production incident report, generates regression test(s) that would have caught it, and opens the PR. Closes the loop from outage to evidence — no more "we'll write the test next sprint."

QP integration: cost-of-incident logged to QP and attributed to the regressing change. Post-mortems gain a financial axis automatically.
Get notified →
Pre-order · Q3 2026

perf-regression-detector

Runs benchmarks per commit — Web Vitals, p95 latencies, DB query plans. Fails the gate on regressions outside per-route tolerance bands. Tuned to your SLOs, not generic thresholds.

QP integration: regressions tagged with cost impact in the Quality Cost Ledger; performance becomes a line item, not a gut feeling.
Get notified →

Quality & ROI

Pre-order · Q3 2026

roi-tracker

Tags every gate run with a "value-prevented" estimate (cost-of-bug-in-prod × confidence). Aggregates monthly into an executive PDF your CFO will actually read.

QP integration: the canonical QualityProfit on-ramp — outputs become the Quality Cost Ledger live, no double-bookkeeping.
Get notified →
Pre-order · Q3 2026

flaky-quarantine

Detects flaky tests using rolling pass/fail history. Auto-quarantines after N flakes, opens a debugging issue with a templated checklist, removes the flake from the gate so the rest of CI stays green.

QP integration: flake-cost surfaced in QualityProfit — visible to engineering leaders, not buried under CI logs no one reads.
Get notified →

Orchestration

Pre-order · Q4 2026

agent-conductor

Orchestrates the open-source 8-agent template + your proprietary agents. Decides which run when, balances load, batches model calls, dedupes runs. The second-order agent that makes a 6-agent stack tractable.

QP integration: per-agent activity, false-positive rate and cost reported to QualityProfit for executive visibility on the agent layer itself.
Get notified →
Pre-order · Q3 2026

api-contract-guard

Detects breaking API changes between commits — REST, GraphQL, gRPC. Compares OpenAPI / SDL against the last green build, warns on backward-incompatible deltas, suggests a deprecation path.

QP integration: contract drift surfaced in QualityProfit as quality-cost events; downstream-team impact estimated automatically.
Get notified →
How it works. Annual licenses bundle the agent + a QualityProfit subscription. Final pricing announced with the first cohort in Q3 2026 — early-pre-order signups get pilot pricing and a say in the agent's first-version roadmap. Email Paul to pre-order →
Selected Work

What this looks like in practice.

From solo SaaS to multi-team government rollouts.

QualityProfit2024 — present

Solo SaaS, four agents.

Founder · Full-stack with Claude Code

A customer-deployed dashboard that turns Jira / Azure DevOps / GitHub / GitLab signals into financial ROI for QA. Four in-repo subagents: release-reviewer, deploy-monitor, onboarding-smoke-tester, requirements-guard.

Python · FastAPI · Pydantic · React · Cypress · Docker · Caddy · Stripe · Claude Code
New Orange / NVM2024 — present

AI Playwright at scale.

Test-generation architecture

Structured Playwright E2E test-generation prompt as procedural context for AI coding agents. Codifies project structure, Page Object pattern, AC/TS traceability and per-feature coverage matrix.

Playwright · TypeScript · Next.js 16 · Turborepo · Tailwind v4 · Claude Code · Cursor · Copilot
RvO · NL Government2024 — 2025

Language-First in gov.

Cypress + Playwright architecture

Test architecture across multiple government departments where specifications, scenarios and tests share one continuous human-readable layer. Presented at CypressConf 2024 — "Beyond the Battle: Empowering Test Automation with a Language-First Approach."

Cypress · Playwright · TypeScript · Lerna · Artillery · Gherkin · Blueriq · GitLab · SonarQube
Evides2024 — present

Quality Framework rollout.

Quality Assurance Manager

TMMi-aligned Quality Framework on top of ISO/IEC 25010, embedded in delivery pipelines for a national utility.

TMMi · ISO/IEC 25010 · Quality Framework
VGZ2022 — 2024

Architect for the long run.

Test Automation Architect

Cypress + Lit Elements test architecture with Cucumber traceability, integrated into Azure DevOps. Built to outlive me — handed back to the team.

Cypress · Lit Elements · Cucumber · Azure DevOps
Ministry of Foreign Affairs2021 / 2022 — 2024

Global rollout, audited.

QA Architect & Test Manager

Test management for a worldwide rollout under ISO 25010 / TMap discipline; earlier engagement covered Cypress / Angular / Docker on Azure DevOps.

ISO 25010 · TMap · Cypress · Angular · Docker · Azure DevOps
— Core product · Pilot live

Your teams prevent millions in losses. Now prove it.

QualityProfit makes the invisible costs of software delivery visible, measurable and actionable. Connect Jira, Azure DevOps, GitHub or GitLab — get a Quality Cost Ledger, a Correlation Engine that traces causal chains, and executive dashboards that translate quality into financial impact. The panel on the right mirrors the sample 250-engineer projection from qualityprofit.io.

  • release-reviewer — gates pushes on risk patterns
  • deploy-monitor — verifies container digests on the VPS
  • onboarding-smoke-tester — walks the wizard end-to-end via the real API
  • requirements-guard — reconciles spec against live code
Join the pilot at qualityprofit.io →
Custom engagements · ~10% of the work

Need something the catalog doesn't cover?

Paul takes a few custom Studio engagements per year for clients whose agent need doesn't fit a productized agent. Each is bundled with QualityProfit and starts with a free 30-minute scoping call. Standard DPA and contract templates available — prices in EUR, USD/GBP equivalents on request.

Sprint · 6w · €25–35K Embed · 12w · €60–95K Train · cohort · €15–25K Care · monthly · €3–5K
Book a 30-min scoping call →
Tech stack

What I bring into your repo.

Pragmatic, opinionated, and chosen for AI extension — not novelty.

AI / Agents
Claude Code · Custom subagents · Hooks · Prompt engineering · AGENTS.md / SKILL.md · Cursor · GitHub Copilot · Windsurf
Testing
Cypress.io · Playwright · Jest · Cucumber / Gherkin · Postman · Artillery · JMeter · axe-core · TestNG · Selenium
Frontend
TypeScript · React · Next.js · Vue · Angular · Lit · Tailwind · Turborepo · Lerna
Backend
Python · FastAPI · Pydantic · Java · Hibernate / JPA · Node · REST · GraphQL
DevOps
Docker · GitHub Actions · GitLab CI · Azure DevOps · TeamCity · Jenkins · Caddy · SonarQube
Quality
TMMi · ISO/IEC 25010 · NPR 5326 · TMap · SHEQC Grooming · OTAP · CI/CD · Page Object pattern
Integrations
Jira · GitHub · GitLab · Azure DevOps · Blueriq · Sitecore · Stripe · AWS Cognito
Career timeline

15+ years across enterprise & government.

A selection — earlier roles span ING, SBB, Ministry of Foreign Affairs, ZLM, KPN and lecturing at The Hague University of Applied Sciences.

2024 — present
QualityProfit
Founder · Solo SaaS
2024 — present
New Orange / NVM
AI test-gen architecture
2024 — present
Evides
Quality Assurance Manager
2024 — 2025
RvO (NL Government)
Quality Assurance Manager
2022 — 2024
VGZ
Test Automation Architect
2022 — 2024
Ministry of Foreign Affairs
Test Manager
2022 — 2023
Aon
Quality Automation Architect
2021
Ministry of Foreign Affairs
QA Architect
2021
CZ
Test Automation Specialist
2020
Harlem Next · Nederlandse Transplantatie Stichting
Test Automation Specialist
2019 — 2020
Aon
Quality Assurance Manager
2018 — 2019
ING
Test Automation Specialist
Let's talk

Got an AI rollout that's outpacing your tests?

Book a free 30-minute call. I'll listen, ask sharp questions, and tell you honestly whether I'm the right person for the job.