Drop-in AI agents, QualityProfit baked in.
Production-grade AI agents for any IT team shipping software with Claude Code, Cursor or Copilot — each with QualityProfit's quality and observability layer wired in from day one. Pre-order opens Q3 2026. Need a custom build instead? Paul also takes a few Studio engagements per year.
For every IT team shipping software with AI.
If your team uses Claude Code, Cursor, Copilot or anything else that writes code for you — these agents are for you. Industry-agnostic. Stack-agnostic. The agent layer rides alongside whatever you ship; QualityProfit makes the work visible to the people who pay the bill.
Velocity is up. Quality signal isn't.
CTOs, VPs of Engineering, Heads of Platform. Your team shipped fast with AI coding agents — now PR review queues are a swamp and your CFO is asking what you're getting for the cost. The catalog gives you agents that gate, route and report, with QualityProfit's executive dashboard underneath.
Tests that read like specs, agents that catch what reviews miss.
You write code with Claude Code or Cursor. We build the agents that ride alongside — risk-classifying PRs, catching contract drift, quarantining flakes, replaying incidents into regression tests. Drop them into your repo, keep your velocity, lose the silent regressions.
Become the gatekeeper of AI-generated code.
QA leads, test architects, AI specialists, in-house agent builders. Stop being the bottleneck on AI-augmented PRs. The agents do the first-pass triage so you can do the high-value review. QualityProfit makes the work visible upward and the cost visible to leadership.
AI ships fast. Tests need to keep up.
Your team adopted Claude Code, Cursor or Copilot. Velocity went up. But review queues are now full of plausible-looking code no one really checked, tests that pass without exercising anything, and a creeping sense that quality is slipping behind speed.
I help engineering teams close that gap — with test architectures designed for AI extension, custom agents that gate every push, and a Language-First approach that makes specs, scenarios and tests legible to humans and LLMs.
The templates and writing we put in the open.
Top of funnel for the Studio. Fork the orchestration templates, read the field notes, get the Playbook when it lands.
orchestration-playwright-agents
Drop-in Claude Code orchestration template for Playwright E2E: master prompt as a skill, 8 specialised sub-agents, slash commands, starter e2e/ folder. Adapt it to your repo in a day.
View on GitHub →
orchestration-cypress-agents
Sister template for Cypress: master prompt as a skill, 8 sub-agents, slash commands, starter cypress/ folder. Same pattern, framework-native.
View on GitHub →
Field notes from AI-augmented engineering
Long-form on Language-First test design, in-repo subagents and the messy reality of shipping with LLMs. Monthly cadence resuming Q3 2026 — see the writing roadmap.
Read on Medium →
The AI-Augmented E2E Playbook
15-page PDF: Language-First architecture, AGENTS.md scaffolds, Page Object pattern with AC/TS traceability, per-feature coverage matrix. Bundles three Medium pieces into one printable artifact.
Get notified →
Production-ready AI agents for IT teams shipping software.
Drop-in agents for any team using Claude Code, Cursor, Copilot or similar — each with QualityProfit's quality and observability layer baked in by default. Distilled from real client work, not built from assumption. Pre-order opens Q3 2026.
Compliance & risk
risk-classifier
Classifies every PR by business risk — auth, payments, PII, data migrations, external APIs — using project-specific rules + AST analysis. Routes to specialist reviewers and tunes gate strictness automatically.
compliance-guard
Checks code and tests against control frameworks: ISO/IEC 27001, GDPR, SOC 2, DORA, NIS2, WCAG. Flags violations as PR comments with the exact control IDs, so audit prep stops being archaeology.
Reliability & operations
incident-replay
Takes a production incident report, generates regression test(s) that would have caught it, and opens the PR. Closes the loop from outage to evidence — no more "we'll write the test next sprint."
perf-regression-detector
Runs benchmarks per commit — Web Vitals, p95 latencies, DB query plans. Fails the gate on regressions outside per-route tolerance bands. Tuned to your SLOs, not generic thresholds.
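As a rough illustration of a per-route tolerance band, here is a minimal Python sketch. It is not the agent's implementation: the `RouteBudget` type, the `find_regressions` function, the route names and all numbers are hypothetical, and it assumes a band is expressed as an allowed percentage slowdown against a baseline p95.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RouteBudget:
    """Hypothetical per-route budget; field names are assumptions."""
    baseline_p95_ms: float  # p95 latency from the last accepted build
    tolerance_pct: float    # allowed slowdown for this route, in percent


def find_regressions(
    budgets: dict[str, RouteBudget],
    current_p95_ms: dict[str, float],
) -> dict[str, float]:
    """Return {route: slowdown_pct} for routes outside their tolerance band."""
    regressions: dict[str, float] = {}
    for route, budget in budgets.items():
        if route not in current_p95_ms:
            # No measurement this run; a real gate might treat this as a failure.
            continue
        delta = current_p95_ms[route] - budget.baseline_p95_ms
        slowdown_pct = delta / budget.baseline_p95_ms * 100
        if slowdown_pct > budget.tolerance_pct:
            regressions[route] = round(slowdown_pct, 1)
    return regressions


# Illustrative numbers only: a tight band on checkout, a loose one on search.
budgets = {
    "/checkout": RouteBudget(baseline_p95_ms=200.0, tolerance_pct=5.0),
    "/search": RouteBudget(baseline_p95_ms=400.0, tolerance_pct=20.0),
}
current = {"/checkout": 230.0, "/search": 440.0}
failed = find_regressions(budgets, current)
gate_passes = not failed
```

In this example `/checkout` slows by 15% against a 5% band and fails the gate, while `/search` slows by 10% inside its 20% band and passes.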
Quality & ROI
roi-tracker
Tags every gate run with a "value-prevented" estimate (cost-of-bug-in-prod × confidence). Aggregates monthly into an executive PDF your CFO will actually read.
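The "value-prevented" formula above can be sketched in a few lines of Python. This is an illustration under assumptions, not the product's code: the `GateRun` type, its field names, the EUR amounts and the month labels are all made up for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class GateRun:
    """Hypothetical record of one gate run; field names are assumptions."""
    month: str              # reporting bucket, e.g. "2026-07"
    cost_of_bug_eur: float  # estimated cost had the bug reached production
    confidence: float       # 0.0 to 1.0: how sure we are a real bug was caught


def value_prevented(run: GateRun) -> float:
    """cost-of-bug-in-prod x confidence, as described in the text."""
    if not 0.0 <= run.confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    return run.cost_of_bug_eur * run.confidence


def monthly_totals(runs: list[GateRun]) -> dict[str, float]:
    """Aggregate value-prevented estimates per month."""
    totals: dict[str, float] = defaultdict(float)
    for run in runs:
        totals[run.month] += value_prevented(run)
    return dict(totals)


# Illustrative data only.
runs = [
    GateRun("2026-07", 10_000.0, 0.5),
    GateRun("2026-07", 4_000.0, 0.25),
    GateRun("2026-08", 2_000.0, 0.5),
]
totals = monthly_totals(runs)
```

With these sample runs, July totals 6,000 EUR and August 1,000 EUR. The estimate is only as good as the cost and confidence inputs, which is why both are kept explicit per run.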
flaky-quarantine
Detects flaky tests using rolling pass/fail history. Auto-quarantines after N flakes, opens a debugging issue with a templated checklist, removes the flake from the gate so the rest of CI stays green.
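One way to picture "rolling pass/fail history" and "quarantine after N flakes" is the Python sketch below. It makes a loud assumption the text does not state: a flake is counted whenever a failure is immediately followed by a pass inside the rolling window. The `FlakeTracker` class, its defaults and the test names are hypothetical.

```python
from collections import deque


class FlakeTracker:
    """Hypothetical tracker; window size and threshold are assumptions."""

    def __init__(self, window: int = 20, max_flakes: int = 3) -> None:
        self.window = window
        self.max_flakes = max_flakes
        self.history: dict[str, deque[bool]] = {}

    def record(self, test: str, passed: bool) -> None:
        """Append one result; the deque drops results older than the window."""
        runs = self.history.setdefault(test, deque(maxlen=self.window))
        runs.append(passed)

    def flake_count(self, test: str) -> int:
        """Count fail-then-pass flips in the window (assumed flake definition)."""
        runs = list(self.history.get(test, ()))
        return sum(1 for prev, cur in zip(runs, runs[1:]) if not prev and cur)

    def quarantined(self) -> list[str]:
        """Tests at or above the flake threshold, sorted by name."""
        return sorted(
            test for test in self.history
            if self.flake_count(test) >= self.max_flakes
        )


tracker = FlakeTracker(window=20, max_flakes=3)
for passed in [False, True, False, True, False, True]:
    tracker.record("login", passed)      # flips back and forth: flaky
for passed in [False, False, False, False]:
    tracker.record("checkout", passed)   # fails consistently: a real failure
for passed in [True, True, True, True]:
    tracker.record("search", passed)     # stable
to_quarantine = tracker.quarantined()
```

Under this definition only `login` is quarantined. A consistently failing test such as `checkout` stays in the gate, because it signals a real defect and not flakiness.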
Orchestration
agent-conductor
Orchestrates the open-source 8-agent template + your proprietary agents. Decides which run when, balances load, batches model calls, dedupes runs. The second-order agent that makes a multi-agent stack tractable.
api-contract-guard
Detects breaking API changes between commits — REST, GraphQL, gRPC. Compares OpenAPI / SDL against the last green build, warns on backward-incompatible deltas, suggests a deprecation path.
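For the REST case, the core comparison can be sketched as a diff of the `paths` section of two OpenAPI documents. The Python below is a simplified illustration, not the agent's logic: the `breaking_changes` function and the sample specs are hypothetical, and it only covers removed paths and removed operations, not parameter or schema changes.

```python
HTTP_METHODS = {"get", "put", "post", "delete", "patch", "head", "options", "trace"}


def breaking_changes(old_spec: dict, new_spec: dict) -> list[str]:
    """List paths and operations present in old_spec but missing in new_spec."""
    problems: list[str] = []
    new_paths = new_spec.get("paths", {})
    for path, old_item in old_spec.get("paths", {}).items():
        if path not in new_paths:
            problems.append(f"removed path {path}")
            continue
        for method in old_item:
            # Path items may also hold non-operation keys such as "parameters".
            if method in HTTP_METHODS and method not in new_paths[path]:
                problems.append(f"removed operation {method.upper()} {path}")
    return problems


# Illustrative specs, already parsed into dicts (e.g. from YAML or JSON).
last_green = {
    "paths": {
        "/users": {"get": {}, "post": {}},
        "/orders": {"get": {}},
    }
}
candidate = {
    "paths": {
        "/users": {"get": {}},
        "/invoices": {"get": {}},
    }
}
problems = breaking_changes(last_green, candidate)
```

Here the candidate drops `POST /users` and the whole `/orders` path, so both are reported. Adding `/invoices` is backward-compatible and is not flagged.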
What this looks like in practice.
From solo SaaS to multi-team government rollouts.
Solo SaaS, four agents.
Founder · Full-stack with Claude Code
A customer-deployed dashboard that turns Jira / Azure DevOps / GitHub / GitLab signals into financial ROI for QA. Four in-repo subagents: release-reviewer, deploy-monitor, onboarding-smoke-tester, requirements-guard.
AI Playwright at scale.
Test-generation architecture
Structured Playwright E2E test-generation prompt as procedural context for AI coding agents. Codifies project structure, Page Object pattern, AC/TS traceability and per-feature coverage matrix.
Language-First in gov.
Cypress + Playwright architecture
Test architecture across multiple government departments where specifications, scenarios and tests share one continuous human-readable layer. Presented at CypressConf 2024 — "Beyond the Battle: Empowering Test Automation with a Language-First Approach."
Quality Framework rollout.
Quality Assurance Manager
TMMi-aligned Quality Framework on top of ISO/IEC 25010, embedded in delivery pipelines for a national utility.
Architect for the long run.
Test Automation Architect
Cypress + Lit Elements test architecture with Cucumber traceability, integrated into Azure DevOps. Built to outlive me — handed back to the team.
Global rollout, audited.
QA Architect & Test Manager
Test management for a worldwide rollout under ISO/IEC 25010 / TMap discipline; earlier engagement covered Cypress / Angular / Docker on Azure DevOps.
Your teams prevent millions in losses. Now prove it.
QualityProfit makes the invisible costs of software delivery visible, measurable and actionable. Connect Jira, Azure DevOps, GitHub or GitLab — get a Quality Cost Ledger, a Correlation Engine that traces causal chains, and executive dashboards that translate quality into financial impact. The panel on the right mirrors the sample 250-engineer projection from qualityprofit.io.
- release-reviewer — gates pushes on risk patterns
- deploy-monitor — verifies container digests on the VPS
- onboarding-smoke-tester — walks the wizard end-to-end via the real API
- requirements-guard — reconciles spec against live code
Need something the catalog doesn't cover?
Paul takes a few custom Studio engagements per year for clients whose needs don't fit a productized agent. Each is bundled with QualityProfit and starts with a free 30-minute scoping call. Standard DPA and contract templates are available. Prices are in EUR, with USD/GBP equivalents on request.
What I bring into your repo.
Pragmatic, opinionated, and chosen for AI extension — not novelty.
15+ years across enterprise & government.
A selection — earlier roles span ING, SBB, Ministry of Foreign Affairs, ZLM, KPN and lecturing at The Hague University of Applied Sciences.
Pragmatic guides on Cypress, testing & automation.
On Medium since 2020, with 10+ deep-dives on Cypress patterns, ROI for testing, and test strategy. New AI-augmented engineering pieces landing on this site through 2026.
Pro-tip: stub the window object. A practical walkthrough for the multi-tab problem Cypress users hit constantly.
Cross-origin testing is finally there. What changed, what to watch for, and how to migrate your auth flows.
And why you should care. Where the lines are, why teams confuse them, and how to pick the right tool for the assertion.
On stage, on a podcast, in your team's Slack.
Cypress.io Ambassador, conference speaker, certified didactical trainer.
Cypress Ambassador
Active community work
Conference Speaker
CypressConf 2024 + 2025 workshops
Certified Trainer
Software testing & QA · Post-HBO didactical
Talks on YouTube
Recorded talks and workshops.
Cypress: The Bad Practices Workshop
Hands-on tour of the Cypress anti-patterns we keep meeting in real codebases — and how to refactor out of them. Co-presented with Frits van der Sloot.
Watch on YouTube →
Beyond the Battle: Empowering Test Automation with a Language-First Approach
How specs, scenarios and tests can share one continuous human-readable layer — and why that shape makes AI extension tractable.
Watch on YouTube →
Effective Test Automation Design
The architecture decisions that make a test suite outlive the team that wrote it — Page Objects, traceability, and the discipline behind a pyramid that holds.
Watch on YouTube →
Got an AI rollout that's outpacing your tests?
Book a free 30-minute call. I'll listen, ask sharp questions, and tell you honestly whether I'm the right person for the job.