Agent Process Design.
The discipline of turning a workflow into a defensible agentic service. We design two layers in parallel — the aggregate view a CTO or COO uses to operate the agentic workforce, and the individual agent a product team actually ships. Both come out as interactive HTML mockups — not slides, not Figma frames — so the engineering team can read, fork, and build straight from the deliverable.
Two views. One design system.
Every engagement runs both lenses simultaneously. Without the aggregate view you ship orphans. Without the individual-agent view you ship a roadmap. The deliverable is a working HTML prototype of both surfaces — what the workforce looks like at the top, and what one agent looks like at the bottom.
The agentic workforce.
The shared surface — agent home, the map of how agents connect, the inventory of what exists today and what's coming next quarter, the workforce mix between humans and agents. The CTO opens this on Monday morning.
↓ Four artefactsThe specific agent service.
The end-to-end design of one agent — process analysis, trust spec, scaffolding, storyboard, runtime, codebase, evaluation harness, audit trail. The product team builds straight from it.
↓ Ten stagesWhat the CTO opens on Monday.
Four artefacts make the agentic workforce visible. Each one ships as a live HTML mockup — your team can use it, edit it, route it to engineering, and bring it into the next steering meeting.
Agent Home — the dashboard.
The summary surface every stakeholder lands on. Live agents at a glance — what's running, what's paused, what's escalated, what each one cost and saved this week. Designed to be the first tab on the CTO's browser, not buried inside a vendor console.
We design the information architecture, the live-status colour system, the KPI tiles, the activity stream, and the escalation surface — then deliver a working HTML prototype your engineering team can implement on top of your stack.
Agent Maps — the sitemap.
Agents do not live alone. They consume each other's outputs, feed shared knowledge banks, share tools, and call each other for help. We draw the workforce as a graph — Clay-style nodes, named connections, data-flow arrows — so the team can see where one agent's change ripples to three others.
The map is the artefact that ends the question "so what does this agent actually depend on?" at the executive table.
Inventory & planning.
Everything live, everything in build, everything on the roadmap — in one register. Each row carries the agent's type, owner, autonomy tier, trust stage, and economic case. The artefact a CFO can read; the artefact procurement can defend.
We also draft the planning view — quarter-by-quarter sequencing of which agents land when, against which workflow, owned by which named human.
| ID | Agent | Tier | Owner | Stage |
|---|---|---|---|---|
| A001 | Pricing Brain | Act-notify | Pricing Ops | Live |
| A002 | Brand Voice | Approve-each | Brand Council | Live |
| A003 | Compliance | Human-conf. | Legal | Pilot |
| A004 | Reporting | Autonomous | Analytics | Live |
| A005 | Audience | Review-act | Insights | Live |
| A006 | Brief Brain | Approve-each | Strategy | Build · Q3 |
| A007 | Forecast | Act-notify | Finance Ops | Spec · Q4 |
Workforce summary.
The mix view. Per function, per workflow, per region — how the work splits between human, human-supervised agent, and autonomous agent. The artefact a CHRO and a CTO read together. Drives capacity planning, hiring posture, and the agentic budget conversation.
Where on the workforce mix you sit today — and where you are aiming for next year — is rarely written down. We write it down.
How we design one agent.
Each stage produces a named artefact. Every artefact is a clickable HTML page the build team can fork — the trust spec, the storyboard, the runbook, the audit format. By the end the agent has a homepage of its own, before a single production endpoint is touched.
Business process analysis.
Before any prompt or model: we read the workflow. Who triggers it, what fires it, what counts as success, what breaks today, where time leaks. Most agents fail because they automated a process nobody had written down clearly enough to read.
Process map, actors & taxonomies.
The workflow drawn as a graph. Every actor — human, system, agent — named with role, scope, and decision rights. Every artefact named with its taxonomy class (input · intermediate · output · audit). The shared vocabulary of the engagement gets fixed here, and stops changing.
Business goals → agent KPIs.
Business outcomes translated into things an agent can actually move. Revenue uplift becomes decisions queued · approve-rate · cycle-time · error-rate · escalation-rate. We separate quality KPIs, trust KPIs, and economic KPIs so the agent is judged on all three, not just the one that looks easy.
Trust layer — Canvas & harness.
The agent's constitution. We run the Trust Canvas session to fix autonomy boundaries per action class, name the escalation rules, set the memory policy, and draft the explainability contract. Then we design the runtime harness — the deterministic policy layer that enforces all of it in code, AGT-compatible by default.
This is the artefact a regulator and a CFO can both read.
Agent scaffolding — brains, skills, knowledge, context.
What the agent is made of. Which reasoning brain (deterministic, probabilistic, generative, retrieval-augmented). Which named skills it can call. Which knowledge banks it reads. Which short-term and long-term context it carries between sessions. Every component named, every component owned — no shadow cognition.
Storyboarding — escape hatches, visible controls.
The end-to-end user experience, frame by frame. Where the intent handshake happens. Where the confidence cue surfaces. Where the human can revise or undo without losing work. Every storyboard frame names the AUX pattern it instantiates — Intent Handshake, Memory in Motion, Escape Hatch — so the build team is not improvising.
Runtime — triggers, dead ends, reruns, cycles.
How the agent actually runs in time. What triggers a session — schedule, event, user invocation, escalation. What counts as a successful end. What counts as a dead end and how it routes. When the agent reruns a step, retries with a different brain, or cycles back to a human. The runtime is the part most teams forget to design — then it shows up in production as the bug they cannot reproduce.
Codebase, tools, integrations, deployment.
The engineering hand-off. Repo structure, tool catalogue (what the agent is allowed to call, with what scope), integrations (which platforms, which auth, which observability), and deployment shape (managed runtime, edge, self-hosted). We do not write production code — we write the spec the build team writes production code from. The brief is small enough to commit, complete enough to ship from.
Evaluations & improvements.
The agent ships, then it earns the right to keep running. We design the eval harness — quality checks, hallucination detection, trust-stage scorecards, A/B against the legacy human process. Improvement cadence is named: who reviews what, on which day, with what authority to roll back. Continuous, not optional.
Audits with audit trails.
Every action the agent takes is logged immutably — prompt, model, version, inputs, decision, outputs, the human approval that came after. Two years later, a trustee or regulator should be able to reconstruct what happened from the log alone. Without this, agentic systems are fiduciarily indefensible. We design the log shape, the storage policy, and the audit-readout format the agent's owner can produce on request.
2026-05-28T14:02:13Z [BRAIN] brand-voice@2.1.0 · sources: brandbook/v4, claims-policy/q2
2026-05-28T14:02:14Z [FLAG] "transformational" violates rule BR-19 · soft-pause
2026-05-28T14:02:14Z [QUEUE] alt drafted · review queued for jane@
2026-05-28T14:08:42Z [APPROVE] jane@ · variant 2 · sig=verified
2026-05-28T14:08:43Z [AUDIT] hash=a91f…3c2b · merkle-anchored
We ship HTML, not slides.
Every deliverable from the engagement is a working HTML page or component — designed once, forkable by engineering, hostable on your intranet from day one. The agent has a homepage before it ships, and the workforce has a homepage before there's a workforce.
Agent Home page
The CTO's Monday morning surface. Live agent status, KPIs, escalation queue. Interactive HTML, your design system optional.
Agent Map graph
Node-and-edge view of how agents depend on each other. Editable, exportable, embeddable in your wiki.
Inventory & roadmap
One row per agent — type, owner, tier, stage, economic case. Plus the quarter-by-quarter sequencing for what's coming.
Trust Canvas + policy YAML
The agent's constitution in two formats: human-readable Canvas, machine-readable policy file. Wired for AGT, OPA, or your own enforcement layer.
Storyboard walkthrough
Interactive frame-by-frame HTML showing every AUX pattern at every moment. Click through what the user actually sees.
Runtime state diagram
Triggers · happy path · dead ends · reruns · cycles. The diagram the on-call engineer can actually read at 3am.
Build spec + tool catalogue
The brief the build team commits to git. Small enough to read in an hour, complete enough to ship from.
Eval harness + audit format
The quality scorecard, the rollback runbook, the audit-log schema. The artefacts that turn the agent from "trust us" into "audit us."
Two ways in.
Most engagements begin with a Blueprint Sprint on a single workflow — six weeks, fixed scope, fixed price. Teams that move to the workforce view then graduate to an Advisory Retainer covering successive agents through a quarter or year.
Blueprint Sprint.
A single workflow, all ten stages, both lenses framed. You walk out with the agent home mockup, the trust canvas, the storyboard, the runtime spec, and the build brief — everything the engineering team needs to start building on Monday.
- Process + KPI translation
- Trust Canvas + policy YAML
- Storyboard + runtime spec
- Build-ready hand-off
Advisory Retainer.
The Blueprint Sprint outcome, extended across the agentic workforce. We sit alongside your AI council, design successive agents on the same template, evolve the aggregate view, and own the consistency of the deliverable as the surface grows.
- Per-quarter agent designs
- Aggregate view maintained
- Audit + eval cadence supported
- Steering presence at AI council
Bring us one workflow.
We'll bring back two homepages.
One for the agent. One for the workforce. Both clickable. Both forkable. Both small enough to commit, complete enough to ship from.