Approach

Strategy
  • Reframed the design question from transparency to trust. Showing what the AI is doing solves a UI problem; designing for trust solves a product-adoption problem.

  • Anchored the work in one persona's defense posture: a Head of GRC at a Series-B SaaS company, 90 days from a SOC 2 audit. Every empty state and copy choice was pressure-tested against "would this hold up in front of an auditor?"

  • Audited adjacent categories - enterprise compliance, AI agent UIs, observability. Compliance reads conservative, agent UIs read noisy. The brief was to keep the conservative spine and let agentic affordances in selectively.

Interaction & Systems

Four structural design decisions shaped the rest of the work.

1 - IA anchored in the operator's existing mental model, not the agent fleet's.
The spine is Frameworks → Controls → Evidence, mirroring how compliance officers already work in their spreadsheets and audit binders. Agents sit as an ambient layer on top. A deliberate move against the obvious "agents-first" structure, because the operator's existing model is the trust scaffold, not the thing to disrupt.

2 - Live activity feed as a first-class dashboard surface, not a notification corner.
"What's happening right now" is the operator's primary anxiety in agentic work, and burying it in a bell would make agents feel unaccountable. Promoted, then paired with a rule: ambient updates can be muted, user-action toasts cannot. The operator never gets punished for their own actions.

3 - Conversation as a first-class affordance for every agent.
Every agent run opens a side panel with two tabs: a forensic Run timeline and an Ask Agent chat. Operators ask "why this score?" or "what evidence am I missing?" in plain language. Chat persists per-agent across close and reopen, so a conversation doesn't vanish when the operator pivots. Conversation became the primary surface for accountability; configuration moved to secondary.

4 - A three-layer disclosure pattern for AI legibility.
Ambient (stat cards, KPIs) → narrated (activity feed in colleague-voice prose) → forensic (run panel with timeline, evidence, chat). Each layer reveals more on demand. The pattern recurs across the product, holding information density without overwhelming.

Execution
  • Shipped in three days by treating Claude as a junior engineering pair-up. I held the spec, the audit list, and every taste call; Claude held the syntax and the cross-file plumbing.

  • Iterated in a local sandbox so every micro-decision landed in under a second. Canvas-first tools couldn't keep up with the audit cadence I wanted.

  • Wrote the system before the components - one icon library at a single weight, a CSS-variable token system for dark and light parity, a written copy voice for agent narration. The rules existed before the files did.

  • Deployed via CI/CD to a public URL so the work is a clickable link, not a screenshot. Shipping the prototype made me design with deployment constraints in view from the first frame.

Impact

A working, deployed prototype at a public URL - operators can move through the surface and feel the trust pattern, not just see a static mock.

  • Established a reusable ambient → narrated → forensic disclosure pattern that generalizes beyond compliance to any agentic product where defensibility matters (legal, audit, healthcare, financial reporting).

  • Validated that conversation as a first-class affordance for AI legibility is more honest than configuration UIs - it preserves the operator's agency without forcing them to micromanage.

  • Demonstrated a repeatable working method for designing with AI as a collaborator · the case study is half about the product, half about the practice.

Have a project idea in mind? Let’s chat about how we can bring it to life— virtually, from anywhere in the world!

Have a project idea in mind? Let’s chat about how we can bring it to life— virtually, from anywhere in the world!