AD '26
All Chapters

CHAPTER 14

The Future of Agentic Design

Trends, predictions, and what comes after the current tools

Reading time

17 min

Where We Are: The State of Agentic Design in May 2026

A snapshot of the ecosystem at the time of writing.

Timeline diagram showing three phases of design tool evolution: 2024 with GUI-only tools like Figma and Sketch requiring manual handoff, 2026 with agent-assisted design using MCP and skills, and projected 2028 with autonomous self-maintaining design systems
Ecosystem evolution from GUI-only design (2024) through agentic emergence (2026) to autonomous systems (2028)

Paper launched its Desktop app in February 2026, becoming the first connected design canvas with an MCP server that works with any agent: Claude Code, Codex CLI, Cursor, GitHub Copilot, OpenCode, Windsurf, Cline, and Continue. According to Paper's published case studies and marketing materials, production users include Vercel, Perplexity, PostHog, Tailwind CSS, Dub, Replicate, Zed, and Attio.

Open Design hit 43.6k GitHub stars in under two weeks after launch, according to public GitHub data --- reportedly the fastest growth for an open-source design tool. It supports 16 CLI agents and ships 129 design systems as portable Markdown files.

OpenPencil shipped Concurrent Agent Teams: multiple AI agents working on different sections of the same canvas simultaneously. It is the first open-source vector design tool built around the agent as the primary user. At 3.1k stars as of May 2026, it is smaller but evolving fast.

Huashu Design relicensed to MIT on May 14, 2026, signaling that the open-source design skill ecosystem is maturing. At 14.1k stars as of May 2026 (according to public GitHub data), it remains the most specialized tool in the stack --- an HTML-native design skill with anti-AI-slop enforcement and a 5-dimension self-critique system.

Hyperframes (19k stars as of May 2026, by HeyGen) and Remotion (47.1k stars according to public GitHub data) represent the two poles of agent-driven video production. Hyperframes is HTML-native and agent-first. Remotion is React-based and mature.

Four major agent platforms compete: Claude Code, Codex CLI, OpenCode, and Gemini CLI. All four support MCP. MCP itself has become the universal connector between agents and design tools --- Paper, Pencil, OpenPencil, Figma, Miro, and MagicPattern all ship MCP servers as of May 2026.

Tool Stars (May 2026, per public GitHub data) Key Milestone Agent-Native?
Remotion 47.1k v4.0.462, mature framework No (React-based)
Open Design 43.6k Fastest-growing open-source design tool Yes
Hyperframes 19k 137 releases, 50+ catalog blocks Yes
Huashu Design 14.1k MIT relicensed May 2026 Agent-agnostic skill
OpenPencil 3.1k Concurrent Agent Teams shipped Yes
Paper Commercial Desktop launched Feb 2026 Yes
Pencil Commercial .pen format, IDE integration Partial (MCP server)

The 24-Hour Sprint: How Agent Execution Changes Delivery Cadence

McKinsey published a framework in May 2026 that describes what agentic software delivery looks like at organizational scale. The numbers are striking: companies running agent-driven delivery see threefold to fivefold productivity improvements, with 60% smaller teams (source: McKinsey, "Rewiring Software Delivery for the Agentic Era", retrieved 2026-05-28). But the specific number matters less than the structural model underneath it.

The framework describes four pillars. I want to focus on the first two, because they map directly onto how agentic design teams will operate.

The Daily Cadence

The two-week sprint cycle collapses into a daily rhythm. During the day, humans review output, resolve ambiguity, set architectural guardrails, and prioritize the backlog. Their role shifts from producing artifacts to supervising the system that produces them. Overnight, agents execute structured work: enriching requirements, generating designs, testing code, packaging outputs for morning review.

Circular 24-hour sprint diagram showing the daily cadence: human judgment during the day (review output, resolve ambiguity, set guardrails, prioritize backlog) and agent execution overnight (enrich requirements, generate designs, test code, package outputs)
The 24-hour sprint: human judgment during the day, agent execution overnight, repeated daily

This model only works if a few practical foundations are in place:

  1. Clear product vision. The business must know what it is building so it can judge agent output against business intent.
  2. Standardized tech environment. Common frameworks and modular architectures so agents can produce predictable, reusable output.
  3. Structured requirements-to-code path. A standard structure so agents can reliably interpret inputs and produce predictable outputs across projects.
  4. Consistent stakeholder engagement. The same core stakeholders stay engaged across the value stream to avoid misalignment and rework.

For design teams, the implications are concrete. A product designer who currently spends two weeks on a feature exploration could queue structured design briefs for overnight execution. Morning review becomes the primary design activity. The designer's daily work is directing taste, resolving ambiguity, and deciding which of ten agent-generated directions to pursue --- not pushing pixels.

Eliminating Handoffs

In most organizations, design intent travels through multiple translations: user story to wireframe, wireframe to mockup, mockup to spec, spec to code. Each handoff loses information. The agentic model structures artifacts for machine-to-machine handoffs instead. Requirements, design tokens, guardrails, and specifications live in standardized, machine-readable formats. The pipeline runs end-to-end. Humans intervene at defined review gates.

Horizontal pipeline diagram showing five stages (Requirements, Design Spec, Visual Artifact, Production Code, Shipped Product) with agent icons at each transition and three human review gates along the bottom
Machine-readable artifacts eliminate human translation between stages. Agents handle every handoff. Humans intervene only at review gates.

This is exactly what design-as-code formats enable. A DESIGN.md file (covered in Chapter 08) is a machine-readable design specification. An agent can read it, generate visual output, and pass the result to the next stage without human translation. The McKinsey framework validates what the tool ecosystem is already making possible.

Knowledge Infrastructure

The third McKinsey pillar is knowledge infrastructure. The most effective agent-driven teams build knowledge graphs that connect customer feedback, architecture decisions, design documents, GitHub activity, and incident reports into a semantically linked system. For design, this means a persistent memory layer that captures why decisions were made, what alternatives were rejected, and what the design system constraints are.

Hexagonal knowledge graph diagram with six surrounding nodes (Customer Feedback, Architecture Decisions, Design Documents, Codebase Activity, Incident Reports, Compliance Rules) connected to a central Knowledge Graph hub, with agent input and output arrows
A knowledge graph gives agents persistent organizational memory. Every design decision becomes traceable to its source.

An agent with access to this graph produces better output because it understands context, not just the current task. Questions that once required weeks of interviews with subject matter experts can be answered in minutes by an agent drawing from structured institutional memory. Every decision becomes traceable. If a stakeholder asks why a feature was deprioritized, the answer links directly to its source: customer survey data, usage analytics, or an architectural constraint.

The knowledge graph should not begin as a grand, top-down ontology effort. It should evolve organically around priority domains and live programs, compounding value over time. As it scales, knowledge becomes production infrastructure rather than static documentation.

My take: I don't agree with everything in the McKinsey piece. The 60% team reduction claim is aggressive for design work specifically, because design judgment does not compress as cleanly as code production. But the structural model --- daily cadence, machine-readable artifacts, knowledge infrastructure, defined review gates --- is sound. It describes where agentic design is heading, even if the timeline is uncertain. The designers who figure out how to queue structured briefs for overnight execution will have an unfair advantage over teams still running two-week sprints with manual handoffs.

Trend 1: Autonomous Design Systems

Design systems are becoming self-maintaining. This is the most consequential trend in the ecosystem, and it is happening faster than I expected.

Consider the current state. Open Design's library contains 129 design systems, each expressed as a DESIGN.md file --- a portable Markdown specification that any agent can read and apply. These are not static theme JSON files. They are living documents that agents interpret, enforce, and update. When a token changes in the DESIGN.md, the agent applies the change across every artifact it generates.

OpenPencil's Design System Kit takes this further. It manages reusable UIKits with style switching and component composition. A team can define Light, Dark, and High Contrast themes as variable sets inside a single .op file. Switching themes is a variable swap, not a redesign.

Paper's bi-directional workflow closes the loop. A designer changes a token value in Paper. The agent reads the change via MCP and updates the CSS custom properties in the codebase. A developer changes a spacing value in Tailwind config. The agent reads the change in Git and updates the Paper design. No manual sync. No drift.

The trajectory is clear. Based on publicly available information, GitHub's design team has reportedly signaled a shift away from maintaining separate component libraries, moving toward treating the codebase as the single source of truth. The direction is unambiguous: translating code components to design is becoming a single-prompt operation.

The vision for 2027: design systems that auto-detect token drift between design and code, flag the discrepancy, and offer to auto-fix. The human approves or rejects. The agent handles the mechanical synchronization that currently eats hours of design systems team time every sprint.

Trend 2: Real-Time Collaborative Agents

The second trend: multiple agents and multiple humans working on the same design surface simultaneously. Not asynchronously through Git. In real time.

Paper already has real-time multiplayer. You share a URL, your team members see changes live. But the multiplayer is human-only today. The agent participates through MCP, which is stateless --- it reads and writes but does not observe live.

OpenPencil's Concurrent Agent Teams point toward the future. Three agents working on different sections of the same canvas, coordinated by an orchestrator. This is parallel execution, not real-time collaboration. But it proves the spatial decomposition pattern: complex designs can be split into independent sub-tasks, assigned to different agents, and merged.

Open Design takes a different approach to the same problem. It auto-detects 16 CLI agents on your PATH and lets you swap between them with one click. This is agent portability, not agent collaboration. But it establishes the principle that the design tool should be agent-agnostic --- it should not care which agent you use.

The convergence point, likely in late 2027: a design canvas where one human and two agents are working on different sections at the same time, with live cursors showing where each participant is active. The human designs the hero section. Agent A generates responsive variants. Agent B synchronizes tokens with the codebase. All three changes appear in real time.

The orchestrator pattern from Chapter 11 becomes the standard architecture for this. An orchestrator agent decomposes the design task, assigns spatial sub-tasks to worker agents, and merges the results. The human acts as creative director, intervening when the agents produce something off-brand.

Trend 3: Design and Code as a Single Artifact

The third trend is the most fundamental. Design and code are converging into a single artifact. Not "design that exports to code." Not "code that renders a preview." A single artifact that IS both design and code simultaneously.

Paper is the clearest example. Paper's canvas is built on real HTML and CSS. When you design in Paper, you are writing HTML and CSS. There is no translation layer. The get_jsx tool does not convert a proprietary format into JSX --- it extracts JSX from a surface that is already HTML. The write_html tool goes the other direction, but it is writing to the same format.

This is different from Figma's model. Figma stores designs in a proprietary WebGL format. Developers and agents need to translate that format into production code. Paper's approach is the inverse: its native format is already close to production code, making it inherently agent-friendly. There is no translation step because the design surface speaks the same language as the output.

Pencil's .pen files and OpenPencil's .op files are intermediate steps on the same path. They are code artifacts --- plain text, diffable, version-controllable. A pull request can include changes to a .pen file alongside changes to a .tsx file. The reviewer sees both the design change and the code change in the same diff. But .pen and .op files are still separate formats that need translation to production code.

Approach Tool Format Translation Layer Diffable?
Design IS code Paper HTML/CSS None Yes (native HTML)
Design-as-code artifact Pencil .pen (JSON) Required (pen → JSX) Yes (plain text)
Design-as-code artifact OpenPencil .op (JSON) Required (op → multi-platform) Yes (plain text)
Proprietary canvas Figma WebGL binary Required (fig → anything) No

The end state: a world where there is no "design file" and "code file." There is one file. It renders visually in a canvas tool and compiles to production code. The distinction between designer and developer roles blurs because the artifact is the same.

Multiple voices in the ecosystem are converging on this idea. The future of code is design, and the best design work emerges from a canvas --- not a code editor. At the same time, code is a poor interface for spatial problems, and chat is a poor interface for systemic decisions. The canvas --- not the code editor, not the chat window --- is the correct interface for design work. But the canvas must produce code, not pixels.

My take: The design-code convergence is inevitable, but the timeline is uncertain. Paper's HTML-native approach is the most pragmatic path I see. Proprietary canvas formats (Figma, Sketch) will either open up or become liabilities. The .pen and .op formats are stepping stones --- useful now, but ultimately unnecessary if the design surface speaks the same language as production code.

Trend 4: The Rise of Agent-Native Design Tools

The fourth trend is the emergence of design tools built from the ground up for agent collaboration. Not legacy tools with AI features bolted on. Tools where the agent is the primary user and the human is the creative director.

Paper was the first tool built from scratch for agent collaboration via MCP. Every tool in Paper's MCP server --- get_selection, get_jsx, get_computed_styles, write_html --- was designed for agent consumption, not human clicking. The human uses the visual canvas. The agent uses the MCP API. Both work on the same artifact.

OpenPencil calls itself "the world's first open-source AI-native vector design tool." The agent can create shapes, apply styles, manage components, and export to code --- all through MCP tools, without a human touching the canvas. The Agent Teams feature extends this: the canvas is partitioned into spatial regions, and each agent works independently.

Open Design is agent-native in a different sense. There is no canvas. The agent IS the design engine. You give it a prompt, it asks clarifying questions, picks a visual direction, and builds the artifact. The output is the design. There is no intermediate visual surface.

Hyperframes is explicit about its positioning: "Built for agents." Its HTML-native authoring format and non-interactive CLI are designed for agent consumption, not human editing. The 50+ catalog blocks are pre-built agent actions --- composable units that an agent can assemble without understanding animation principles.

The key differentiator across all these tools: the format. Tools where agents read and write the same format as production (HTML, CSS, JSX) have a structural advantage over tools where agents must translate between formats. Paper's get_jsx works because the canvas IS HTML. Figma's MCP server works, but every read requires format translation.

Legacy tools are responding. Figma shipped an MCP server in early 2026. It works. But retrofitting agent support onto a proprietary canvas format is fundamentally different from building for agents from the start. The MCP server can read designs and export code, but the format translation introduces information loss and complexity that agent-native tools avoid.

Predictions for 2027-2028

Specifying predictions with timestamps so you can hold me accountable.

Horizontal spectrum diagram showing design tasks plotted from fully human-driven on the left to fully agent-driven on the right, with current 2026 positions marked and projected 2028 positions showing tasks like component generation and accessibility moving further right toward automation
Design task spectrum from human-driven to agent-driven with 2026 current state and 2028 projections
Prediction Timeline Confidence Signal Today
MCP servers ship as table stakes for every design tool By Q4 2026 High 6 tools already have MCP; Figma added one in early 2026
Design files disappear as a concept --- .pen and .op become code artifacts reviewed in PRs By mid 2027 Medium Pencil and OpenPencil already Git-integrated; Paper is HTML-native
Agent Teams are standard for complex design tasks By late 2027 Medium-High OpenPencil already ships Concurrent Agent Teams
The "design handoff" dies --- design IS code By 2028 Medium Paper's bi-directional workflow, GitHub treating codebase as source of truth
Design systems auto-detect and auto-fix token drift By late 2027 Medium Paper's token sync, Open Design's DESIGN.md schema
Figma either opens its format or loses market share to agent-native alternatives By 2028 Low-Medium Community reportedly switching to Paper; Paper's case studies list Vercel/Perplexity/PostHog
Video production becomes a standard design skill, not a specialist discipline By mid 2027 High Hyperframes 50+ blocks, Huashu Design MP4 export, Remotion Lambda
The designer role shifts from pixel-pusher to agent director and design system architect Ongoing through 2028 High Reportedly already happening in teams using Paper and Open Design

The prediction I am most confident about: MCP becomes universal by late 2026. Every design tool that wants to remain relevant will ship an MCP server. The ones that do not will become niche. This is not a bold prediction --- it is an observation of a trend that is already well underway at the time of writing.

The prediction I am least confident about: Figma's trajectory. Figma has enormous institutional momentum, deep enterprise contracts, and a generation of designers trained on its interface. Its MCP server, shipped in early 2026, shows awareness of the agent trend. But its proprietary canvas format is a growing liability in an agent-native world. Whether Figma opens its format or doubles down on proprietary lock-in will determine its trajectory.

Risks: Homogenization, Over-Reliance, and the Designer's Role

The trends are real. The predictions are grounded in observable signals. But the trajectory is not guaranteed. Several risks could undermine the whole ecosystem.

Quality homogenization. This is the risk I hear about most from design leaders. A prominent design executive puts it bluntly: the more companies talk about coding with agents and designing in code, the worse their UX tends to be. A design tool founder agrees: if you are building software meant to be consumed by humans, take the time to actually design it. AI has a design smell because it is "designing" via code, not through spatial reasoning.

The "AI design smell" is real. Huashu Design's anti-slop rules list the specific patterns: purple gradients, emoji icons, rounded-corner-plus-left-border-accent, SVG humans, Inter-as-display-font. When every agent generates from the same model weights without constraint, the output converges. Every startup's landing page starts to look the same.

The mitigation is clear but not easy. Strong design systems with custom tokens. Agent skills with anti-slop enforcement. Human creative direction at every step. The tools that bake these constraints into the workflow --- Huashu Design, Open Design --- will matter more than the tools that leave output quality to the model.

Over-reliance on agents. The warning from experienced practitioners is consistent: no agent replaces design judgment. Tools can accelerate execution, but the craft of design --- the spatial reasoning, the taste, the ability to hold ambiguity --- remains human. When designers treat agents as creative substitutes rather than execution engines, the quality drops.

Agents cannot read between the lines. They cannot hold ambiguity. They cannot make judgment calls about what matters and what does not. They execute instructions. When the instructions are clear and specific, the output is good. When the instructions are vague or absent, the output is generic.

The risk is not that agents replace designers. The risk is that designers lose craft skills as agents handle more execution. When execution becomes cheap, the temptation is to skip the spatial phase of thinking entirely. The bottleneck moves upstream --- it becomes a question of mind, not mechanics. The designer's value shifts from execution to spatial thinking, creative direction, and taste. But if designers stop practicing spatial thinking because agents handle the rendering, the taste atrophies.

My take: The homogenization risk is the one I worry about most. I have seen agent-generated designs that are technically correct and aesthetically empty. The tools will get better at avoiding AI slop, but the underlying problem is cultural: when execution becomes free, the temptation to skip thinking is overwhelming. The designers who thrive in this new landscape will be the ones who treat agents as execution engines, not creative partners. The creative work stays human.

The designer's changing role. A creative tool founder frames it directly: designers should be racing to get as close to the product as possible. The role is shifting from pixel-pusher to agent director, design system architect, and quality curator.

This shift creates opportunity and anxiety in equal measure. The designers who embrace agentic workflows gain leverage --- they produce more output, iterate faster, and ship with higher consistency. The designers who resist or ignore the shift find their skills depreciating as the industry moves to agent-native tools.

A design engineer captures the human advantage: if AI can think better and software can scale wider, then the scarce thing is the particular, messy way a human mind cares about something. Agents do not care. They execute. The caring --- the judgment about what is good, what is worth building, what matters to users --- stays human.

Other risks worth watching. Vendor lock-in: tools that require specific agents or models create dependency. Cost: API token usage for agent-driven workflows adds up, especially for video generation and multi-agent teams. Reliability: agents hallucinate tool calls, lose MCP connections on long sessions, and degrade on large tasks. Security: agents with write access to design files and code repositories need permission models, and those models are still immature. Accessibility: if agents generate code without a11y enforcement, digital exclusion worsens.

Risk Severity Current Signal Likely Mitigation
Quality homogenization High AI design smell in agent-generated output Anti-slop rules, custom design systems, human curation
Craft skill atrophy Medium-High Designers skipping spatial thinking phase Education, deliberate practice, canvas-first workflows
Vendor lock-in Medium Tools requiring specific agents or models Open-source alternatives, MCP standardization
API cost at scale Medium Token costs for video and multi-agent teams Model efficiency improvements, local models
Agent reliability Medium Hallucinated tool calls, dropped MCP connections Better tool validation, session recovery protocols
Security surface area Medium Agents with write access to design and code Permission models, sandboxed execution, audit logs
Accessibility regression High Agents generating code without a11y enforcement Automated a11y testing in agent loop (covered in Chapter 12)

These risks are not theoretical. I encountered agent hallucination, MCP connection drops, and security permission issues in the case studies covered in Chapter 13. The ecosystem will address some of these. Others will persist.

My take: I am optimistic about the trajectory but realistic about the timeline. The tools covered in this book are version 1.0 of agentic design. The MCP protocol is young. The skill ecosystem is nascent. The permission models are thin. But the direction is clear: design and code are converging, agents are becoming collaborative, and the canvas is returning as the primary interface for creative work. The designers who learn to direct agents --- not just use them --- will define the next decade of product design.

Generative UI frameworks: when the agent renders the interface at runtime

Two delivery models — emit-source (reviewable) versus render-at-runtime (ephemeral) — and what each costs the critique loop.
Figure: Emit-source versus render-at-runtime, and the cost to the critique loop.

The agentic design tools in wide use today produce an artifact you can read — a .pen file, an HTML page, a token set, a diff. There is a quieter frontier where the agent skips the artifact and renders the interface directly at runtime, constrained to your design system. The umbrella terms circulating in mid-2026 are generative UI, AG-UI, and MCP Apps: the agent receives intent, then assembles live components from your library on the fly rather than emitting source for you to review.

I am cautious here, and the reason is the loop that produces good design: intent, then generation, then critique, then revision. Critique needs something durable to point at. Runtime-generated UI weakens that step: if the interface only exists for the duration of a session, you cannot diff it, you cannot leave a comment on line 40, you cannot revert to yesterday's version. You trade reviewability for immediacy.

That trade makes sense for ephemeral, personalized surfaces — a dashboard assembled per-user, a form that adapts to the conversation. It makes much less sense for durable product surfaces that need to be reviewed, versioned, and shipped. Watch the space, but do not confuse "the agent rendered a UI" with "the agent produced a design you can direct." The second is the harder and more useful problem.

Next: This chapter concludes the main content of the book. Appendix A provides a comprehensive tool comparison matrix for quick reference. Appendix B contains the full MCP server reference. Appendix C offers a prompt library organized by design task.

Next Chapter

Tool Comparison Matrix

Side-by-side feature comparisons for every tool covered in this book

Continue Reading

©2026 Mehran Mozaffari. All rights reserved.