Tool Comparison Matrix

Note: Pricing and feature details reflect publicly listed rates as of May 2026. Verify current pricing on each tool's website before making purchasing decisions.

Agent Platforms Compared

Four agent platforms dominate agentic design workflows as of May 2026. Each supports MCP, each has a different strength, and each takes a fundamentally different approach to skill management, permissions, and model access. The table below compares them across the dimensions that matter most for design work: how they connect to design tools, how they handle permissions, and what the real cost looks like when you use them daily.

Feature comparison grid for four agent platforms showing license type, model access, configuration files, skill system support, MCP compatibility, subagent capabilities, plan mode, and free tier availability with green for supported, yellow for partial, and red for not available — Agent platform feature comparison across 15 dimensions with color-coded availability indicators

The comparison criteria here focus on practical concerns, not marketing claims. "Design skill ecosystem" measures the availability of pre-built skills and plugins for design tasks --- reading canvas files, generating JSX, exporting assets. "Model access" matters because some agents lock you into one provider while others let you bring any API key.

Feature	Claude Code	Codex CLI	OpenCode	Gemini CLI
Vendor	Anthropic	OpenAI	Open-source community	Google
License	Proprietary	Proprietary	MIT	Proprietary
Model access	Claude Opus / Sonnet	GPT-4o / o3	BYOK (any provider)	Gemini 2.5 Pro
MCP support	Full (plugin + manual)	Full (Streamable HTTP)	Full (JSON config)	Full
Skill system	SKILL.md, plugins	Plugins	Skills, agents	Skills
Permission model	Per-tool allowlist	Per-tool allowlist	Configurable	Per-tool
Design skill ecosystem	Largest (Paper plugin, custom skills)	Growing	Extensive (Open Design, Huashu)	Emerging
Price	$100-200/mo (Max plan)	$200/mo (Pro)	Free (BYOK API costs)	Free (BYOK)
Best for design	Full-stack design-to-code	Rapid prototyping	Multi-agent design teams	Multimodal design input
Last verified	May 2026	May 2026	May 2026	May 2026

Claude Code has the largest design skill ecosystem as of May 2026, driven by Anthropic's plugin marketplace and the Paper plugin specifically. Codex CLI is catching up fast --- OpenAI's plugin infrastructure is maturing, but design-specific skills remain thinner. OpenCode is the interesting outlier: it does not ship its own model, so you bring whatever API key you want. That flexibility comes with more configuration overhead, but it also means you can swap models mid-project without changing agents.

Gemini CLI's strength is multimodal input. If your design workflow starts with screenshots, photos, or whiteboard captures, Gemini's vision capabilities give it an edge for the initial design-to-prompt translation. Its skill ecosystem is the thinnest of the four, but Google is investing heavily here.

My take: For design work specifically, Claude Code is the most complete option today. The Paper plugin alone --- read designs, export JSX, write HTML back to canvas --- eliminates half the friction in a design-to-code workflow. But if you work across multiple model providers or need to run parallel agents with different models, OpenCode's BYOK architecture is the right call. In the author's experience, the $200/month price tag on Codex CLI is hard to justify for design work when the skill ecosystem is still growing.

Chapter 02 covers each platform in depth with installation and configuration details.

Design Canvas Tools Compared

The canvas is where design happens. These three tools --- Paper, Pencil, and OpenPencil --- take fundamentally different approaches to rendering, file formats, and agent integration. The comparison below covers the technical dimensions that determine how well each tool fits into an agentic workflow: canvas model (what you're actually manipulating), MCP integration (how agents talk to the canvas), code export quality (what you get out), and collaboration model (who else can work on the same file).

Pricing comparison matrix showing tool costs organized by category with free tier, individual or pro pricing, team pricing, and enterprise options for Paper, Pencil, Open Design, OpenPencil, Huashu Design, Remotion, and Hyperframes — Pricing matrix for all book tools showing free, pro, and enterprise tier costs

I excluded Figma from this table because Figma's MCP integration is read-only as of May 2026. Figma remains the dominant design tool for collaborative work, but agents can only read from it, not write to it. The three tools below all support bidirectional agent communication.

Feature	Paper	Pencil	OpenPencil
Canvas model	Real HTML/CSS	WebGL (.pen JSON)	CanvasKit/Skia (.op JSON)
License	Commercial (freemium)	Commercial	MIT (open source)
Pricing	Free / $16/mo Pro	Free (IDE extension)	Free
Platform	Web + Desktop	Desktop + IDE	Web + Desktop + CLI
Collaboration	Real-time multiplayer	Local single-user	Planned
MCP server	Yes (22 tools)	Yes	Yes (built-in)
Code export	JSX + Tailwind via MCP	React + Tailwind via CLI	10 platforms via MCP
Color support	sRGB + P3 + OkLCH	sRGB hex	sRGB hex
Agent compatible	Claude Code, Codex, Cursor, Copilot, OpenCode	Claude Code, OpenCode	Claude Code, Codex, OpenCode, Gemini, Copilot, Kiro
File format	.paper (HTML-based)	.pen (JSON)	.op (JSON)
Git integration	Via agent	Native (.pen in repo)	Native (Git panel, merge)
Last verified	May 2026	May 2026	May 2026

Paper's canvas model is the most unusual: it renders actual HTML and CSS, not an abstraction layer. When an agent writes to Paper, it writes HTML. When it reads from Paper, it reads HTML. This makes the design-to-code pipeline almost trivially short --- the design artifact is already code. The trade-off is that Paper cannot render vector shapes or complex illustrations the way a tool with a dedicated rendering engine can.

Pencil's strength is IDE integration. The .pen file format is JSON that lives in your repository alongside your source code. Design changes show up in git diff. Design reviews happen in pull requests. For teams already doing code review, Pencil fits into existing workflows without introducing a separate review tool.

OpenPencil's Agent Teams feature --- where multiple agents work different regions of the canvas simultaneously --- is the most architecturally interesting approach in the space. Each agent claims a spatial region, works independently, and the canvas merges the results. Chapter 11 covers this pattern in depth.

My take: Paper wins on collaboration and color fidelity --- sRGB + P3 + OkLCH is, as of this writing, the most complete color support of any design tool in this comparison. Pencil wins on IDE integration and diffability. OpenPencil wins on openness and multi-agent architecture. I use all three depending on the task. For solo design-to-code work, Pencil lives inside my IDE and I never leave my terminal. For team projects with stakeholders who need to see and comment on designs, Paper is the strongest option in the author's experience. OpenPencil's Agent Teams feature is the most interesting architectural experiment in the space right now.

Design Skill Frameworks Compared

Skills teach agents how to design. Two frameworks dominate as of May 2026: Open Design and Huashu Design. They take opposite approaches. Open Design is breadth-first --- 31 independent skills, 129 built-in design systems, covering everything from landing pages to dashboards to pitch decks. Huashu Design is depth-first --- one comprehensive skill that embeds 20 design philosophies, a Junior Designer Workflow, and explicit anti-AI-slop rules into a single SKILL.md file.

The comparison below covers architecture, output capabilities, and the self-critique mechanisms each framework provides. One important clarification: both frameworks implement a "5-dimension critique," but they are different implementations. Open Design's critique is a separate skill in its 31-skill library, focused on scoring output against design system constraints. Huashu Design's critique is built into its single skill, focused on five dimensions of design quality (philosophical consistency, visual hierarchy, detail execution, functionality, and innovation) with explicit scoring and a fix list. Same concept, different execution.

Feature	Open Design	Huashu Design
License	Apache-2.0	MIT
GitHub stars	43.6k	14.1k
Skills count	31	1 (comprehensive)
Design systems	129 built-in	Brand Asset Protocol
Agent support	16 CLI agents auto-detected	Agent-agnostic (any markdown-skill agent)
Output formats	HTML, PDF, PPTX, ZIP, Markdown	HTML, MP4, GIF, PPTX, PDF
Visual directions	5 schools (curated palettes)	5 schools x 20 philosophies
Self-critique	5-dimension design system critique (separate skill)	5-dimension expert critique (built-in, scored with fix list)
Anti-AI-slop	Via design system constraints	Explicit anti-slop rules
Price	Free (open source)	Free (open source)
Architecture	Web + daemon + BYOK proxy	Skill file (SKILL.md)
Last verified	May 2026	May 2026

The architectural difference matters in practice. Open Design runs a local web daemon and proxies API calls through your own keys. It detects which agent CLI you have installed and adjusts its output accordingly. This is powerful but requires a running process. Huashu Design is a single SKILL.md file that you drop into your project. No daemon, no proxy, no running process. The agent reads the file and follows the instructions. Simpler setup, but less infrastructure for managing multi-step workflows.

For output formats, the key differentiator is video. Huashu Design can export HTML animations to MP4 and GIF with 60fps interpolation and background music mixing. Open Design focuses on static and interactive outputs --- HTML, PDF, PPTX. If your design workflow includes motion, Huashu Design has the edge. If you need 129 pre-built design systems to choose from, Open Design is, in the author's experience, unmatched in this category.

Chapters 06 and 07 cover Open Design and Huashu Design in depth, respectively.

Motion and Video Tools Compared

Programmatic video for agents comes down to two approaches as of May 2026: React components (Remotion) or HTML-native animation (Hyperframes). The licensing difference alone is worth understanding before you commit. Remotion uses a custom source-available license that requires a paid license for companies with certain revenue thresholds. Hyperframes uses Apache-2.0, which is OSI-approved open source with no revenue restrictions.

The table below compares the authoring model, rendering pipeline, agent integration, and output capabilities. The core architectural difference: Remotion compiles React components into video frames through a bundler. Hyperframes plays HTML + CSS + GSAP animations directly in a browser and captures the output. No build step, no bundler --- the same index.html you preview in a browser is the file that gets rendered to MP4.

Feature	Remotion	Hyperframes
License	Source-available (custom)	Apache-2.0 (OSI open source)
GitHub stars	47.1k	19k
Authoring format	React components (TSX)	HTML + CSS + GSAP
Build step	Required (bundler)	None (index.html plays as-is)
Animation model	Frame-accurate, seekable	Library-clock (GSAP, CSS, Lottie, Three.js)
Distributed rendering	Lambda (production-ready)	Single-machine today
Agent skills	Via SKILL.md	12 skills bundled
MCP server	No	No
Agent compatible	Claude Code, Cursor	Claude Code, Cursor, Codex, Gemini
Block catalog	Community templates	50+ ready-to-use blocks
Price	Free (companies need license)	Free
Last verified	May 2026	May 2026

Remotion's frame-accurate rendering model is a genuine technical advantage for certain use cases. Because each frame is a React render, you can seek to any frame, scrub the timeline, and get deterministic output. Hyperframes uses GSAP's clock, which means the animation timing depends on the library's internal state. For most design-driven video --- product demos, explainer animations, social content --- this difference does not matter. For broadcast-quality or frame-precise compositing, Remotion has the edge.

Hyperframes' no-build-step approach is the better fit for agent workflows. An agent generates an HTML file with GSAP animations, and that file is immediately renderable. No npm install, no bundler configuration, no webpack or Vite setup. The 12 bundled skills cover common video patterns --- product demos, feature walkthroughs, social clips, title sequences. Remotion's community template ecosystem is larger, but the templates require more adaptation.

Distributed rendering is the area where Remotion is clearly ahead. Lambda-based rendering means you can parallelize frame computation across hundreds of instances. Hyperframes renders on a single machine today. For long-form video (10+ minutes) or batch rendering, this is a meaningful gap.

My take: For agent-driven design work, Hyperframes is, in the author's experience, the more practical choice. The no-build-step model aligns with how agents work --- generate a file, render it, iterate. The Apache-2.0 license removes the compliance question entirely. Remotion is, as of this writing, the stronger tool for production video teams that need distributed rendering and frame-accurate control, but that describes a different workflow than what this book covers.

Chapter 09 covers both tools in depth, including setup, animation composition, and rendering.

MCP-Connected Tools Compared

These tools connect to agents via MCP servers. They extend agent capabilities into existing design platforms --- reading Figma files, generating patterns in MagicPattern, or collaborating on Miro boards. Unlike the canvas tools in section A.2, these are not design surfaces themselves. They are external platforms that expose a subset of their functionality to agents through MCP.

The comparison below covers access level, authentication method, and key limitations. Access level is the most important dimension: read-only MCP servers (like Figma's) let agents extract information but never modify the source file. Read-write servers (like MagicPattern's) let agents both read and create content. This distinction determines whether you can close the design loop entirely within the agent session or need to switch back to the GUI for write operations.

Feature	Figma MCP	Miro MCP	MagicPattern MCP
License	Proprietary	Proprietary	Proprietary
Access level	Read-only	Read + limited write	Read + write
Primary use	Read designs programmatically	AI-powered whiteboarding	Generative patterns
Agent compatible	Claude Code, Cursor, Codex	Claude Code, Cursor	Claude Code
Export formats	JSON, SVG, images	Board data, images	SVG, PNG, CSS backgrounds
Authentication	Figma API key	OAuth	API key
Price	Included with Figma	Included with Miro	Free / paid tiers
Key limitation	No write access, SVG fills as images	Limited write operations	Pattern types only
Last verified	May 2026	May 2026	May 2026

Figma's MCP server is the most widely used of the three, which makes sense given Figma's market position. But the read-only limitation is real. Agents can extract design tokens, read component structures, and download images, but they cannot create or modify frames. If your workflow requires writing back to Figma, you need the GUI or a third-party plugin.

Miro sits in the middle: agents can create sticky notes, add shapes to boards, and read board content. The write operations are limited to basic elements --- you cannot create complex diagrams or modify existing objects programmatically. For structured whiteboarding workflows where an agent captures meeting notes and organizes them on a board, Miro's MCP is sufficient.

MagicPattern has the broadest write access of the three. Agents can generate SVG patterns, export them as CSS backgrounds or PNG images, and iterate on pattern parameters. The limitation is scope: MagicPattern does one thing (generative patterns), and it does it well, but it will not help with layout, typography, or component design.

Tip: Chain these MCP servers for compound workflows. An agent can read a design system from Figma, extract the color palette, feed it to MagicPattern to generate background patterns, and write the results into a Paper canvas. Chapter 10 covers MCP chaining in detail.

Chapter 10 covers MCP integrations in depth, including configuration, chaining, and troubleshooting. Appendix B provides a complete reference for every MCP tool available in each server.

Agent Platforms Compared

Design Canvas Tools Compared

Design Skill Frameworks Compared

Motion and Video Tools Compared

MCP-Connected Tools Compared

MCP Server Reference

Preface

Copyright & License

The Agentic Design Paradigm

Your Agent Toolkit

Teaching Agents to Design

Design-as-Code

Paper and Pencil

Open Design and OpenPencil

Huashu Design

Design Systems and Tokens

Motion and Video

MCP Integrations

Multi-Agent Design Teams

Production UI from Design

Real-World Case Studies

The Future of Agentic Design

Tool Comparison Matrix

MCP Server Reference

Prompt Library for Design Tasks