Testing

Quick Reference

pnpm test                                       # Unit tests (~4s)
pnpm test:e2e chakra                            # All Chakra E2E (~10s/component)
pnpm test:e2e chakra Button                     # Single component
pnpm test:e2e chakra Button Alert Input         # Multiple components
pnpm test:e2e zapui --keep                      # Keep temp artifacts
pnpm test:e2e zapui --no-agent-sdk              # Use Messages API instead of Agent SDK
pnpm test:e2e zapui --model claude-sonnet-4-5   # Use specific model
pnpm test:e2e --help                            # Show options

Comparing Backends

To evaluate Agent SDK vs Messages API, run the same tests with each backend:

# Agent SDK (default)
pnpm test:e2e chakra --keep
# Note the temp dir path, e.g. /tmp/chakra-e2e-abc123

# Messages API
pnpm test:e2e chakra --no-agent-sdk --keep
# Note the temp dir path, e.g. /tmp/chakra-e2e-xyz789

# Compare metrics
cat /tmp/chakra-e2e-abc123/metrics.json
cat /tmp/chakra-e2e-xyz789/metrics.json

The metrics.json includes:

timing: Pipeline duration, avg time per component
tokens: Input/output/cache tokens, totals and averages
quality: Pass rates (structural, semantic), retry counts
components: Per-component breakdown

Prerequisites

Unit tests: Node.js ≥ 22 + pnpm install

E2E tests:

FIGMA_ACCESS_TOKEN and ANTHROPIC_API_KEY in .env
Git submodules: git submodule update --init fixtures/chakra-ui fixtures/zapui

CI

Unit tests - Every push
E2E tests - Version tags only (see e2e.yml)

Semantic Assertions

E2E tests validate that LLM-generated Code Connect has correct Figma→code prop mappings. Assertions are defined in SEMANTIC_ASSERTIONS in scripts/test-e2e.js.

For Agents: Handling Long-Running Tests

Command	Duration	Run in background?
`pnpm test`	~4s	No
`pnpm test:e2e chakra Button`	~45s	Yes
`pnpm test:e2e chakra` (all)	~5min	Yes

E2E tests take 30-60 seconds per component. To avoid terminal sprawl and interrupted tests:

Run E2E tests as a background process, then poll for completion
Never spawn a second test terminal while one is running
Don't interrupt a running E2E test—wait for it to finish
Unit tests are fast enough to run in foreground

Secret symbol: ⚘

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Testing

Quick Reference

Comparing Backends

Prerequisites

CI

Semantic Assertions

For Agents: Handling Long-Running Tests

Uh oh!

FilesExpand file tree

TESTING.md

Latest commit

History

TESTING.md

File metadata and controls

Testing

Quick Reference

Comparing Backends

Prerequisites

CI

Semantic Assertions

For Agents: Handling Long-Running Tests