Tracking: SB MCP Preview

This is a tracking issue for the work required to release the Storybook MCP packages in public preview.

# Milestones

## Milestone 1: Loose ends
**Lead**: @JReinhold 
**Completed by**: Week 3, Feb 16

- [x] Improve evals, making them easier to compare before/after results across multiple runs https://github.com/storybookjs/mcp/pull/136
- [x] Add quality scores to evals based on:
  - [x] Objective:
    - [x] Typescript errors
    - [x] Imported components compared to expected imports
  - [x] Subjective:
    - [x] UI Review
    - [x] Source Review
- [ ] #144
- [x] Minimize docs context by only including a subset of stories by default https://github.com/storybookjs/mcp/pull/123
- [ ] #143
- [ ] Improve Preview Stories input to support component ID + story ID from component docs in addition to the current path + story name input
- [ ] Add Windows tests to MCP CI
- [ ] QA Windows support
- [x] Add more eval metrics:
  - [x] Which MCP tools were called
  - [x] ... with which inputs
  - [x] ... and what was the token count of the output
  - [x] ... optionally comparable to expected values
- [ ] Remove the `feature.experimentalComponentManifest` flag
- [ ] MCP Apps follow-up
  - [ ] QA in Claude Desktop
  - [ ] QA in ChatGPT
- [ ] Investigate how to improve server discoverability through registries, packaged extensions, manifests, etc.
- [x] Remove XML formatter

## Milestone 2: MCP Test toolset
**Lead**: @JReinhold 
**Completed by**: Week 4, Feb 23

- [x] Add test tool that allows the agent to run story tests, including getting a11y results
- [x] Prompt Engineering to ensure the agent runs these tests continuously during UI development
- [x] Add quality-focused evals that ensures the testing feature is valuable to agents
  - [x] Modifying existing code
  - [x] Creating new components
  - [x] Both Component Testing and a11y
- [ ] https://github.com/storybookjs/mcp/pull/131
- [x] https://github.com/storybookjs/storybook/pull/33206

## Milestone 3: MCP Composition
**Lead**: @kasperpeulen 
**Completed by**: Week 2, Feb 9

- [x] Investigate how MCP Composition can work with private/authenticated Storybooks
- [x] Make it possible for users to re-use the Storybook Composition API to compose multiple Storybook sources together in the local MCP
- [x] Support remote, authenticated Storybooks
- [x] Write eval to ensure the agent correctly combines components from different source Storybooks

## Milestone 4: Improving React prop types extraction
**Lead**: @kasperpeulen 
**Completed by**: Week 4, Feb 23

- [x] Replace `react-docgen` with `react-docgen-typescript`
- [x] Ensure we have a comprehensive test suite for various ways to type props
- [ ] Investigate replacing `react-docgen-typescript` with a new DYI solution, based on the TypeScript compiler
- [ ] Add evals to validate that this is an improvement, eg. using components that are built with `forwardRef`.

## Milestone 5: Documentation
**Lead**: @kylegach  
**Completed by**: Week 5, March 2
 
- [ ] Usage documentation on `@storybook/addon-mcp`
- [ ] Reference documentation on `@storybook/addon-mcp`
- [ ] Example on how to self-host the MCP server with `@storybook/mcp` @JReinhold 
- [ ] Launch material
  - [ ] Storybook homepage (`/`)
  - [ ] Storybook feature page (`/ai`)
  - [ ] Blog post
  - [ ] E2E video demo @JReinhold 

## Milestone 6: QA
**Completed by**: Week 6, March 9

- [ ] Docs-driven QA
- [ ] Different agents
  - [ ] GitHub Copilot
  - [ ] Claude Code
  - [ ] Codex
  - [ ] Opencode
- [ ] Different Design Systems, ensuring the manifest generates properly with prop types, stories, etc.

## Milestone 7 - Stretch: Expose Component Coverage
**Lead**: TBD

Would it be valuable for an agent to know, what the test coverage is for a given component, in an attempt at making it write better stories?

> This stories-file covers these lines of the component (eg. 80 %)

- How about other modules in the component's module graph?
- How about coverage by other stories files?

## Milestone 8 - Stretch: Component Documentation Best Practices
**Lead**: TBD

What is the best way to document components, both for humans and LLMs to read? Which style is most understandable, how verbose should prop descriptions be, etc.?

Can we take a (design system-like) project and turn it from undocumented to fully documented, and learn from that process?

Can we make a structure around this, exposing an MCP tool/skill that users can use to inject key documentation how-to knowledge into their agent, and then make the agent document all their components?

Resources to learn from:

- https://bennypowers.dev/cem/docs/usage/effective-mcp-descriptions/
- https://bennypowers.dev/cem/docs/usage/documenting-components/


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking: SB MCP Preview #138

Milestones

Milestone 1: Loose ends

Milestone 2: MCP Test toolset

Milestone 3: MCP Composition

Milestone 4: Improving React prop types extraction

Milestone 5: Documentation

Milestone 6: QA

Milestone 7 - Stretch: Expose Component Coverage

Milestone 8 - Stretch: Component Documentation Best Practices

Sub-issues

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Tracking: SB MCP Preview #138

Description

Milestones

Milestone 1: Loose ends

Milestone 2: MCP Test toolset

Milestone 3: MCP Composition

Milestone 4: Improving React prop types extraction

Milestone 5: Documentation

Milestone 6: QA

Milestone 7 - Stretch: Expose Component Coverage

Milestone 8 - Stretch: Component Documentation Best Practices

Sub-issues

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions