What are Claude Code Skills?

Claude Code Skills are reusable prompt templates stored as markdown files in your project's .claude/skills/ directory. They let you codify best practices and common workflows into slash commands that Claude follows consistently.

How do I create a Claude Code Skill?

Create a markdown file in .claude/skills/ with instructions for Claude. The filename becomes the command name — for example, .claude/skills/code-review.md can be invoked with /code-review during a Claude Code session.

Can I share skills with my team?

Yes. Skills stored in .claude/skills/ can be committed to version control so every team member has access to the same standardized workflows. You can also organize skills into subdirectories by category.

What is the difference between project and global skills?

Project skills (.claude/skills/) are specific to the current project and are committed to git. Global skills (~/.claude/skills/) are personal skills available across all your projects.

smithers

New

49Community RegistryDevelopmentby William Cory

Smithers (smithers-orchestrator): write durable JSX workflows for Claude Code (SQLite-backed state, phases/steps, retries, observability).

Community PluginView Source

Overview

Run long-horizon coding-agent work as durable workflows.

![npm](https://www.npmjs.com/package/smithers-orchestrator) ![CI](https://github.com/smithersai/smithers/actions/workflows/ci.yml) ![License: MIT](#license) ![Docs](https://smithers.sh)

Tell your coding agent to do real, multi-step work, then Smithers runs it for minutes or days with crash recovery, retries, human approvals, and full observability. The same workflow runs across Claude Code, Codex, Pi, AI SDK models, and remote sandboxes.

!Live runs in Smithers Studio: some succeeded, some running, some paused on an approval gate, every run resumable and rewindable.

A workflow run is a list of steps you can watch, pause, approve, and rewind. The run above shows several in flight at once.

What you get

•🛡️ Durable runs that survive crashes: every completed step is persisted the moment it

finishes, so a run resumes from where it stopped instead of starting over.

•🔌 Any agent, any model: Claude Code, Codex, Pi, Antigravity, and more, plus any model

through the AI SDK. Swap the harness without rewriting the workflow.

•🛠️ Higher-quality output: review loops, human approvals, and evals give agents the

structure that real work demands.

•🧩 Dozens of ready-to-run workflows: planning, implementation, review, debugging,

tickets, audits, and long-horizon missions. Your agent can author new ones.

When to use Smithers

You want to…	Smithers?
Get one answer from one prompt	No, call the model directly
Ship a chatbot or a simple tool-calling app	Usually no
Let a coding agent change a repo across many steps	Yes
Pause for a human approval, then resume later	Yes
Run several agents that review, retry, and converge	Yes
Survive crashes and replay, fork, or rewind a run	Yes
Run durable backend jobs unrelated to agents	Consider Temporal or Inngest
Host stateful per-user session agents	Consider Cloudflare Agents

Smithers is the durable runtime for coding-agent work: when the unit of work is an agent editing a real repository over many steps, and you need that work to be inspectable, approvable, and recoverable.

Prompt your agent

Smithers is driven by your coding agent, not a GUI you click. Your agent runs Smithers on your behalf: it scaffolds workflows, kicks off runs, watches them, and handles approvals.

The fastest way to make your agent fluent is two fan-out commands. They install the Smithers skill and register the MCP server into every coding agent on your machine (Claude Code, Codex, Cursor, Copilot, Pi, Hermes, OpenClaw, and ~20 more):

bash

bunx smithers-orchestrator skills add   # install the skill set into every detected agent
bunx smithers-orchestrator mcp add      # register Smithers as an MCP server everywhere

Then just ask:

"orchestrate an agent to add rate limiting and keep iterating until the tests pass."

Your agent picks the right workflow, starts the run, and keeps going through retries and review loops until the work is actually done.

See Agent Support for the per-agent setup (skill, MCP, instructions) for Claude Code, Codex, Cursor, Copilot, Pi, Hermes, and OpenClaw.

Wire one agent by hand? If you'd rather drop the curated onboarding skill into a single agent directly:

bash

mkdir -p ~/.claude/skills/smithers
curl -fsSL https://raw.githubusercontent.com/smithersai/smithers/main/skills/smithers/SKILL.md \
  -o ~/.claude/skills/smithers/SKILL.md
curl -fsSL https://smithers.sh/llms-full.txt \
  -o ~/.claude/skills/smithers/llms-full.txt

See `skills/smithers/` for the onboarding skill.

Quick start

Prefer to drive it yourself from the CLI? Start here.

bash

# scaffold the workflow pack into .smithers/
bunx smithers-orchestrator init

# turn a request into a practical implementation plan
bunx smithers-orchestrator workflow run plan --prompt "add rate limiting, audit logging, and API key rotation"

init scaffolds a .smithers/ folder preloaded with production-ready workflows. Once that's in place, you can chain a request from tickets to implementation:

bash

# break a request into ticket files under .smithers/tickets/
bunx smithers-orchestrator workflow run tickets-create --prompt "add rate limiting, audit logging, and API key rotation"

# implement the tickets, each in its own worktree branch
bunx smithers-orchestrator workflow run kanban

Run bunx smithers-orchestrator starters to browse plain-English starters, and bunx smithers-orchestrator workflow list to see what's installed.

Watch your runs

Whether your agent started a run or you did, you can see exactly what's happening:

bash

bunx smithers-orchestrator ps              # list active, paused, and recently completed runs
bunx smithers-orchestrator inspect RUN_ID  # steps, agents, approvals, and outputs for one run
bunx smithers-orchestrator logs RUN_ID     # tail the event log
bunx smithers-orchestrator chat RUN_ID     # read the agent's chat output

ps shows you what needs attention (a paused approval, a recent failure); inspect drills into a single run so you can follow each step and agent as it works.

Durable by default

Durability is the differentiator. Runs survive crashes, restarts, and flaky tools because every completed step is persisted to SQLite the moment it finishes. The runtime always knows what's done and what to run next. Approvals, human questions, retries, and replay are first-class.

text

prompt → render workflow → run task → validate output → persist to SQLite → re-render → resume · inspect · replay

That loop is the whole model: a task runs, its output is validated against a schema and written down, then the workflow re-renders from persisted state to decide the next task. A crash at any point resumes from the last write, not from the top.

bash

bunx smithers-orchestrator up workflow.tsx --input '{"description":"Fix bug"}'
bunx smithers-orchestrator up workflow.tsx --run-id abc123 --resume true   # resume after a crash
bunx smithers-orchestrator rewind abc123 --frame 4                          # time-travel to an earlier frame
bunx smithers-orchestrator fork abc123                                      # branch an alternate timeline
bunx smithers-orchestrator replay abc123                                    # replay from a checkpoint

Any agent, any model

Smithers doesn't bet on one lab or one harness. Point a task at whichever agent is best for the job, mix several in one workflow, and switch freely. The workflow doesn't change when the model does, so a frontier model can plan, a fast model can fan out, and a specialized harness can do the edits.

Agents that run tasks

Agent	How it runs
Claude Code	CLI harness
Codex	CLI harness
Pi	CLI harness
Antigravity	CLI harness
Any AI SDK model	SDK agent, with tools, structured output, and MCP

Sandboxes that isolate them

The same <Sandbox> primitive runs an agent locally or on a remote provider with no change to the workflow:

Target	Notes
Local	default, runs on your machine
gVisor	syscall-isolated containers
Kubernetes	your own cluster
Freestyle · Daytona · Cloudflare	managed remote sandboxes

To drive Smithers itself, the `skills add` / `mcp add` commands also wire Cursor, Copilot, Hermes, OpenClaw, and ~20 more coding agents.

Built-in workflows

bunx smithers-orchestrator init installs a pack of ready-to-run workflows. Point your agent at one and go via bunx smithers-orchestrator workflow run WORKFLOW_ID --prompt "...":

Build

Workflow	What it does
`implement`	Implement a focused change with validation and review feedback loops.
`research-plan-implement`	Research a request, produce a plan, then implement it with validation and review.
`ticket-create` / `tickets-create`	Turn a request into one or many structured implementation tickets.
`kanban`	Implement ticket files in worktree branches, board-style.

Plan

Workflow	What it does
`plan`	Create a practical implementation plan before code changes begin.
`research`	Gather repository and external context before planning or building.
`grill-me`	Ask targeted questions until vague requirements become actionable.
`mission`	Run long-horizon work as approved milestones with focused workers and validation.

Quality

Workflow	What it does
`review`	Review current repository changes with one or more configured agents.
`debug`	Reproduce, fix, validate, and review a reported bug.
`improve-test-coverage`	Find and add high-impact missing tests for the repository.
`audit`	Audit feature groups for tests, docs, observability, and maintainability gaps.
`feature-enum`	Build or refine a code-backed feature inventory for a repository.
`ralph`	Keep working continuously on an open-ended maintenance prompt.

See `docs/workflows/` for the full pack.

Examples

The `examples/` folder has 100+ runnable workflows, one per orchestration pattern. Copy one as a starting point:

![Every orchestration pattern we could find: 100+ real, runnable Smithers workflows in one folder.](./examples)

Example	Pattern
`code-review-loop`	Implement → review → fix, looped until approved.
`coverage-loop`	Run tests, measure coverage, write tests, repeat to target.
`panel`	N specialist agents review in parallel, a moderator synthesizes.
`supervisor`	A boss agent plans and delegates to workers dynamically.
`fan-out-fan-in`	Split work across N parallel agents, aggregate results.
`parallel-tickets`	Triage, run waves of work in parallel, merge-queue the results.
`migration`	Plan → transform files → validate → report.
`pr-shepherd`	Watch a PR, gather context, leave structured review comments.
`playwright-test-agent`	Plan E2E flows, generate Playwright tests, run/heal until stable.
`adaptive-rag-citation-loop`	Retrieve evidence, draft cited answers, grade grounding, retry.
`sql-analyst-dashboard`	Discover schema, check read-only SQL, execute, summarize with a chart.
`rfp-response-room`	Extract RFP requirements, draft cited answers, review, package.
`calendar-negotiator-with-approval`	Rank meeting slots and write calendar state only after approval.
`document-exception-queue`	Extract document packets, reconcile them, route exceptions.

That's fourteen of 100+. The folder also covers debates, canary judging, SLO-breach explainers, repo janitors, and dozens more. Browse the full set in `examples/`.

Author your own

The built-in workflows are normal Smithers TSX files: run them as-is, have your agent adapt them to your repo, or have it write new ones from the same primitives. A workflow is a JSX tree of tasks:

tsx

import { createSmithers, Sequence } from "smithers-orchestrator";
import { z } from "zod";

const { Workflow, Task, smithers, outputs } = createSmithers({
  analyze: z.object({
    summary: z.string(),
    severity: z.enum(["low", "medium", "high"]),
  }),
  fix: z.object({
    patch: z.string(),
    explanation: z.string(),
  }),
});

export default smithers((ctx) => (
  <Workflow name="bugfix">
    <Sequence>
      <Task id="analyze" output={outputs.analyze} agent={analyzer}>
        {`Analyze the bug: ${ctx.input.description}`}
      </Task>

      <Task id="fix" output={outputs.fix} agent={fixer}>
        {`Fix this issue: ${ctx.latest("analyze").summary}`}
      </Task>
    </Sequence>
  </Workflow>
));

Each task output is validated against its Zod schema and persisted to SQLite. If the process crashes, Smithers resumes from the last completed node without re-running finished work.

Components

Component	Purpose
`<Workflow>`	Root container
`<Task>`	AI or static task node
`<Sequence>`	Ordered execution
`<Parallel>`	Concurrent execution
`<Branch>`	Conditional execution
`<Ralph>`	Loop until a condition is met

tsx

<Ralph until={ctx.latest("validate")?.approved} maxIterations={5}>
  <Task id="implement" output={outputs.implement} agent={coder}>
    Fix based on feedback
  </Task>

  <Task id="validate" output={outputs.review} agent={reviewer}>
    Review the implementation
  </Task>
</Ralph>

There are many more: approvals, merge queues, sub-workflows, signals, timers, sagas, and composite patterns. See Components.

Deeper capabilities

•Observability: every run emits Prometheus metrics and OpenTelemetry traces. Bring up

the local stack with bunx smithers-orchestrator observability up (Grafana, Prometheus, Tempo, OTLP collector) and serve metrics with bunx smithers-orchestrator up workflow.tsx --serve --metrics.

•Evals: run repeatable workflow regressions from JSON/JSONL cases with

bunx smithers-orchestrator eval workflow.tsx --cases evals/smoke.jsonl --suite smoke; the command exits non-zero when any case fails.

•Prompt optimization: run GEPA-style optimization against an eval suite with

bunx smithers-orchestrator optimize, which writes an optimized prompt artifact only when the score improves.

•Hot reload: edit prompts, config, agent settings, or JSX structure mid-run with

bunx smithers-orchestrator up workflow.tsx --hot. In-flight tasks finish on their original code; only newly scheduled tasks pick up changes.

•Scale across machines: run agents in remote sandboxes via the <Sandbox> primitive

(provider table above) with no change to the workflow. Start from `examples/freestyle-sandbox-provider` and the Sandbox component.

Security and safety

Smithers is built for agents that modify real repositories, so control is wired into the runtime:

•Approvals: gate destructive or risky steps behind a human approve / deny before

they run.

•Inspectable: every step, tool call, and output is persisted and replayable, so you

can see exactly what an agent did and why.

•Reversible: rewind, fork, or replay any run from a checkpoint instead of living

with whatever the agent left behind.

•Isolated: run agents in a sandbox (gVisor, Kubernetes,

Freestyle, Daytona, Cloudflare) so edits never touch your host.

•Auditable: every run emits Prometheus metrics and OpenTelemetry traces.

Docs

Full documentation lives at [smithers.sh](https://smithers.sh).

License

MIT

Install & Usage

Create the skills directory

mkdir -p .claude/skills

Download the skill file

mkdir -p .claude/skills && curl -o .claude/skills/smithers.md https://raw.githubusercontent.com/evmts/smithers/main/SKILL.md

Invoke in Claude Code

/smithers

View source on GitHub

Frequently Asked Questions

What is smithers?

Smithers (smithers-orchestrator): write durable JSX workflows for Claude Code (SQLite-backed state, phases/steps, retries, observability).

How to install smithers?

To install smithers, create the .claude/skills directory in your project, then run the curl command to download the skill file. Once installed, invoke it in Claude Code with /smithers.

What is smithers best for?

smithers is a community categorized under Development. Created by William Cory.

Overview

What you get

When to use Smithers

Prompt your agent

Quick start

Watch your runs

Durable by default

Any agent, any model

Built-in workflows

Examples

Author your own

Components

Deeper capabilities

Security and safety

Read next

Docs

License

Install & Usage

Frequently Asked Questions

What is smithers?

How to install smithers?

What is smithers best for?