BeClaude

jro-fable

New
GitHub TrendingGeneralby JRO424

Maximize Fable 5 judgment while minimizing token cost. Auto-triggers on token-heavy tasks running on Fable: large repo scans, multi-file changes, log analysis, broad research, vault audits, transcript processing. Routes bounded work to Sonnet/Haiku subagents; keeps architecture, synthesis, and final review on Fable. Check whether the task genuinely requires Fable-tier reasoning before invoking — if Sonnet or Opus suffices, skip the orchestration layer. Trigger phrases: jro fable, efficient fable, save tokens, delegate, fan out, orchestrate, too expensive, token budget.

Community PluginView Source

Overview

JRO-Fable

Fable 5 costs $10/$50 per MTok — 2x Opus, 3.3x Sonnet, 10x Haiku. Every token Fable spends reading files, scanning repos, or reducing logs is 10x what Haiku would charge for the same work. Spend Fable tokens on judgment. Spend cheaper tokens on everything else.

Pre-flight

Before orchestrating, ask: does this task actually need Fable? If Sonnet or Opus can handle it end-to-end, use that model directly — no orchestration layer needed. This skill only applies when the task genuinely requires Fable-tier reasoning AND has delegatable subtasks.

Model Routing Table

Work typeModelWhy
File search, grep, directory listing, log scanningmodel: "haiku"Pattern matching, no reasoning needed. 10x cheaper.
Inventory / summarize a codebase or doc setmodel: "haiku"Bulk reading with structured extraction.
Bounded edits (≤3 files, known scope)model: "sonnet"Good code generation at 3.3x cheaper.
Multi-file implementation, refactoringmodel: "sonnet"79.6% SWE-bench. Right capability/cost balance.
Test execution, browser checks, log reductionmodel: "sonnet"Fable decides direction; Sonnet executes and reports.
Security review, deep code analysismodel: "sonnet"Adequate for most review; Fable re-checks high-impact findings.
Task decomposition, architecture, tradeoffsFable (self)This is what you're paying for.
Conflict resolution across subagent reportsFable (self)Cross-cutting judgment.
Final diff review, risk assessment, user synthesisFable (self)Truth-judgment stays here.

Haiku breaks even only if its error rate stays below 20%. If a Haiku agent's output requires Sonnet-level correction more than 1 in 5 times, the re-prompting cost negates the savings. Monitor and escalate.

How to Delegate

Use the Agent tool with the model parameter:

code
Agent({
  description: "Scan repo for auth patterns",
  model: "haiku",
  prompt: "Search /path/to/repo for authentication..."
})

For parallel independent work, send multiple Agent calls in a single message — they run concurrently (capped at ~16).

For structured multi-stage work, use the Workflow tool with pipeline() (no barrier, items flow through stages independently) or parallel() (barrier, waits for all before proceeding).

Subagents cannot spawn subagents — this is enforced. If you need hierarchical delegation, use Workflows.

Output Contracts

Every subagent prompt must specify the return format. Subagent results inject into Fable's context verbatim — uncontrolled prose wastes tokens.

See DELEGATION.md for named presets with output contracts for each delegation type.

General rules for all subagent returns:

  • Max 500 tokens unless the task explicitly requires more.
  • Structured over prose: paths, line numbers, commands, pass/fail.
  • End with STOP_REASON: (scope-complete | hit-stop-condition | ambiguous).
  • End with UNCERTAINTY: if any doubt about the findings.

Handoff Packets

Write delegated prompts as if the subagent just walked into the room. Include only:

  • Repo path and exact objective.
  • Files/packages in scope. Anything explicitly out of scope.
  • Evidence format to return (reference the output contract).
  • Verification commands or browser flows, plus what success looks like.
  • Stop conditions: if the code doesn't match, a command fails after

reasonable retry, or the task needs out-of-scope files — stop and report instead of improvising.

Pass relevant file slices, not re-read instructions. If you already have a file in context, include the relevant lines in the prompt rather than asking the subagent to re-read it — that's double-paying for the same tokens.

When NOT to Delegate

Delegation has overhead: the routing turn costs Fable tokens, and the return packet consumes context. Not every task pays off.

Keep on Fable when:

  • Task is ≤2 tool calls — just do it.
  • Files are already in your context — re-reading in a subagent wastes tokens.
  • Every edit step requires a judgment call — coordination cost exceeds savings.
  • Scope is ambiguous enough that the handoff packet needs full context anyway.
  • Single-file bug with obvious root cause — the round trip adds no value.
  • The task output is <500 tokens of Sonnet work — routing overhead ≈ savings.

Delegation break-even rule: Subagent work should consume 2K+ tokens to justify the routing overhead. Below that, do it directly.

Vetting Delegated Work

Treat subagent reports as leads, not facts. Before using a high-impact finding, opening a PR, or telling the user the work is done:

  1. Reopen the important cited files yourself.
  2. Confirm line references and failure reports match reality.
  3. Review the final diff against the original task.

Let lighter agents gather signal. Keep truth-judgment with Fable.

Common Scenarios

Treat as soft defaults, not rigid rules:

  • Research: Haiku agents scan docs, repos, APIs. Fable decides what

evidence changes the plan.

  • Coding: Sonnet agents do bounded edits or candidate patches. Fable

owns shared-file coordination, integration, and final review.

  • Testing: Fable chooses the validation direction. Sonnet agents run

tests, browser flows, screenshots, and log reduction. They report commands, failures, likely causes, and whether failures look flaky, environmental, or real.

  • Debugging: Haiku/Sonnet agents cluster logs, reproduce issues, try

small fixes. Fable picks the most trustworthy diagnosis.

  • Audits: Haiku agents inventory and scan in parallel batches. Fable

synthesizes findings and makes merge/restructure decisions.

Cost Reference

ModelInput/MTokOutput/MTokvs Fable
Fable 5$10.00$50.001x
Opus 4.8$5.00$25.002x cheaper
Sonnet 4.6$3.00$15.003.3x cheaper
Haiku 4.5$1.00$5.0010x cheaper

Worked example: 10 research subagent calls, 2K input + 500 output each.

SetupCostSavings
All Fable$0.55
Fable orchestrator + Sonnet subagents$0.2457%
Fable orchestrator + Haiku subagents$0.1376%

Prompt caching on stable system content (0.1x read price after first call) compounds these savings further.

Diagram

Use assets/fable-orchestrator.excalidraw when a visual explanation helps.

Install & Usage

1
Create the skills directory
mkdir -p .claude/skills
2
Download the skill file
mkdir -p .claude/skills && curl -o .claude/skills/jro-fable.md https://raw.githubusercontent.com/JRO424/jro-fable/main/SKILL.md
3
Invoke in Claude Code
/jro-fable
View source on GitHub
code-reviewagent

Frequently Asked Questions

What is jro-fable?

Maximize Fable 5 judgment while minimizing token cost. Auto-triggers on token-heavy tasks running on Fable: large repo scans, multi-file changes, log analysis, broad research, vault audits, transcript processing. Routes bounded work to Sonnet/Haiku subagents; keeps architecture, synthesis, and final review on Fable. Check whether the task genuinely requires Fable-tier reasoning before invoking — if Sonnet or Opus suffices, skip the orchestration layer. Trigger phrases: jro fable, efficient fable, save tokens, delegate, fan out, orchestrate, too expensive, token budget.

How to install jro-fable?

To install jro-fable, create the .claude/skills directory in your project, then run the curl command to download the skill file. Once installed, invoke it in Claude Code with /jro-fable.

What is jro-fable best for?

jro-fable is a community categorized under General. It is designed for: code-review, agent. Created by JRO424.