2.1Anthropic's skill-creator is the canonical test-measure-refine loop · 官方技能生成器 FACT
Anthropic publishes a skill-creator skill that runs the create-edit-eval-benchmark loop with four subagents: executor (runs the skill on test cases), grader (scores against rubric), comparator (A/B against alternatives), analyzer (proposes refinements). It is the operational artifact of the autoresearch axiom — single-pass skill drafts hit a consensus ceiling; this loop breaks through by attacking the weakest rubric dimension each round.
Available via the Anthropic skills marketplace; on this machine it lives at ~/.claude/skills/anthropic-skills/skill-creator/. Used to produce /why-token, /tokenomics, and the planned /landscape skill.
Source Anthropic skills marketplace · ~/.claude/rules/axioms/001_iterative_refinement.md
2.2Slash commands beat prompt repetition · 斜杠命令优于重复提示 FACT
If you have re-typed the same multi-line instruction more than twice, it should be a slash command. The economics are obvious in retrospect — a skill costs ~20 minutes to write once and amortizes across thousands of invocations — but in practice most users keep typing. Symptoms of underutilization: bookmarked prompt snippets, copy-paste from Notes app, recurring sentences in CLAUDE.md that fire only for one task type.
Practitioner heuristic: third repeat = write the skill. On this machine, 20 skills (/why-token, /tokenomics, /last30days, /deep-research, /x-control-chase, /ship, /qa, /loop, /schedule, etc.) cover the bulk of weekly work; without them the CLAUDE.md would have to be 3× its current size.
Source ~/.claude/skills/ directory · ~/.claude/rules/skills/INDEX.md
2.3Marketplace economics: skills as personal IP · 技能即个人知识产权 OPINION
A skill is a packaged opinion. The marketplace is therefore a stage where opinions about how work should be done get distributed. This is closer to a substack than to npm — quality is opinionated, not commodity. Sahil Lavingia's OPC-skills (minimalist entrepreneur frameworks: /find-community, /validate-idea, /mvp, /processize) demonstrates the model: ten skills encoding one person's worldview, useful precisely because the author has taste.
Implication for practitioners: publishing skills is a higher-leverage form of content than writing about workflows. Each skill the user installs is a vote for that author's mental model.
Source Sahil Lavingia OPC-skills · marketplace usage on this machine (Anthropic + OPC sources)
2.4Model-version durability: well-designed skills survive 4.5→4.6→4.7 · 跨模型版本耐用度 FACT
The fear that skills will rot across model upgrades is largely unfounded if the skills are written as outcomes + constraints (axiom 010) rather than step-by-step procedures. On this machine: of ~20 skills installed before Sonnet 4.5, the upgrades to 4.6 and 4.7 required two refactors (skills that had hard-coded a model name or token budget); the rest continued to work unchanged. Sample is small; the pattern matches the broader practitioner consensus that outcome-shaped skills survive model upgrades better than procedure-shaped ones.
Practitioner rule: never reference a specific model in a skill body; never assume a specific context window; treat the model as a capability not a fixed signature.
Source this machine's skill upgrade audit 2025-12 → 2026-05 · axiom 010
2.5When skill beats prompt — and when it doesn't · 技能与提示的边界 OPINION
Skills beat prompts when the workflow is stable, repeated, and benefits from named structure. Prompts beat skills when the request is one-off, exploratory, or requires negotiated scope with the user. The failure mode is over-skillification — wrapping every prompt in a skill produces a brittle DSL that nobody remembers how to use. Symptoms: skills with 1 invocation per quarter, skills that always need 3 arguments to do anything useful, skills that get rewritten more often than invoked.
Heuristic: a skill must be invoked at least 5 times before it earns its keep. Lower than that, revert to prompt and save the cognitive load.
Source author observation across 20-skill setup · ~/.claude/rules/skills/INDEX.md
2.6Skill granularity: atomic actions vs orchestrating skills · 技能粒度 OPINION
Two valid skill shapes. Atomic: one verb, one outcome (/ship lands a PR, /qa runs the test loop). Orchestrating: multi-phase, chains atomic skills (/x-control-chase runs pulse + drafts + posts). Orchestrating skills are higher-leverage but riskier — they fail in more places and are harder to debug. Atomic skills compose better.
Practitioner rule: build atomic first; only build the orchestrator after three atomic skills have been used together more than five times. Premature orchestration is the skill-system version of premature abstraction (axiom 006).
Source author skill design experience · axiom 006