Agent SkillsAgent Skills

Skill Analysis

Conflict detection, context budgeting, diffing, and benchmarking

doctor --deepbudgetdiffbench

doctor --deep#

Deep conflict analysis across all installed skills.

skills doctor --deep

Extends the existing doctor command with a --deep flag that runs three conflict detection strategies:

StrategyDescription
Keyword ContradictionDetects conflicting instructions (e.g., "use tabs" vs "use spaces")
Topic OverlapJaccard similarity on section headings to find duplicate coverage
Rule ExtractionExtracts imperative instructions and compares across skills

Output:

🩺 Agent Skills Doctor
  βœ“ Installed skills: 16/16 valid

πŸ” Deep Conflict Analysis
  βœ“ No conflicting instructions found.
  βœ“ No topic overlaps found.

When conflicts exist, you'll see severity levels and estimated wasted tokens.


budget#

Smart context budget manager β€” loads only the most relevant skills within a token limit.

skills budget -b <tokens> [options]
OptionDescription
-b, --budget <tokens>Token budget (required, e.g. 8000)
-f, --format <format>Output format: text, xml, json (default: text)
-m, --min-relevance <score>Minimum relevance score (0–100, default: 10)
-p, --project <dir>Project directory to analyze (default: cwd)
--list-onlyShow ranked list without selecting

Relevance scoring (no LLM required):

  1. β€’File extension matching (project file types β†’ skill language keywords)
  2. β€’Dependency matching (package.json, requirements.txt, Cargo.toml)
  3. β€’Keyword density (skill body vs project file/directory names)
  4. β€’Description match against all project signals

Examples:

skills budget -b 8000                    # Text output with relevance bars
skills budget -b 4000 --format xml       # Agent-ready XML
skills budget -b 10000 --format json     # Machine-readable
skills budget -b 6000 --min-relevance 30 # Only high-relevance skills

Output:

πŸ“Š Context Budget Plan
  Budget: 8000 tokens | Skills found: 21

  βœ… Loading 4 skill(s):
    β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘ skill-creator (4426 tokens, 67% relevant)
    β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘ docx (2538 tokens, 64% relevant)
    β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ skill-installer (704 tokens, 23% relevant)
    β–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ test-skill (106 tokens, 10% relevant)

  Summary: Used 7774 / 8000 tokens

diff#

Section-aware comparison between two skills.

skills diff <skill-a> <skill-b> [options]
OptionDescription
--jsonOutput as JSON

Parses each SKILL.md by headings and compares:

  • β€’Added sections β€” only in skill B
  • β€’Removed sections β€” only in skill A
  • β€’Changed sections β€” same heading, different content (with line delta)
  • β€’Token delta β€” how many tokens differ

Examples:

skills diff frontend-design frontend-code-review
skills diff ./skill-a ./skill-b --json

Output:

πŸ“Š Skill Diff: frontend-design vs frontend-code-review
  βž• Added: 11 sections
  βž– Removed: 2 sections
  ✏️  Changed: 1
  Token delta: -358

bench#

Benchmark and compare skills by quality, size, and coverage.

skills bench [skills...] [options]
OptionDescription
-a, --allBenchmark all installed skills
--sort <field>Sort by: quality, tokens, name (default: quality)
--jsonOutput as JSON
--min-quality <n>Filter skills below this quality score (0–100)

Quality scoring (0–100):

  • β€’Frontmatter with name (+10) and description (+10)
  • β€’Section headings (+15/+5), code blocks (+15/+5)
  • β€’Has examples (+10), has instructions (+10)
  • β€’Appropriate token range (+10), no TODOs (+10)

Examples:

skills bench --all                  # All skills, sorted by quality
skills bench --all --sort tokens    # Sorted by size
skills bench --min-quality 80       # Only high-quality skills

Output:

πŸ“ˆ Skill Benchmark Results
  Skill                     Quality    Tokens   Sections   Code   Features
  ─────────────────────────────────────────────────────────────────────────
  add-uint-support          β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 2311     24         13     πŸ“ πŸ’‘ πŸ“‹
  frontend-code-review      β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 711      11         2      πŸ“ πŸ’‘ πŸ“‹
  frontend-design           β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘ 1069     2          0      πŸ“ πŸ’‘ πŸ“‹

  Summary: 17 skills | Avg quality: 90% | Total tokens: 41279
  Legend: πŸ“ frontmatter  πŸ’‘ examples  πŸ“‹ instructions