skill recommendations
based on analysis of 4,656 threads across amp users.
TL;DR
| skill | current usage | recommendation | priority |
|---|---|---|---|
| dig | ~1 invocation | USE MORE | HIGH |
| write | ~1 invocation | USE MORE | MEDIUM |
| oracle (tool) | 25% of threads | USE EARLIER | MEDIUM |
| coordinate | ~1 invocation | USE FOR COMPLEX WORK | MEDIUM |
| platform-sre | ~1 invocation | USE FOR DEBUG | MEDIUM |
| Task (tool) | 40% of threads | REFINE USAGE | LOW |
| report | 97% of skill loads | FINE AS-IS | - |
severely underutilized skills
dig (systematic investigation)
current state: 1 explicit invocation across 4,656 threads
why this matters:
- 47% of steerings are flat rejections (“no…”) — users correct agent’s approach
- 87% recovery rate after steering, but 2+ consecutive steerings = doom spiral
- 8/14 FRUSTRATED threads had complex debugging that would benefit from systematic investigation
evidence from thread analysis:
- “debugging dataset queue starvation and capacity deadlock” — failed threads that needed hypothesis-driven approach
- FRUSTRATED threads average 80 messages vs RESOLVED at 60 — thrashing without structure
recommendation: invoke dig skill for:
- any debugging task involving emergent behavior
- root cause analysis (not just symptom chasing)
- investigations spanning multiple files/systems
- when first approach fails and you’re about to try second
write (technical prose)
current state: 1 explicit invocation
why this matters:
- assistant brevity analysis shows inconsistent output formatting
- documentation and PR descriptions vary wildly in quality
- threads with clear, structured communication patterns succeed more
recommendation: invoke write skill for:
- README updates
- PR descriptions
- jsdocs on complex code
- any prose meant for developer consumption
tools to use differently
oracle — use earlier, not as rescue
current state:
- 25% of RESOLVED threads use oracle
- 46% of FRUSTRATED threads use oracle
- FRUSTRATED threads invoke oracle early + repeatedly when already stuck
the insight: oracle is reached for when things go wrong, but late oracle (for review/validation) has 82.8% success vs early oracle at 78.8%. not a huge gap, but pattern is clear.
the real problem: oracle isn’t being used for PLANNING. it’s being used for RESCUE.
recommendation:
- invoke oracle at thread start for complex tasks (planning)
- invoke oracle after implementation for review (validation)
- avoid repeated oracle calls when stuck — this signals you need dig skill instead
Task — use deliberately with scoped tasks
current state:
- 40.5% of RESOLVED threads use Task
- 61.5% of FRUSTRATED threads use Task (counterintuitive!)
- 2-6 tasks per thread = 77-78% resolution (optimal)
- 11+ tasks = 58% resolution (coordination overhead)
the insight: FRUSTRATED threads over-delegate. successful Task usage is SCOPED:
- “fix X in file Y” → works
- “execute project plan” → fails
recommendation:
- cap at 2-6 concurrent tasks
- use imperative verbs: fix, implement, update, add
- include file paths in task description
- delegate during NEUTRAL phases, not after steering (72% of successful delegations are proactive)
- DON’T delegate: debugging, exploration, context-dependent work
skills to consider using more
coordinate
current state: 1 explicit invocation
use case: complex multi-agent orchestration with bidirectional communication
when to invoke:
- parallel workstreams that need synchronization
- tasks requiring explicit state handoff
- when spawn depth would exceed 5-7 levels
caution: coordination overhead can become the task. 62.5% of spawned threads are orphans (never properly closed).
platform-sre
current state: 1 explicit invocation
use case: incident response, observability queries, production debugging
when to invoke:
- production incidents or log investigation
- debugging with observability data available
- hypothesis-driven triage in complex systems
skills working fine
report
current state: 97% of skill invocations
interpretation: this is architectural, not user behavior. spawned subagents load report as part of the spawn workflow. no change needed.
meta-insight: skill discovery problem
rare skill usage (~1 invocation each for dig, write, platform-sre, coordinate) suggests:
- users don’t know skills exist
- users don’t know when to invoke them
- agents don’t auto-suggest skills
recommendation for amp:
- surface skill suggestions based on task patterns
- “this looks like a debug task — consider loading dig skill”
- or auto-load skills when keywords match (e.g., “debug”, “investigate” → dig)
user-specific recommendations
@verbose_explorer (you)
based on @verbose_explorer-specific analysis (CORRECTED):
- 83% resolution rate (top tier)
- 231 spawned subagents at 97.8% success
- 4.2% handoff rate
- verbose prompts (932 chars avg) — effective for spawn context
skill recommendations for @verbose_explorer:
- coordinate for complex multi-agent work — already your strength
- write for documentation — helps structure thinking
- oracle for PLANNING at thread start
- Task with explicit scope — you already provide good context
compiled from insights/skill-usage.md, tool-patterns.md, oracle-timing.md, task-delegation.md, spawn-vs-inline.md