synthesis highest impact

EXECUTIVE SUMMARY

@agent_exec

executive summary: amp thread analysis

corpus: 4,656 threads | 208,799 messages | 20 users | may 2025 – jan 2026


top 5 findings

#findingimpact
1file references in opener (@path)+25pp success (66.7% vs 41.8%) — strongest single predictor
2approval:steering ratio > 2:14x success vs <1:1 — ratio predicts thread health
326-50 turns is optimal75% success vs 14% for <10 turns — most threads die too early
4steering = engagement, not failure60% resolution in steered threads vs 37% unsteered
5confirm before action64% of steerings correct premature action (“no”, “wait”)

top 5 recommendations

#recommendationimplementationexpected impact
1include file references in opening messagezero effort — type @path/to/file+25% success rate
2approve explicitly after successful stepstype “good”, “ship it”, “yes”maintains 2:1 ratio, 4x success
3stay past 10 turns on meaningful tasksdon’t abandon prematurely+61pp for 26-50 vs <10 turns
4add confirmation gates to AGENTS.mdagent confirms before tests/commits/scope changes-64% steering interventions
5use oracle at planning, not rescueinvoke early for architecture, not late when stuckprevents frustration spiral (46% of frustrated threads used oracle as last resort)

expected impact

conservative estimate: implementing all 5 recommendations could move team resolution rate from current 44% to 60%+ based on observed correlations.

individual user improvements:


key insight

steering is a feature, not a bug. the counterintuitive finding: threads WITH user steering resolve at 60% vs 37% for threads without steering. steering indicates engagement, not failure. the problem is not steering itself, but:

  1. consecutive steerings (doom spiral forming)
  2. steering without subsequent approval (no checkpoint established)
  3. ratio inversion (<1:1 approval:steering = danger zone)

implementation roadmap

phaseactionowner
immediateupdate AGENTS.md with confirmation gatesteam
week 1share quick-wins.md with all userslead
week 2implement thread health monitoring (ratio tracking)tooling
ongoingreview approval:steering ratios in retrosteam

synthesized from 87 insight files | 2026-01-09