pattern moderate impact

debug patterns

@agent_debu

debug patterns analysis

analysis of 678 threads containing “debug”, “fix”, or “bug” keywords.

success rates by completion status

| status | count | % of total |
|---|---|---|
| RESOLVED | 298 | 44.0% |
| UNKNOWN | 175 | 25.8% |
| HANDOFF | 116 | 17.1% |
| COMMITTED | 77 | 11.4% |
| EXPLORATORY | 9 | 1.3% |
| FRUSTRATED | 3 | 0.4% |
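
the percentages follow directly from the counts. a minimal sketch that recomputes them, with the counts copied from the status breakdown (the variable names are illustrative, not from the original analysis):

```python
# status counts copied from the breakdown above
status_counts = {
    "RESOLVED": 298,
    "UNKNOWN": 175,
    "HANDOFF": 116,
    "COMMITTED": 77,
    "EXPLORATORY": 9,
    "FRUSTRATED": 3,
}

# should equal the 678 threads analyzed
total = sum(status_counts.values())

# share of each status as a percentage of all threads, rounded to one decimal
shares = {status: round(100 * n / total, 1) for status, n in status_counts.items()}

for status, pct in shares.items():
    print(f"{status:<12} {status_counts[status]:>4} {pct:>5.1f}%")
```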

steering intensity vs success

| steering count | threads | resolved | success rate |
|---|---|---|---|
| 0 steers | 525 | 200 | 38.1% |
| 1-2 steers | 129 | 84 | 65.1% |
| 3-5 steers | 21 | 13 | 61.9% |
| 6+ steers | 3 | 1 | 33.3% |

key insight: moderate steering (1-2 interventions) correlates with the highest success rate (65.1%). zero steering underperforms significantly (38.1%), likely because it includes cases where the agent got stuck or went off-track without correction. heavy steering (6+) suggests fundamental confusion about the problem, though that bucket holds only 3 threads.
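
because the bucket sizes differ so much (525 threads vs 3), it helps to put an uncertainty range on each success rate. the sketch below computes an approximate 95% Wilson score interval per bucket. the choice of the Wilson interval is an assumption made here for illustration; the original analysis does not report intervals.

```python
import math

def wilson_interval(successes, n, z=1.96):
    """Approximate 95% Wilson score interval for a binomial proportion."""
    p = successes / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half, center + half

# (threads, resolved) per steering bucket, copied from the steering breakdown
buckets = {
    "0 steers": (525, 200),
    "1-2 steers": (129, 84),
    "3-5 steers": (21, 13),
    "6+ steers": (3, 1),
}

intervals = {}
for name, (threads, resolved) in buckets.items():
    low, high = wilson_interval(resolved, threads)
    intervals[name] = (low, high)
    print(f"{name:<11} {resolved / threads:6.1%}  95% CI [{low:.1%}, {high:.1%}]")
```

under this method the intervals for 0 steers and 1-2 steers do not overlap, while the 6+ interval spans most of the 0-100% range. the gap between zero and moderate steering is therefore better supported than any claim about heavy steering.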

keyword breakdown

| keyword | threads | success rate | avg turns | avg steers |
|---|---|---|---|---|
| bug | 42 | 69.0% | 76.3 | 0.69 |
| debug | 152 | 53.3% | 67.1 | 0.53 |
| fix | 484 | 38.8% | 47.9 | 0.32 |

insight: “bug” threads have the highest success rate, likely because they’re scoped investigations. “fix” threads are often ambiguous (“fix this”, “fix conflicts”) and underperform. specificity matters.
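
the report does not say how threads were assigned to a keyword. the three thread counts sum to 678, so each thread appears to land in exactly one group. one way to get that is a priority-ordered match on the thread title. the sketch below assumes that rule, the order `bug`, `debug`, `fix`, and matching on titles only; all three are hypothetical. it matches at a word boundary so that “debug” is not counted as “bug”.

```python
import re

# hypothetical priority order: the first matching keyword wins
KEYWORD_PATTERNS = [
    ("bug", re.compile(r"\bbug\w*", re.IGNORECASE)),
    ("debug", re.compile(r"\bdebug\w*", re.IGNORECASE)),
    # note: this also matches unrelated words such as "fixture"
    ("fix", re.compile(r"\bfix\w*", re.IGNORECASE)),
]

def classify_title(title):
    """Return the first keyword group whose pattern matches, or None."""
    for keyword, pattern in KEYWORD_PATTERNS:
        if pattern.search(title):
            return keyword
    return None

for title in ["Fix this", "Review diff and bug fixes", "Debug TestService registration error"]:
    print(f"{title!r} -> {classify_title(title)}")
```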

thread length vs outcome

| length | threads | success rate | avg steers |
|---|---|---|---|
| short (<20 turns) | 275 | 16.0% | 0.01 |
| medium (20-50) | 124 | 54.0% | 0.16 |
| long (51-100) | 156 | 62.8% | 0.52 |
| very long (100+) | 123 | 72.4% | 1.29 |

insight: longer threads correlate with higher success. short threads often represent abandoned attempts or simple queries that weren’t true debugging sessions.
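
a minimal sketch of the bucketing behind the length breakdown, assuming a thread record is just a `(turns, resolved)` pair. the function names are illustrative. the labels “51-100” and “100+” overlap at exactly 100 turns, so the sketch assumes such a thread counts as long.

```python
def length_bucket(turns):
    """Map a turn count to the length buckets used in this analysis.

    Assumption: exactly 100 turns counts as "long", because the labels
    "51-100" and "100+" overlap at 100.
    """
    if turns < 20:
        return "short"
    if turns <= 50:
        return "medium"
    if turns <= 100:
        return "long"
    return "very long"

def success_rate_by_bucket(threads):
    """threads: iterable of (turns, resolved) pairs; returns {bucket: rate}."""
    totals, wins = {}, {}
    for turns, resolved in threads:
        bucket = length_bucket(turns)
        totals[bucket] = totals.get(bucket, 0) + 1
        wins[bucket] = wins.get(bucket, 0) + (1 if resolved else 0)
    return {bucket: wins[bucket] / totals[bucket] for bucket in totals}
```

success here means a RESOLVED outcome, which matches how the success rates in this analysis add up to the 298 resolved threads.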

frustrated cases (3 total)

| thread | turns | steers |
|---|---|---|
| Debug sort_optimization panic with constant columns | 252 | 9 |
| Fix this | 124 | 2 |
| Debug TestService registration error | 133 | 2 |

common pattern: high-churn threads with unclear problem definitions.

high-steering threads (6+ steers)

| thread | steers | turns | outcome |
|---|---|---|---|
| Debug sort_optimization panic with constant columns | 9 | 252 | UNKNOWN |
| Review diff and bug fixes | 7 | 175 | RESOLVED |
| Investigating potential storage_optimizer brain code bug | 7 | 138 | UNKNOWN |

high-steering often correlates with exploratory debugging without clear repro steps.

outcome by status (avg metrics)

| status | avg turns | avg steers |
|---|---|---|
| RESOLVED | 81.2 | 0.55 |
| COMMITTED | 43.2 | 0.22 |
| HANDOFF | 37.4 | 0.16 |
| FRUSTRATED | 123.3 | 1.67 |
| UNKNOWN | 24.5 | 0.34 |

recommendations

  1. steer early, steer once: threads with 1-2 steering interventions resolve far more often than unsteered threads (65% vs 38%).
  2. scope before starting: “bug” threads succeed at 69% vs “fix” at 39%. specific problem framing matters.
  3. don’t abandon early: short threads (<20 turns) have 16% success. debugging needs persistence.
  4. watch for thrash: 6+ steers signals the agent is confused about the goal. consider reframing.
  5. avoid vague titles: “Fix this” threads underperform. clear problem statements improve outcomes.
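
recommendations 4 and 5 can be turned into a simple check. this is a sketch only: the 6-steer threshold comes from the analysis, but the function name and the rule that a title of two words or fewer counts as vague are hypothetical.

```python
THRASH_STEERS = 6          # threshold taken from the 6+ steers bucket
VAGUE_TITLE_MAX_WORDS = 2  # hypothetical cutoff, catches titles like "Fix this"

def triage_flags(title, steers):
    """Return a list of warnings for a thread, based on the recommendations."""
    flags = []
    if steers >= THRASH_STEERS:
        flags.append("thrash: consider reframing the problem")
    if len(title.split()) <= VAGUE_TITLE_MAX_WORDS:
        flags.append("vague title: state the specific problem")
    return flags

print(triage_flags("Fix this", steers=2))
print(triage_flags("Debug sort_optimization panic with constant columns", steers=9))
```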