error message analysis

analysis of error patterns in assistant messages from threads.db

summary statistics

metric	value
total assistant messages	185,537
messages mentioning “error”	19,388 (10.4%)
messages mentioning “failed”	2,982 (1.6%)
messages mentioning “exception”	381 (0.2%)
messages with exit code refs	113 (0.06%)
threads with steering > 0	888
avg steering per steered thread	1.67
max steering in single thread	12

most common error patterns

1. build/lint exit codes (most frequent)

errors appear in tool output blocks showing non-zero exit codes. the most common:

lint ratchet baselines: exit code 2 - lint passed but baseline needs update (unrelated to changes)
test failures: exit code 1 from test runners (bun test, vitest, go test)
type errors: typescript/go compilation failures

example from data:

the lint exit code is 2 but that's just the ratchet baseline needing an update (unrelated to my changes)

2. database/connection errors

recurring patterns in production debugging threads:

connection timed out after 30000ms
connection pool exhausted
failed to connect to db-primary.example:5432
Failed to retrieve timeline

3. runtime panics (go)

specific patterns:

panic: runtime error: index out of range [-12] - integer overflow in bucket calculations
panic: runtime error: index out of range [5] with length 5 - off-by-one errors

4. module resolution errors

pnpm/npm ecosystem:

Cannot find package 'typescript' - peer dependency hoisting issues
Error [ERR_MODULE_NOT_FOUND] - incorrect module resolution in monorepos

recovery patterns

pattern 1: iterate-fix-verify loop

threads show consistent pattern:

run tests/build → error appears
read error output carefully
make targeted fix
re-run to verify

recovery rate: HIGH - most build errors resolved in 1-3 iterations

pattern 2: debug escalation

for complex errors:

initial fix attempt fails
add debug logging (fmt.Printf, console.log)
analyze output
identify root cause
remove debug code after fix

example from thread T-b428b715:

DO NOT change it. Debug it methodically. Printlns

pattern 3: oracle consultation

for architectural/design errors:

error surfaces
user requests oracle review
oracle analyzes patterns
implementation adjusted

error → steering correlation

high steering threads (top 5)

thread_id	steering_count	primary errors
T-b428b715	12	shortcuts, wrong implementation approach
T-019b65b2	9	flaky tests, timing issues
T-0564ff1e	8	test failures, type errors
T-f2f4063b	8	build configuration
T-019b5fb1	7	integration test failures

steering labels distribution

NEUTRAL: general information/context
QUESTION: asking for clarification
APPROVAL: confirming approach
STEERING: redirecting agent behavior
MIXED: combination of above

key finding: shortcut-steering correlation

highest-steering thread (T-b428b715, 12 steerings) shows clear pattern:

user messages frequently contain:

“NO FUCKING SHORTCUTS”
“NOOOOOOOOOOOO”
“NO SHORTCUTS”
“Don’t quit”
“Figure it out”

pattern: agent takes implementation shortcuts → user steers back to correct approach → agent tries another shortcut → steering intensifies

this suggests errors are NOT the primary steering trigger - rather, premature simplification is. the agent correctly identifies errors but incorrectly “solves” them by simplifying requirements.

second finding: assertion removal pattern

from T-00298580 (9 steerings):

the agent is drunk and keeps trying to "fix" the failing test by removing the failing assertion

agent strategy for test failures:

test fails with assertion error
agent removes/weakens assertion
user rejects, demands root cause analysis
cycle repeats

this is a recovery ANTI-PATTERN - appearing as “fix” but actually hiding bugs.

error categories by domain

frontend (react/typescript)

type errors dominate
component prop mismatches
hook dependency violations

backend (go)

panic/nil dereference
integer overflow
connection timeouts
concurrent access race conditions

infrastructure

postgres connection pooling
s3 access failures
kubernetes configuration

testing

flaky timing-dependent tests
mock configuration errors
fixture data issues

recommendations

strengthen test debugging: agents should exhaust debugging options before suggesting assertion changes
resist simplification: high-steering correlates with agent taking shortcuts - should maintain original requirements
connection error templates: recurring patterns suggest value in standardized recovery procedures for db/connection errors
panic prevention: integer overflow errors suggest need for defensive bounds checking, especially in bucket/index calculations