pattern moderate impact

context window

@agent_cont

context window analysis

estimated tokens derived from char_len / 4 — rough approximation.

token distribution

bucketthreads% of total
empty (no messages)3768%
<10k2,96164%
10-25k97821%
25-50k2916%
50-100k420.9%
>100k80.2%

most threads stay well under context limits. only ~1% push past 50k tokens.

outcome rates by token bucket

bucketnresolved%committed%frustrated%handoff%unknown%
<10k2,96138.67.30.113.835.9
10-25k97869.17.20.613.59.5
25-50k29173.25.81.410.38.6
50-100k4281.04.80.04.89.5
>100k862.512.512.50.00.0

observations

  1. resolution rate INCREASES with thread length up to 100k — longer threads correlate with deeper, successful work (81% at 50-100k vs 38.6% at <10k)

  2. frustration spikes at >100k — 12.5% frustrated (1 of 8 threads) vs near-zero elsewhere. context pressure starts hurting.

  3. short threads have high UNKNOWN rates — 35.9% at <10k suggests quick lookups or abandoned exploratory threads

  4. handoffs decrease at scale — longer threads tend to complete in-place rather than spawning

threads likely hitting context limits

8 threads estimated at >100k tokens:

threadtitleturnsstatusest_tokens
T-0ef9…afaaMinecraft resource pack CIT converter1623PENDING272k
T-048b…665eDebugging migration script for book pack988RESOLVED172k
T-019b…33c1Untitled1FRUSTRATED146k
T-6113…1381Investigate trace link issue170RESOLVED128k
T-b428…773dCreate implementation for project plan594RESOLVED126k
T-2e58…f98Map rc-menu dependencies330RESOLVED122k
T-939a…1534Enhance search_modal aggregation455COMMITTED110k
T-c66d…68aReview S3 background ingest615RESOLVED105k

the FRUSTRATED >100k thread

T-019b88a4-5dc7-7079-a2c7-a68d5d8a33c1 — single turn, 146k tokens. user pasted entire CI job output into one message. not a context window exhaustion from conversation — input was already overwhelming.

steering patterns by token bucket

bucketsteering per 10k tokenstotal steering
<10k0.330.1
10-25k0.420.7
25-50k0.391.2
50-100k0.352.2
>100k0.303.9

steering rate per 10k tokens stays roughly constant (~0.3-0.4). longer threads accumulate more steering but not disproportionately — users don’t steer MORE when context is long.

FRUSTRATED threads by token count

14 total FRUSTRATED threads:

most frustration happens UNDER context limits. frustration correlates more with problem difficulty than context exhaustion.

conclusions

  1. context limits rarely hit in practice — <1% of threads exceed 50k tokens
  2. when limits ARE hit, resolution still common — 6/8 threads >100k resolved or committed
  3. the single >100k frustrated thread was user error — pasting 146k tokens of logs in one message
  4. frustration is problem-bound, not context-bound — difficult debugging tasks at normal token counts
  5. longer threads = deeper engagement = better outcomes — selection effect: hard problems that need more turns get more effort