Stephen Reid (@stephenreid)

Make money doing the work you believe in

If we accept the idea that there’s a threshold level of capability beyond which agentic coding models are all ‘good enough’, and that we’ve crossed it (~Opus 4.5 level), then what becomes interesting is cost vs. speed (here shown as tasks per dollar vs. time per task, so the upper left of the chart is most desirable).

On this basis, Cursor’s Composer 2 is an extreme outlier. It’s about as fast as any other agent (or, in my experience, even faster with Fast mode turned on in Cursor), yet for a dollar you can complete 14 tasks, compared to around 0.5-1 tasks per dollar for most frontier models, i.e. Composer is ~15-30x cheaper per task.
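
A quick sanity check on that ratio, as a minimal sketch using only the figures quoted above. The constant and function names are illustrative and are not taken from the benchmark. It converts tasks per dollar into dollars per task and shows where the rounded "~15-30x" comes from:

```python
# Figures quoted in the post (tasks completed per dollar).
COMPOSER_TASKS_PER_DOLLAR = 14.0
FRONTIER_TASKS_PER_DOLLAR_RANGE = (0.5, 1.0)  # "around 0.5-1" for most frontier models


def cost_per_task(tasks_per_dollar: float) -> float:
    """Dollars spent per completed task."""
    return 1.0 / tasks_per_dollar


def times_cheaper(tasks_per_dollar: float, baseline_tasks_per_dollar: float) -> float:
    """How many times cheaper per task than the baseline.

    The cost ratio (1/baseline) / (1/tasks) simplifies to tasks / baseline.
    """
    return tasks_per_dollar / baseline_tasks_per_dollar


frontier_low, frontier_high = FRONTIER_TASKS_PER_DOLLAR_RANGE
ratio_min = times_cheaper(COMPOSER_TASKS_PER_DOLLAR, frontier_high)  # vs. cheapest frontier case
ratio_max = times_cheaper(COMPOSER_TASKS_PER_DOLLAR, frontier_low)   # vs. priciest frontier case

print(f"Composer 2: ~${cost_per_task(COMPOSER_TASKS_PER_DOLLAR):.2f} per task")
print(f"Frontier:   ${cost_per_task(frontier_high):.2f} to ${cost_per_task(frontier_low):.2f} per task")
print(f"Ratio:      {ratio_min:.0f}x to {ratio_max:.0f}x cheaper per task")
```

With these inputs the ratio works out to 14x to 28x, which the post rounds to ~15-30x.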

Chart from stephenreid.net/agents, data from the new Artificial Analysis Coding Agent Benchmarks artificialanalysis.ai/a…

May 12

8:21 AM
