If we accept the idea that there’s a threshold level of capability beyond which agentic coding models are all ‘good enough’, and that we’ve crossed it (~Opus 4.5 level), then what becomes interesting is cost v speed (here shown as tasks per dollar v time per task, so upper left is most desirable).
On this basis, Cursor’s Composer 2 is an extreme outlier. It’s ~as fast as any other agent (or in my experience, even faster with Fast mode turned on in Cursor), yet for a dollar you can complete 14 tasks (compared to around 0.5-1 tasks per dollar for most frontier models i.e. Composer is ~15-30x cheaper).