Dr Milan Milanović (@techworldwithmilan)

𝗗𝗼𝗲𝘀 𝗔𝗜-𝗣𝗼𝘄𝗲𝗿𝗲𝗱 𝗖𝗼𝗱𝗶𝗻𝗴 𝗧𝗿𝗮𝗱𝗲 𝗦𝗽𝗲𝗲𝗱 𝗳𝗼𝗿 𝗧𝗲𝗰𝗵𝗻𝗶𝗰𝗮𝗹 𝗗𝗲𝗯𝘁?

Developers report 10x productivity gains from AI coding agents, yet a Carnegie Mellon study of 806 open-source GitHub repositories found something different.

Researchers compared Cursor-adopting projects against 1,380 matched control repositories, tracking code output and quality monthly using SonarQube.

Here are the key findings:

𝟭. 𝗧𝗵𝗲 𝘃𝗲𝗹𝗼𝗰𝗶𝘁𝘆 𝗯𝗼𝗼𝘀𝘁 𝗶𝘀 𝗿𝗲𝗮𝗹 𝗯𝘂𝘁 𝗱𝗶𝘀𝗮𝗽𝗽𝗲𝗮𝗿𝘀 𝗳𝗮𝘀𝘁

Projects saw a 𝟮𝟴𝟭% 𝗶𝗻𝗰𝗿𝗲𝗮𝘀𝗲 𝗶𝗻 𝗹𝗶𝗻𝗲𝘀 𝗮𝗱𝗱𝗲𝗱 and a 𝟱𝟱% 𝗶𝗻𝗰𝗿𝗲𝗮𝘀𝗲 𝗶𝗻 𝗰𝗼𝗺𝗺𝗶𝘁𝘀 during the first month after Cursor adoption. By month three, both metrics dropped back to pre-Cursor levels. The spike looks great on a dashboard. It just doesn't last.

𝟮. 𝗧𝗲𝗰𝗵𝗻𝗶𝗰𝗮𝗹 𝗱𝗲𝗯𝘁 𝗮𝗰𝗰𝘂𝗺𝘂𝗹𝗮𝘁𝗲𝘀 𝗮𝗻𝗱 𝘀𝘁𝗮𝘆𝘀

Static analysis warnings rose by 𝟯𝟬% and code complexity increased by 𝟰𝟭% on average. Unlike the velocity spike, this decline in quality persisted.

𝟯. 𝗧𝗵𝗮𝘁 𝗱𝗲𝗯𝘁 𝗰𝗿𝗲𝗮𝘁𝗲𝘀 𝗮 𝘀𝗲𝗹𝗳-𝗿𝗲𝗶𝗻𝗳𝗼𝗿𝗰𝗶𝗻𝗴 𝘀𝗹𝗼𝘄𝗱𝗼𝘄𝗻

The researchers found a feedback loop between quality and velocity. A 𝟭𝟬𝟬% 𝗶𝗻𝗰𝗿𝗲𝗮𝘀𝗲 𝗶𝗻 𝗰𝗼𝗱𝗲 𝗰𝗼𝗺𝗽𝗹𝗲𝘅𝗶𝘁𝘆 caused a 𝟲𝟰.𝟱% 𝗱𝗲𝗰𝗿𝗲𝗮𝘀𝗲 in future development velocity. A 𝟭𝟬𝟬% 𝗶𝗻𝗰𝗿𝗲𝗮𝘀𝗲 𝗶𝗻 𝘀𝘁𝗮𝘁𝗶𝗰 𝗮𝗻𝗮𝗹𝘆𝘀𝗶𝘀 𝘄𝗮𝗿𝗻𝗶𝗻𝗴𝘀 caused a 𝟱𝟬.𝟯% 𝗱𝗿𝗼𝗽 in lines added. The two-month speed boost generates enough technical debt to drag down productivity for months afterward.

𝟰. 𝗔𝗜 𝘄𝗿𝗶𝘁𝗲𝘀 𝗺𝗼𝗿𝗲 𝗰𝗼𝗺𝗽𝗹𝗲𝘅 𝗰𝗼𝗱𝗲 𝘁𝗵𝗮𝗻 𝗵𝘂𝗺𝗮𝗻𝘀

Regardless of codebase size, Cursor-adopting projects had 𝟵% 𝗵𝗶𝗴𝗵𝗲𝗿 𝗰𝗼𝗱𝗲 𝗰𝗼𝗺𝗽𝗹𝗲𝘅𝗶𝘁𝘆 than comparable projects producing the same volume of code, which makes them harder to maintain.
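
To make the percentages in the findings above easier to compare, here is a minimal sketch that converts each reported change into a multiplier over the baseline. The function name and dictionary labels are illustrative and do not come from the paper; the figures are the ones quoted in this post.

```python
def pct_change_to_multiplier(pct_change: float) -> float:
    """Convert a percentage change (e.g. +281 or -64.5) into a multiplier over the baseline."""
    return 1.0 + pct_change / 100.0


# Percent changes as quoted in the post (labels are illustrative).
reported = {
    "lines added, month 1": 281.0,
    "commits, month 1": 55.0,
    "static analysis warnings": 30.0,
    "code complexity": 41.0,
    "velocity after complexity doubles": -64.5,
    "lines added after warnings double": -50.3,
}

multipliers = {
    name: round(pct_change_to_multiplier(pct), 3)
    for name, pct in reported.items()
}

for name, m in multipliers.items():
    print(f"{name}: {m:.3f}x baseline")
```

Read this way, a 281% increase is roughly 3.8x the baseline rather than 2.8x, and a 64.5% decrease leaves about 0.36x of the original velocity.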

QA has to keep up with the higher output. Teams that adopt agentic coding tools without upgrading their processes are borrowing speed from the future.

The paper even suggests tools should consider "self-throttling," reducing suggestion volume when project complexity crosses healthy thresholds.
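
One way to picture self-throttling is a suggestion budget that shrinks as a complexity metric climbs past a threshold. The sketch below is a hypothetical illustration, not the paper's design or any tool's actual behavior: the function name, the thresholds, and the linear ramp are all assumptions.

```python
def suggestion_budget(
    complexity: float,
    healthy_limit: float = 15.0,   # assumed threshold, not from the paper
    hard_limit: float = 30.0,      # assumed threshold, not from the paper
    max_suggestions: int = 10,
    min_suggestions: int = 1,
) -> int:
    """Return how many suggestions to offer for a given complexity score.

    Below healthy_limit the full budget applies; above hard_limit only the
    minimum is offered; in between the budget ramps down linearly.
    """
    if hard_limit <= healthy_limit:
        raise ValueError("hard_limit must be greater than healthy_limit")
    if complexity <= healthy_limit:
        return max_suggestions
    if complexity >= hard_limit:
        return min_suggestions
    # Fraction of the way from the healthy limit to the hard limit.
    overshoot = (complexity - healthy_limit) / (hard_limit - healthy_limit)
    budget = max_suggestions - overshoot * (max_suggestions - min_suggestions)
    return max(min_suggestions, round(budget))


for score in (10.0, 20.0, 30.0):
    print(score, suggestion_budget(score))
```

A linear ramp is only the simplest choice here. A real tool would need to decide which complexity metric to track and how to calibrate the limits per project.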

𝗟𝗶𝗻𝗲𝘀 𝗼𝗳 𝗰𝗼𝗱𝗲 𝗽𝗿𝗼𝗱𝘂𝗰𝗲𝗱 𝗶𝘀 𝗻𝗼𝘁 𝘁𝗵𝗲 𝘀𝗮𝗺𝗲 𝗮𝘀 𝗽𝗿𝗼𝗴𝗿𝗲𝘀𝘀 𝗺𝗮𝗱𝗲

What processes has your team put in place to manage code quality alongside AI coding tools?

Source: arxiv.org/abs/2511.04427

Apr 6

6:51 AM
