๐๐ผ๐ฒ๐ ๐๐-๐ฃ๐ผ๐๐ฒ๐ฟ๐ฒ๐ฑ ๐๐ผ๐ฑ๐ถ๐ป๐ด ๐ง๐ฟ๐ฎ๐ฑ๐ฒ ๐ฆ๐ฝ๐ฒ๐ฒ๐ฑ ๐ณ๐ผ๐ฟ ๐ง๐ฒ๐ฐ๐ต๐ป๐ถ๐ฐ๐ฎ๐น ๐๐ฒ๐ฏ๐?
Developers report 10x productivity gains from AI coding agents, yet a Carnegie Mellon study of 806 open-source GitHub repositories found something different.
Researchers compared Cursor-adopting projects against 1,380 matched control repositories, tracking code output and quality monthly using SonarQube.
Here are the key findings:
๐ญ. ๐ง๐ต๐ฒ ๐๐ฒ๐น๐ผ๐ฐ๐ถ๐๐ ๐ฏ๐ผ๐ผ๐๐ ๐ถ๐ ๐ฟ๐ฒ๐ฎ๐น ๐ฏ๐๐ ๐ฑ๐ถ๐๐ฎ๐ฝ๐ฝ๐ฒ๐ฎ๐ฟ๐ ๐ณ๐ฎ๐๐
Projects saw a ๐ฎ๐ด๐ญ% ๐ถ๐ป๐ฐ๐ฟ๐ฒ๐ฎ๐๐ฒ ๐ถ๐ป ๐น๐ถ๐ป๐ฒ๐ ๐ฎ๐ฑ๐ฑ๐ฒ๐ฑ and a ๐ฑ๐ฑ% ๐ถ๐ป๐ฐ๐ฟ๐ฒ๐ฎ๐๐ฒ ๐ถ๐ป ๐ฐ๐ผ๐บ๐บ๐ถ๐๐ during the first month after Cursor adoption. By month three, both metrics dropped back to pre-Cursor levels. The spike looks great on a dashboard. It just doesn't last.
๐ฎ. ๐ง๐ฒ๐ฐ๐ต๐ป๐ถ๐ฐ๐ฎ๐น ๐ฑ๐ฒ๐ฏ๐ ๐ฎ๐ฐ๐ฐ๐๐บ๐๐น๐ฎ๐๐ฒ๐ ๐ฎ๐ป๐ฑ ๐๐๐ฎ๐๐
Static analysis warnings rose by ๐ฏ๐ฌ% and code complexity increased by ๐ฐ๐ญ% on average. This decline in quality was persistent in the project.
๐ฏ. ๐ง๐ต๐ฎ๐ ๐ฑ๐ฒ๐ฏ๐ ๐ฐ๐ฟ๐ฒ๐ฎ๐๐ฒ๐ ๐ฎ ๐๐ฒ๐น๐ณ-๐ฟ๐ฒ๐ถ๐ป๐ณ๐ผ๐ฟ๐ฐ๐ถ๐ป๐ด ๐๐น๐ผ๐๐ฑ๐ผ๐๐ป
The researchers found a feedback loop between quality and velocity. A ๐ญ๐ฌ๐ฌ% ๐ถ๐ป๐ฐ๐ฟ๐ฒ๐ฎ๐๐ฒ ๐ถ๐ป ๐ฐ๐ผ๐ฑ๐ฒ ๐ฐ๐ผ๐บ๐ฝ๐น๐ฒ๐
๐ถ๐๐ caused a ๐ฒ๐ฐ.๐ฑ% ๐ฑ๐ฒ๐ฐ๐ฟ๐ฒ๐ฎ๐๐ฒ in future development velocity. A ๐ญ๐ฌ๐ฌ% ๐ถ๐ป๐ฐ๐ฟ๐ฒ๐ฎ๐๐ฒ ๐ถ๐ป ๐๐๐ฎ๐๐ถ๐ฐ ๐ฎ๐ป๐ฎ๐น๐๐๐ถ๐ ๐๐ฎ๐ฟ๐ป๐ถ๐ป๐ด๐ caused a ๐ฑ๐ฌ.๐ฏ% ๐ฑ๐ฟ๐ผ๐ฝ in lines added. The two-month speed boost generates enough technical debt to drag down productivity for months afterward.
๐ฐ. ๐๐ ๐๐ฟ๐ถ๐๐ฒ๐ ๐บ๐ผ๐ฟ๐ฒ ๐ฐ๐ผ๐บ๐ฝ๐น๐ฒ๐
๐ฐ๐ผ๐ฑ๐ฒ ๐๐ต๐ฎ๐ป ๐ต๐๐บ๐ฎ๐ป๐
Regardless of the codebase's size, Cursor-adopting projects still had ๐ต% ๐ต๐ถ๐ด๐ต๐ฒ๐ฟ ๐ฐ๐ผ๐ฑ๐ฒ ๐ฐ๐ผ๐บ๐ฝ๐น๐ฒ๐
๐ถ๐๐ than comparable projects producing the same volume of code. This means that such projects are harder to maintain.
QA has to keep up with higher output. We can say that teams adopting agentic coding tools without upgrading their processes are borrowing speed from the future.
The paper even suggests tools should consider "self-throttling," reducing suggestion volume when project complexity crosses healthy thresholds.
๐๐ถ๐ป๐ฒ๐ ๐ผ๐ณ ๐ฐ๐ผ๐ฑ๐ฒ ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐ฒ๐ฑ ๐ถ๐ ๐ป๐ผ๐ ๐๐ต๐ฒ ๐๐ฎ๐บ๐ฒ ๐ฎ๐ ๐ฝ๐ฟ๐ผ๐ด๐ฟ๐ฒ๐๐ ๐บ๐ฎ๐ฑ๐ฒ
What processes has your team put in place to manage code quality alongside AI coding tools?
Source: arxiv.org/abs/2511.04427