Researchers seeing little evidence of benefit from co pilots
A study by Uplevel found that AI coding assistants like GitHub Copilot do not significantly improve developer productivity and may increase bugs, with mixed results across different companies.
Read original articleA recent study by Uplevel has revealed that developers are not experiencing significant productivity gains from AI coding assistants like GitHub Copilot. The study analyzed the performance of around 800 developers over six months and found no notable improvements in key programming metrics such as pull request cycle time and throughput. In fact, the use of Copilot was associated with a 41% increase in bugs. While some developers report feeling more productive, the study suggests that many are spending more time reviewing code generated by AI rather than writing it. Additionally, the study indicated that AI tools have not alleviated developer burnout, as those using Copilot did not see a reduction in after-hours work compared to those who did not use it. Mixed results were reported across different companies; while some, like Innovative Solutions, noted significant productivity increases, others, such as Gehtsoft USA, found AI-generated code to be challenging to debug and often preferred rewriting code from scratch. The findings highlight the need for tempered expectations regarding the capabilities of AI coding assistants, emphasizing that they are not a replacement for human developers but rather tools that can enhance certain aspects of the coding process.
- A study found no significant productivity gains from AI coding assistants like GitHub Copilot.
- The use of Copilot was linked to a 41% increase in bugs in code.
- Developers reported spending more time reviewing AI-generated code rather than writing it.
- Mixed results were observed across different companies regarding the effectiveness of AI coding tools.
- Expectations about AI coding assistants should be moderated; they are not replacements for human developers.
Related
Ask HN: Am I using AI wrong for code?
The author is concerned about underutilizing AI tools for coding, primarily using Claude for brainstorming and small code snippets, while seeking recommendations for tools that enhance coding productivity and collaboration.
Ask HN: Will AI make us unemployed?
The author highlights reliance on AI tools like ChatGPT and GitHub Copilot, noting a 30% efficiency boost and concerns about potential job loss due to AI's increasing coding capabilities.
GitHub Copilot – Lessons
Siddharth discusses GitHub Copilot's strengths in pair programming and learning new languages, but notes its limitations with complex tasks, verbosity, and potential impact on problem-solving skills among new programmers.
Effects of Gen AI on High Skilled Work: Experiments with Software Developers
A study on generative AI's impact on software developers revealed a 26.08% productivity increase, particularly benefiting less experienced developers, through trials at Microsoft, Accenture, and a Fortune 100 company.
Why Copilot Is Making Programmers Worse at Programming
AI-driven coding tools like Copilot may enhance productivity but risk eroding fundamental programming skills, fostering dependency, reducing learning opportunities, isolating developers, and creating a false sense of expertise.
- Many users report mixed experiences, with some finding AI tools helpful for repetitive tasks while others encounter issues with incorrect code suggestions.
- Several commenters note an increase in bugs and incidents correlated with the use of AI tools, attributing this to misplaced trust in the technology.
- Some users emphasize the importance of verification and caution when using AI-generated code, likening it to working with human collaborators.
- There is a divide in opinions on productivity, with some users claiming significant boosts while others feel the tools complicate their workflow.
- Comments highlight the need for proper usage and understanding of AI tools, suggesting they are most effective when used as aids rather than replacements for human coding skills.
The best I can say is that the implementation in Jetbrains IntelliJ IDEA is pretty good. It's basically only useful for some repetitive Java boilerplate, but actually that's perfect, it's mindless enough yet easy to validate. It makes me dislike programming in Java a little bit less.
Urgh. I can't wait to retire.
I can’t say if the bug count is higher or not. Maybe it is higher in terms of total number of bugs I write throughout my coding session. But if bug count goes up 10% then the speed with which I fix those bugs and get to a final edit of my code is 30% or 40% faster, so the bug count is not the right metric.
Maybe the differentiator is that I am a solo-dev for all this work, and so the negative effects of the copilot are only experienced by me. If I were in a 10 person team, the bugs and the weird out of context code snippets would be magnified by the 9 other people, and the negative effects would be strong. But I don’t know.
Additionally, I find the copilot code suggestions during code reviews / pull requests sometimes useful. At times, it can offer some insightful bits about a code segment, such as potential exception handling fixes, etc.
I'd like to explore having copilot write unit tests, including representative test data, that can execute edge code paths. I haven't done this yet, but this seems exactly the type of thing that a "copilot" would do for me (not too unlike paired-programming, maybe).
Having a copilot completely write my code base, that's another thing entirely. There would be too much going back and verifying that it got it right. And additionally, I've seen it completely conjure up bogus solutions as well. For example, I've had copilot offer a configuration change that was completely fabricated; it looked legitimate enough that a senior systems engineer attempted to install/deliver the "fix" it offered when the suggestion was completely made up.
Overall, I guess my experience with copilot is not much different than working with any human. Trust but verify.
Much like how a nailgun won't just magically build you a house, it'll just let you build one quicker.
I get great benefit out of llms for coding. I'm not a good coder. But I am decent at planning and understanding what I want. Llms get me there 100x quicker than not using them. I don't need 4 years of cs to learn all the tedious algos for sorting or searching. I just ask an ai for a bunch of examples, assess them for what they are and get on with it. It can tell me the common pros and cons of it all and much like any other decision in business I make my best judgement and go with it.Need to sort a heap of data into x y z or convert it from x to y? Llm will show me the way, now I don't need to hire someone to do it for me.
But alas, so many seem to think a language interpretation tool is actually a do it all one stop shop of production. Pebkac, your using the tool wrong.
No it does not. Does an assistant have to be as qualified as their boss?
> “The LLM does not possess critical thinking, self-awareness, or the ability to think.”
This is completely irrelevant. The LLM can understand your instructions and it can type 30,000 times faster than you.
Related
Ask HN: Am I using AI wrong for code?
The author is concerned about underutilizing AI tools for coding, primarily using Claude for brainstorming and small code snippets, while seeking recommendations for tools that enhance coding productivity and collaboration.
Ask HN: Will AI make us unemployed?
The author highlights reliance on AI tools like ChatGPT and GitHub Copilot, noting a 30% efficiency boost and concerns about potential job loss due to AI's increasing coding capabilities.
GitHub Copilot – Lessons
Siddharth discusses GitHub Copilot's strengths in pair programming and learning new languages, but notes its limitations with complex tasks, verbosity, and potential impact on problem-solving skills among new programmers.
Effects of Gen AI on High Skilled Work: Experiments with Software Developers
A study on generative AI's impact on software developers revealed a 26.08% productivity increase, particularly benefiting less experienced developers, through trials at Microsoft, Accenture, and a Fortune 100 company.
Why Copilot Is Making Programmers Worse at Programming
AI-driven coding tools like Copilot may enhance productivity but risk eroding fundamental programming skills, fostering dependency, reducing learning opportunities, isolating developers, and creating a false sense of expertise.