<  Back to the Polytechnique Montréal portal

LLMs and Stack Overflow discussions: reliability, impact, and challenges

Leuson Mario Pedro Da Silva, Jordan Samhi and Foutse Khomh

Article (2025)

Open Acess document in PolyPublie and at official publisher
[img]
Preview
Open Access to the full text of this document
Published Version
Terms of Use: Creative Commons Attribution
Download (2MB)
Show abstract
Hide abstract

Abstract

Since its release in November 2022, ChatGPT has shaken up Stack Overflow, the premier platform for developers’ queries on programming and software development. Demonstrating an ability to generate instant, human-like responses to technical questions, ChatGPT has ignited debates within the developer community about the evolving role of human-driven platforms in the age of generative AI. Two months after ChatGPT’s release, Meta released its answer with its own Large Language Model (LLM) called LLaMA: the race was on. We conducted an empirical study analyzing questions from Stack Overflow and using these LLMs to address them. This way, we aim to quantify the reliability of LLMs’ answers and their potential to replace Stack Overflow in the long term; identify and understand why LLMs fail; measure users’ activity evolution with Stack Overflow over time; and compare LLMs together. Our empirical results are unequivocal: ChatGPT and LLaMA challenge human expertise, yet do not outperform it for some domains, while a significant decline in user posting activity has been observed. Furthermore, we also discuss the impact of our findings regarding the usage and development of new LLMs and provide guidelines for future challenges faced by users and researchers.

Uncontrolled Keywords

Supplementary Material:
Department: Department of Computer Engineering and Software Engineering
Funders: FRQ, NSERC, CIFAR, Canada Research Chairs Program
PolyPublie URL: https://publications.polymtl.ca/66641/
Journal Title: Journal of Systems and Software (vol. 230)
Publisher: Elsevier BV
DOI: 10.1016/j.jss.2025.112541
Official URL: https://doi.org/10.1016/j.jss.2025.112541
Date Deposited: 22 Jul 2025 12:29
Last Modified: 03 Feb 2026 19:47
Cite in APA 7: Da Silva, L. M. P., Samhi, J., & Khomh, F. (2025). LLMs and Stack Overflow discussions: reliability, impact, and challenges. Journal of Systems and Software, 230, 112541 (21 pages). https://doi.org/10.1016/j.jss.2025.112541

Statistics

Total downloads

Downloads per month in the last year

Origin of downloads

Dimensions

Repository Staff Only

View Item View Item