Hadhemi Jebnoun, Houssem Ben Braiek, Mohammad Masudur Rahman et Foutse Khomh
Communication écrite (2020)
Un lien externe est disponible pour ce documentAbstract
Deep learning practitioners are often interested in improving their model accuracy rather than the interpretability of their models. As a result, deep learning applications are inherently complex in their structures. They also need to continuously evolve in terms of code changes and model updates. Given these confounding factors, there is a great chance of violating the recommended programming practices by the developers in their deep learning applications. In particular, the code quality might be negatively affected due to their drive for the higher model performance. Unfortunately, the code quality of deep learning applications has rarely been studied to date. In this paper, we conduct an empirical study to investigate the distribution of code smells in deep learning applications. To this end, we perform a comparative analysis between deep learning and traditional open-source applications collected from GitHub. We have several major findings. First, long lambda expression, long ternary conditional expression, and complex container comprehension smells are frequently found in deep learning projects. That is, deep learning code involves more complex or longer expressions than the traditional code does. Second, the number of code smells increases across the releases of deep learning applications. Third, we found that there is a co-existence between code smells and software bugs in the studied deep learning code, which confirms our conjecture on the degraded code quality of deep learning applications.
Mots clés
deep learning; code smells; code quality
Sujet(s): |
2700 Technologie de l'information > 2705 Logiciels et développement 2700 Technologie de l'information > 2706 Génie logiciel |
---|---|
Département: | Département de génie informatique et génie logiciel |
Organismes subventionnaires: | GRSNG / NSERC, Fonds de recherche du Québec (FRQ) |
URL de PolyPublie: | https://publications.polymtl.ca/9346/ |
Nom de la conférence: | 17th International Conference on Mining Software Repositories (MSR 2020) |
Lieu de la conférence: | Seoul, Republic of Korea |
Date(s) de la conférence: | 2020-06-29 - 2020-06-30 |
Maison d'édition: | ACM |
DOI: | 10.1145/3379597.3387479 |
URL officielle: | https://doi.org/10.1145/3379597.3387479 |
Date du dépôt: | 07 sept. 2023 09:56 |
Dernière modification: | 25 sept. 2024 15:45 |
Citer en APA 7: | Jebnoun, H., Ben Braiek, H., Rahman, M. M., & Khomh, F. (juin 2020). The scent of deep learning code : an empirical study [Communication écrite]. 17th International Conference on Mining Software Repositories (MSR 2020), Seoul, Republic of Korea. https://doi.org/10.1145/3379597.3387479 |
---|---|
Statistiques
Dimensions