On Codex prompt engineering for OCL generation: an empirical study

Seif Abukhalaf, Mohammad Hamdaqa et Foutse Khomh

Communication écrite (2023)

Un lien externe est disponible pour ce document

Abstract

The Object Constraint Language (OCL) is a declarative language that adds constraints and object query expressions to Meta-Object Facility (MOF) models. OCL can provide precision and conciseness to UML models. Nevertheless, the unfamiliar syntax of OCL has hindered its adoption by software practitioners. LLMs, such as GPT-3, have made significant progress in many NLP tasks, such as text generation and semantic parsing. Similarly, researchers have improved on the downstream tasks by fine-tuning LLMs for the target task. Codex, a GPT-3 descendant by OpenAI, has been fine-tuned on publicly available code from GitHub and has proven the ability to generate code in many programming languages, powering the AI-pair programmer Copilot. One way to take advantage of Codex is to engineer prompts for the target downstream task. In this paper, we investigate the reliability of the OCL constraints generated by Codex from natural language specifications. To achieve this, we compiled a dataset of 15 UML models and 168 specifications from various educational resources. We manually crafted a prompt template with slots to populate with the UML information and the target task in the prefix format to complete the template with the generated OCL constraint. We used both zero- and few-shot learning methods in the experiments. The evaluation is reported by measuring the syntactic validity and the execution accuracy metrics of the generated OCL constraints. Moreover, to get insight into how close or natural the generated OCL constraints are compared to human-written ones, we measured the cosine similarity between the sentence embedding of the correctly generated and human-written OCL constraints. Our findings suggest that by enriching the prompts with the UML information of the models and enabling few-shot learning, the reliability of the generated OCL constraints increases. Furthermore, the results reveal a close similarity based on sentence embedding between the generated OCL constraints and the human-written ones in the ground truth, implying a level of clarity and understandability in the generated OCL constraints by Codex.

Matériel d'accompagnement:	Ensemble de données
Département:	Département de génie informatique et génie logiciel
Centre de recherche:	Autre
Organismes subventionnaires:	FRQ, CIFAR, NSERC
ISBN:	9798350311846
URL de PolyPublie:	https://publications.polymtl.ca/54573/
Nom de la conférence:	2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR 2023)
Lieu de la conférence:	Melbourne, Australia
Date(s) de la conférence:	2023-05-15 - 2023-05-16
Maison d'édition:	IEEE
DOI:	10.1109/msr59073.2023.00033
URL officielle:	https://doi.org/10.1109/msr59073.2023.00033
Date du dépôt:	31 août 2023 14:19
Dernière modification:	03 févr. 2026 14:49

Citer en APA 7:	Abukhalaf, S., Hamdaqa, M., & Khomh, F. (mai 2023). On Codex prompt engineering for OCL generation: an empirical study [Communication écrite]. 2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR 2023), Melbourne, Australia. https://doi.org/10.1109/msr59073.2023.00033

Statistiques

Dimensions

Actions réservées au personnel

Afficher document