Louis Clouatre, Philippe Trempe, Amal Zouaq and Sarath Chandar Anbil Parthipan
Paper (2021)
Open Acess document in PolyPublie and at official publisher |
|
Open Access to the full text of this document Published Version Terms of Use: Creative Commons Attribution Download (474kB) |
Abstract
Knowledge Bases (KBs) are easy to query, verifiable, and interpretable. They however scale with man-hours and high-quality data. Masked Language Models (MLMs), such as BERT, scale with computing power as well as unstructured raw text data. The knowledge contained within these models is however not directly interpretable. We propose to perform link prediction with MLMs to address both the KBs scalability issues and the MLMs interpretability issues. By committing the knowledge embedded in MLMs to a KB, it becomes interpretable. To do that we introduce MLMLM, Mean Likelihood Masked Language Model, an approach comparing the mean likelihood of generating the different entities to perform link prediction in a tractable manner. We obtain State of the Art (SotA) results on the WN18RR dataset and SotA results on the Precision@1 metric on the WikidataM5 inductive and transductive setting. We also obtain convincing results on link prediction on previously unseen entities, making MLMLM a suitable approach to introducing new entities to a KB.
Subjects: |
2700 Information technology > 2705 Software and development 2700 Information technology > 2706 Software engineering |
---|---|
Department: | Department of Computer Engineering and Software Engineering |
Funders: | Apogée Canada, Canada First Research Excellence Fund program, École Polytechnique Startup Fund PIED, Canada CIFAR AI Chair, GRSNG / NSERC - Discovery Grant |
PolyPublie URL: | https://publications.polymtl.ca/10613/ |
Conference Title: | The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) |
Conference Location: | Bangkok, Thailand |
Conference Date(s): | 2021-08-01 - 2021-08-06 |
Publisher: | Association for Computational Linguistics |
DOI: | 10.18653/v1/2021.findings-acl.378 |
Official URL: | https://doi.org/10.18653/v1/2021.findings-acl.378 |
Date Deposited: | 19 Dec 2022 13:47 |
Last Modified: | 27 Sep 2024 00:14 |
Cite in APA 7: | Clouatre, L., Trempe, P., Zouaq, A., & Anbil Parthipan, S. C. (2021, August). MLMLM: link prediction with mean likelihood masked language model [Paper]. The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Bangkok, Thailand. https://doi.org/10.18653/v1/2021.findings-acl.378 |
---|---|
Statistics
Total downloads
Downloads per month in the last year
Origin of downloads
Dimensions