<  Back to the Polytechnique Montréal portal

Zeroth Order Optimization for Pretraining Language Models

Nathan Allaire, Mahsa Ghazvini Nejad, Sébastien Le Digabel and Vahid Partovi Nia

Paper (2025)

Open Acess document at official publisher
An external link is available for this item
Department: Department of Mathematics and Industrial Engineering
Research Center: GERAD - Research Group in Decision Analysis
ISBN: 9789897587306
PolyPublie URL: https://publications.polymtl.ca/64441/
Conference Title: 14th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2025)
Conference Location: Porto, Portugal
Conference Date(s): 2025-02-23 - 2025-02-25
Journal Title: Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - ICPRAM (vol. 1)
Publisher: Scitepress
DOI: 10.5220/0013261100003905
Official URL: https://doi.org/10.5220/0013261100003905
Date Deposited: 07 Apr 2025 11:27
Last Modified: 07 Apr 2025 11:27
Cite in APA 7: Allaire, N., Ghazvini Nejad, M., Le Digabel, S., & Partovi Nia, V. (2025, February). Zeroth Order Optimization for Pretraining Language Models [Paper]. 14th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2025), Porto, Portugal. Published in Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - ICPRAM, 1. https://doi.org/10.5220/0013261100003905

Statistics

Dimensions

Repository Staff Only

View Item View Item