![]() | Monter d'un niveau |
Anbil Parthipan, S. C., Khetarpal, K., Rajendran, J., & Riemer, M. (décembre 2024). Balancing Context Length and Mixing Times for Reinforcement Learning at Scale [Communication écrite]. 38th Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, BC, Canada. Lien externe
Thakkar, M., Fournier, Q., Riemer, M., Chen, P.-Y., Zouaq, A., Das, P., & Anbil Parthipan, S. C. (août 2024). A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques [Communication écrite]. 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Hybrid, Bangkok, Thailand. Lien externe