Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models

Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh and Sarath Chandar Anbil Parthipan

Paper (2024)

Department: Department of Computer Engineering and Software Engineering
PolyPublie URL: https://publications.polymtl.ca/65059/
Conference Title: Conference on Empirical Methods in Natural Language Processing
Conference Location: Miami, Florida, USA
Conference Date(s): 2024-11-12 - 2024-11-16
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2024.emnlp-main.332
Official URL: https://doi.org/10.18653/v1/2024.emnlp-main.332
Date Deposited: 09 May 2025 09:33
Last Modified: 09 May 2025 09:33
Cite in APA 7: Huang, J., Parthasarathi, P., Rezagholizadeh, M., & Anbil Parthipan, S. C. (2024, November). Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models [Paper]. Conference on Empirical Methods in Natural Language Processing, Miami, Florida, USA. https://doi.org/10.18653/v1/2024.emnlp-main.332
