![]() | Monter d'un niveau |
Awal, R., Massoud, M., Feizi, A., Li, Z., Wang, S., Pal, C. J., Agrawal, A., Vazquez, D., Reddy, S., Rodriguez, J. A., Taslakian, P., Gella, S., & Rajeswar, S. (novembre 2025). WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation [Communication écrite]. Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), Suzhou, China. Lien externe
Nayak, S., Jian, X., Lin, K. Q., Rodriguez, J. A., Kalsi, M., Chapados, N., Özsu, M. T., Agrawal, A., Vazquez, D., Pal, C. J., Taslakian, P., Gella, S., & Rajeswar, S. (février 2025). UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction [Communication écrite]. 42nd International Conference on Machine Learning (PMLR 2025), Vancouver, BC, Canada. Lien externe
Rodriguez, J., Jian, X., Panigrahi, S. S., Zhang, T., Feizi, A., Puri, A., Kalkunte, A., Savard, F., Masry, A., Nayak, S., Awal, R., Massoud, M., Abaskohi, A., Li, Z., Wang, S., Noel, P.-A., Richter, M. L., Vadacchino, S., Agarwal, S., ... Rajeswar, S. (avril 2025). BIGDOCS : an open dataset for training multimodal models on document and code tasks [Communication écrite]. 13th International Conference on Learning Representations (ICLR 2025), Singapore, Singapore. Lien externe
Sahu, G., Puri, A., Rodriguez, J. A., Abaskohi, A., Chegini, M., Drouin, A., Taslakian, P., Zantedeschi, V., Lacoste, A., Vazquez, D., Chapados, N., Pal, C. J., Rajeswar, S., & Laradji, I. (avril 2025). InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation [Communication écrite]. 13th International Conference on Learning Representations (ICLR 2025), Singapore, Singapore. Lien externe