Bard, N., Foerster, J. N., Anbil Parthipan, S. C., Burch, N., Lanctot, M., Song, H. F., Parisotto, E., Dumoulin, V., Moitra, S., Hughes, E., Dunning, I., Mourad, S., Larochelle, H., Bellemare, M. G., & Bowling, M. (2020). The Hanabi challenge: A new frontier for AI research. Artificial Intelligence, 280, 19 pages.
Gottipati, S. K., Pathak, Y., Nuttall, R., Sahir, Chunduru, R., Touati, A., Subramanian, S. G., Taylor, M. E., & Anbil Parthipan, S. C. (December 2020). Maximum reward formulation in reinforcement learning [Paper]. 2020 NeurIPS Deep RL Workshop (15 pages).
Gottipati, S. K., Sattarov, B., Niu, S., Pathak, Y., Wei, H., Liu, S., Thomas, K. M. J., Blackburn, S., Coley, C. W., Tang, J., Anbil Parthipan, S. C., & Bengio, Y. (July 2020). Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning [Paper]. 37th International Conference on Machine Learning (ICML 2020), Vienna, Austria.
Laleh, T., Faramarzi, M., Rish, I., & Anbil Parthipan, S. C. (July 2020). Chaotic continual learning [Paper]. 37th International Conference on Machine Learning (PMLR 2020), Vienna, Austria (6 pages).
Van Seijen, H., Nekoei, H., Racah, E., & Anbil Parthipan, S. C. (December 2020). The LoCA regret: A consistent metric to evaluate model-based behavior in reinforcement learning [Paper]. 34th Conference on Neural Information Processing Systems (NeurIPS 2020).