Monter d'un niveau |
Van Seijen, H., Nekoei, H., Racah, E., & Anbil Parthipan, S. C. (décembre 2020). The LoCA regret: A consistent metric to evaluate model-based behavior in reinforcement learning [Communication écrite]. 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Non disponible