![]() | Up a level |
Van Seijen, H., Nekoei, H., Racah, E., & Anbil Parthipan, S. C. (2020, December). The LoCA regret: A consistent metric to evaluate model-based behavior in reinforcement learning [Paper]. 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Unavailable