TDprop: Does adaptive optimization with Jacobi preconditioning help temporal difference learning?

Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon and Joelle Pineau

Paper (2021)

Department: Department of Computer Engineering and Software Engineering
PolyPublie URL: https://publications.polymtl.ca/49068/
Conference Title: 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021)
Conference Date(s): 2021-05-03 - 2021-05-07
Publisher: International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Official URL: https://www.ifaamas.org/Proceedings/aamas2021/pdfs...
Date Deposited: 18 Apr 2023 15:00
Last Modified: 25 Sep 2024 16:38
Cite in APA 7: Romoff, J., Henderson, P., Kanaa, D., Bengio, E., Touati, A., Bacon, P.-L., & Pineau, J. (2021, May). TDprop: Does adaptive optimization with Jacobi preconditioning help temporal difference learning? [Paper]. 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021) (9 pages). https://www.ifaamas.org/Proceedings/aamas2021/pdfs/p1082.pdf
