Monter d'un niveau |
Romoff, J., Henderson, P., Kanaa, D., Bengio, E., Touati, A., Bacon, P.-L., & Pineau, J. (mai 2021). TDprop : does adaptive optimization with Jacobi preconditioning help temporal difference learning? [Communication écrite]. 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021) (9 pages). Lien externe