Romoff, J., Henderson, P., Kanaa, D., Bengio, E., Touati, A., Bacon, P.-L., & Pineau, J. (May 2021). TDprop: does adaptive optimization with Jacobi preconditioning help temporal difference learning? [Paper]. 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021) (9 pages).
Gottipati, S. K., Pathak, Y., Nuttall, R., Sahir, Chunduru, R., Touati, A., Subramanian, S. G., Taylor, M. E., & Anbil Parthipan, S. C. (December 2020). Maximum reward formulation in reinforcement learning [Paper]. 2020 NeurIPS Deep RL Workshop (15 pages).
Le Ny, J., Touati, A., & Pappas, G. J. (April 2014). Real-time privacy-preserving model-based estimation of traffic flows [Paper]. 5th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS 2014), Berlin, Germany.