Gottipati, S. K., Pathak, Y., Nuttall, R., Sahir, Chunduru, R., Touati, A., Subramanian, S. G., Taylor, M. E., & Anbil Parthipan, S. C. (December 2020). Maximum reward formulation in reinforcement learning [Paper]. 2020 NeurIPS Deep RL Workshop (15 pages).
Le Ny, J., Touati, A., & Pappas, G. J. (April 2014). Real-time privacy-preserving model-based estimation of traffic flows [Paper]. 5th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS 2014), Berlin, Germany.
Romoff, J., Henderson, P., Kanaa, D., Bengio, E., Touati, A., Bacon, P.-L., & Pineau, J. (May 2021). TDprop: does adaptive optimization with Jacobi preconditioning help temporal difference learning? [Paper]. 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021) (9 pages).