Up a level |
Gottipati, S. K., Pathak, Y., Nuttall, R., Sahir, Chunduru, R., Touati, A., Subramanian, S. G., Taylor, M. E., & Anbil Parthipan, S. C. (2020, December). Maximum reward formulation in reinforcement learning [Paper]. 2020 NeurIPS Deep RL Workshop (15 pages). External link
Le Ny, J., Touati, A., & Pappas, G. J. (2014, April). Real-time privacy-preserving model-based estimation of traffic flows [Paper]. 5th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS 2014), Berlin, Germany. External link
Romoff, J., Henderson, P., Kanaa, D., Bengio, E., Touati, A., Bacon, P.-L., & Pineau, J. (2021, May). TDprop : does adaptive optimization with Jacobi preconditioning help temporal difference learning? [Paper]. 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021) (9 pages). External link