![]() | Monter d'un niveau |
Ma, A., Pan, Y., & Farahmand, A.-M. (2023). Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods. Transactions on Machine Learning Research, 57 pages. Lien externe
Zhao, X., Pan, Y., Xiao, C., Anbil Parthipan, S. C., & Rajendran, J. (juillet 2023). Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning [Communication écrite]. 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023), Pittsburgh, PA, USA. Lien externe