Kastner, T., Erdogdu, M. A., & Farahmand, A.-M. (December 2023). Distributional model equivalence for risk-sensitive reinforcement learning [Paper]. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, Louisiana, USA (22 pages).
Ma, A., Pan, Y., & Farahmand, A.-M. (2023). Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods. Transactions on Machine Learning Research, 57 pages.