![]() | Monter d'un niveau |
Farahmand, A.-M. (décembre 2016). Iterative value-aware model learning [Présentation]. Dans 13th European Workshop on Reinforcement Learning (EWRL 2016), Barcelona, Spain. Non disponible
Farahmand, A.-M., Precup, D., Barreto, A. M. S., & Ghavamzadeh, M. (octobre 2013). CAPI : generalized classification-based approximate policy iteration [Communication écrite]. Multi-Disciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Princeton, NJ, USA. Non disponible
Fard, M. M., Grinberg, Y., Farahmand, A.-M., Pineau, J., & Precup, D. (décembre 2013). Bellman error based feature generation using random projections on sparse spaces [Communication écrite]. 27th Conference on Neural Information Processing Systems (NeurIPS 2013), Las Vegas, NV, USA (9 pages). Lien externe
Kim, B., Farahmand, A.-M., Pineau, J., & Precup, D. (octobre 2013). Approximate policy iteration with demonstrated data [Communication écrite]. Multi-Disciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Princeton, NJ, USA. Non disponible
Kim, B., Farahmand, A.-M., Pineau, J., & Precup, D. (décembre 2013). Learning from limited demonstration [Communication écrite]. 27th Conference on Neural Information Processing Systems (NeurIPS 2013), Las Vegas, NV, USA (9 pages). Lien externe