Monter d'un niveau |
Barde, P., Roy, J., Jeon, W., Pineau, J., Nowrouzezahrai, D., & Pal, C. J. (décembre 2020). Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization [Communication écrite]. 34th Conference on Neural Information Processing Systems (NeurIPS 2020) (11 pages). Lien externe
Roy, J., Barde, P., Harvey, F. G., Nowrouzezahrai, D., & Pal, C. J. (décembre 2020). Promoting coordination through policy regularization in multi-agent deep reinforcement learning [Communication écrite]. 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Lien externe