Up a level |
Barde, P., Roy, J., Jeon, W., Pineau, J., Nowrouzezahrai, D., & Pal, C. J. (2020, December). Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization [Paper]. 34th Conference on Neural Information Processing Systems (NeurIPS 2020) (11 pages). External link