Up a level |
Gottipati, S. K., Pathak, Y., Nuttall, R., Sahir, Chunduru, R., Touati, A., Subramanian, S. G., Taylor, M. E., & Anbil Parthipan, S. C. (2020, December). Maximum reward formulation in reinforcement learning [Paper]. 2020 NeurIPS Deep RL Workshop (15 pages). External link