Author of the publication

TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?

, , , , , , and . AAMAS, page 1082-1090. ACM, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Underwater multi-robot convoying using visual tracking by detection., , , , , , , , , and . IROS, page 4189-4196. IEEE, (2017)A Safe Harbor for AI Evaluation and Red Teaming., , , , , , , , , and 13 other author(s). CoRR, (2024)Benchmark Environments for Multitask Learning in Continuous Domains., , , , , and . CoRR, (2017)Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset., , , , , , and . NeurIPS, (2022)TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?, , , , , , and . AAMAS, page 1082-1090. ACM, (2021)Entropy Regularization for Population Estimation., , , and . AAAI, page 12198-12204. AAAI Press, (2023)What's in Your "Safe" Data?: Identifying Benign Data that Breaks Safety., , and . CoRR, (2024)A Survey of Available Corpora for Building Data-Driven Dialogue Systems., , , , and . CoRR, (2015)Cost Adaptation for Robust Decentralized Swarm Behaviour., , , and . CoRR, (2017)Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods., , and . CoRR, (2018)