Author of the publication

Understanding Self-Predictive Learning for Reinforcement Learning.

ICML, volume 202 of Proceedings of Machine Learning Research, page 33632-33656. PMLR, (2023)

Other publications of authors with the same name

Adapting to Delays and Data in Adversarial Multi-Armed Bandits. ICML, volume 139 of Proceedings of Machine Learning Research, page 3988-3997. PMLR, (2021)
Online Learning with Gaussian Payoffs and Side Observations. NIPS, page 1360-1368. (2015)
Detecting Overfitting via Adversarial Examples. NeurIPS, page 7856-7866. (2019)
Think out of the "Box": Generically-Constrained Asynchronous Composite Optimization and Hedging. NeurIPS, page 12225-12235. (2019)
ImpatientCapsAndRuns: Approximately Optimal Algorithm Configuration from an Infinite Pool. NeurIPS, (2020)
The Shortest Path Problem Under Partial Monitoring. COLT, volume 4005 of Lecture Notes in Computer Science, page 468-482. Springer, (2006)
Scalable Metric Learning for Co-Embedding. ECML/PKDD (1), volume 9284 of Lecture Notes in Computer Science, page 625-642. Springer, (2015)
Improved convergence rates in empirical vector quantizer design. ISIT, page 301. IEEE, (2004)
Efficient Multi-start Strategies for Local Search Algorithms. ECML/PKDD (1), volume 5781 of Lecture Notes in Computer Science, page 705-720. Springer, (2009)
Max-affine estimators for convex stochastic programming. CoRR, (2016)