Author of the publication

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.

, , , , , , and . UAI, volume 216 of Proceedings of Machine Learning Research, page 2477-2487. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Data Perturbation for Escaping Local Maxima in Learning., , , and . AAAI/IAAI, page 132-139. AAAI Press / The MIT Press, (2002)Trust-PCL: An Off-Policy Trust Region Method for Continuous Control., , , and . CoRR, (2017)Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering, and . CoRR, (2012)Variational Rejection Sampling., , , , and . AISTATS, volume 84 of Proceedings of Machine Learning Research, page 823-832. PMLR, (2018)Learning Gene Regulatory Networks via Globally Regularized Risk Minimization., and . RECOMB-CG, volume 4751 of Lecture Notes in Computer Science, page 83-95. Springer, (2007)Sparse Learning Based Linear Coherent Bi-clustering., , , , and . WABI, volume 7534 of Lecture Notes in Computer Science, page 346-364. Springer, (2012)Stochastic Neural Networks with Monotonic Activation Functions, , , , and . (2015)cite arxiv:1601.00034v2.pdfComment: AISTATS 2016.Trust-PCL: An Off-Policy Trust Region Method for Continuous Control., , , and . ICLR (Poster), OpenReview.net, (2018)Improving Policy Gradient by Exploring Under-appreciated Rewards., , and . ICLR (Poster), OpenReview.net, (2017)Self-Supervised Chinese Word Segmentation., and . IDA, volume 2189 of Lecture Notes in Computer Science, page 238-247. Springer, (2001)