Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.

T. Zhang, T. Ren, C. Xiao, W. Xiao, J. Gonzalez, D. Schuurmans, and B. Dai. UAI, volume 216 of Proceedings of Machine Learning Research, page 2477-2487. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Dale Adams

Aklilu Dalelo

Dale Rubbra

Dale Pearson

Jonathan Dale

Other publications of authors with the same name

Data Perturbation for Escaping Local Maxima in Learning.G. Elidan, M. Ninio, N. Friedman, and D. Schuurmans. AAAI/IAAI, page 132-139. AAAI Press / The MIT Press, (2002)Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.O. Nachum, M. Norouzi, K. Xu, and D. Schuurmans. CoRR, (2017)Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace ClusteringY. Yu, and D. Schuurmans. CoRR, (2012)Variational Rejection Sampling.A. Grover, R. Gummadi, M. Lázaro-Gredilla, D. Schuurmans, and S. Ermon. AISTATS, volume 84 of Proceedings of Machine Learning Research, page 823-832. PMLR, (2018)Learning Gene Regulatory Networks via Globally Regularized Risk Minimization.Y. Guo, and D. Schuurmans. RECOMB-CG, volume 4751 of Lecture Notes in Computer Science, page 83-95. Springer, (2007)Sparse Learning Based Linear Coherent Bi-clustering.Y. Shi, X. Liao, X. Zhang, G. Lin, and D. Schuurmans. WABI, volume 7534 of Lecture Notes in Computer Science, page 346-364. Springer, (2012)Stochastic Neural Networks with Monotonic Activation FunctionsS. Ravanbakhsh, B. Poczos, J. Schneider, D. Schuurmans, and R. Greiner. (2015)cite arxiv:1601.00034v2.pdfComment: AISTATS 2016.Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.O. Nachum, M. Norouzi, K. Xu, and D. Schuurmans. ICLR (Poster), OpenReview.net, (2018)Improving Policy Gradient by Exploring Under-appreciated Rewards.O. Nachum, M. Norouzi, and D. Schuurmans. ICLR (Poster), OpenReview.net, (2017)Self-Supervised Chinese Word Segmentation.F. Peng, and D. Schuurmans. IDA, volume 2189 of Lecture Notes in Computer Science, page 238-247. Springer, (2001)

BibSonomy

Disambiguation of "Schuurmans, Dale"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.

Please choose a person to relate this publication to

Dale Adams

Aklilu Dalelo

Dale Rubbra

Dale Pearson

Jonathan Dale

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Schuurmans, Dale"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Energy-based Predictive Representations for Partially Observed Reinforcement Learning.

Please choose a person to relate this publication to

Dale Adams

Aklilu Dalelo

Dale Rubbra

Dale Pearson

Jonathan Dale

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.