Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Off-Policy Policy Gradient with Stationary Distribution Correction.

Y. Liu, A. Swaminathan, A. Agarwal, and E. Brunskill. UAI, volume 115 of Proceedings of Machine Learning Research, page 1180-1190. AUAI Press, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Alekh Agarwal

Alekh Jindal

Artem Alekhin

Suman Agarwal

Swarna Agarwal

Other publications of authors with the same name

Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free ApproachesW. Sun, N. Jiang, A. Krishnamurthy, A. Agarwal, and J. Langford. (2018)cite arxiv:1811.08540Comment: COLT 2019.Off-Policy Policy Gradient with State Distribution Correction.Y. Liu, A. Swaminathan, A. Agarwal, and E. Brunskill. CoRR, (2019)Stochastic optimization and sparse statistical recovery: An optimal algorithm for high dimensions.A. Agarwal, S. Negahban, and M. Wainwright. CISS, page 1-2. IEEE, (2014)Fast global convergence of gradient methods for high-dimensional statistical recoveryA. Agarwal, S. Negahban, and M. Wainwright. CoRR, (2011)On the Optimality of Sparse Model-Based Planning for Markov Decision Processes.A. Agarwal, S. Kakade, and L. Yang. CoRR, (2019)Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback.C. Zhang, A. Agarwal, H. III, J. Langford, and S. Negahban. ICML, volume 97 of Proceedings of Machine Learning Research, page 7335-7344. PMLR, (2019)Optimizing Interactive Systems via Data-Driven Objectives.Z. Li, J. Kiseleva, A. Agarwal, M. de Rijke, and R. White. CoRR, (2020)Noisy matrix decomposition via convex relaxation: Optimal rates in high dimensions.A. Agarwal, S. Negahban, and M. Wainwright. ICML, page 1129-1136. Omnipress, (2011)Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes.A. Agarwal, S. Kakade, J. Lee, and G. Mahajan. COLT, volume 125 of Proceedings of Machine Learning Research, page 64-66. PMLR, (2020)Message-passing for Graph-structured Linear Programs: Proximal Methods and Rounding Schemes.P. Ravikumar, A. Agarwal, and M. Wainwright. J. Mach. Learn. Res., (2010)

BibSonomy

Disambiguation of "Agarwal, Alekh"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Off-Policy Policy Gradient with Stationary Distribution Correction.

Please choose a person to relate this publication to

Alekh Agarwal

Alekh Jindal

Artem Alekhin

Suman Agarwal

Swarna Agarwal

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Agarwal, Alekh"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Off-Policy Policy Gradient with Stationary Distribution Correction.

Please choose a person to relate this publication to

Alekh Agarwal

Alekh Jindal

Artem Alekhin

Suman Agarwal

Swarna Agarwal

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Off-Policy Policy Gradient with Stationary Distribution Correction.