Inproceedings,

Distributionally Robust Policy Gradient for Offline Contextual Bandits.

Z. Yang, Y. Guo, P. Xu, A. Liu, and A. Anandkumar.
AISTATS, volume 206 of Proceedings of Machine Learning Research, page 6443-6462. PMLR, (2023)

Meta data

BibTeX key: conf/aistats/YangGXLA23
entry type: inproceedings
booktitle: AISTATS
year: 2023
pages: 6443-6462
publisher: PMLR
series: Proceedings of Machine Learning Research
volume: 206
crossref: conf/aistats/2023
ee: https://proceedings.mlr.press/v206/yang23f.html
url: http://dblp.uni-trier.de/db/conf/aistats/aistats2023.html#YangGXLA23

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on