Inproceedings,

Distributionally Robust Policy Gradient for Offline Contextual Bandits.

AISTATS, volume 206 of Proceedings of Machine Learning Research, pages 6443-6462. PMLR, 2023.
