Inproceedings,

Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management.

, and .
EC, page 743-744. ACM, (2019)

Meta data

Tags

Users

  • @dblp

Comments and Reviews