Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Sublinear Optimal Policy Value Estimation in Contextual Bandits., , and . CoRR, (2019)Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy., , , and . AAAI, page 4436-4443. AAAI Press, (2020)Fairer but Not Fair Enough On the Equitability of Knowledge Tracing., and . LAK, page 335-339. ACM, (2019)Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning., , and . NeurIPS, page 13626-13640. (2021)Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes., , and . NeurIPS, page 15650-15666. (2021)Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding., , , and . NeurIPS, (2020)Examining the Use of an AI-Powered Teacher Orchestration Tool at Scale., , , and . L@S, page 356-360. ACM, (2024)Adaptive Instrument Design for Indirect Experiments., , , and . ICLR, OpenReview.net, (2024)Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds., and . ICML, volume 97 of Proceedings of Machine Learning Research, page 7304-7312. PMLR, (2019)Combining parametric and nonparametric models for off-policy evaluation., , , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 2366-2375. PMLR, (2019)