Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits.

, , , , and . AISTATS, volume 151 of Proceedings of Machine Learning Research, pp. 6357-6386. PMLR, (2022)


Other publications by persons with the same name

Contextual Symmetries in Probabilistic Graphical Models., , , and . IJCAI, pp. 3560-3568. IJCAI/AAAI Press, (2016)

ASAP-UCT: Abstraction of State-Action Pairs in UCT., , , and . IJCAI, pp. 1509-1515. AAAI Press, (2015)

Graphite: Iterative Generative Modeling of Graphs., , and . ICML, volume 97 of Proceedings of Machine Learning Research, pp. 2434-2444. PMLR, (2019)

Best arm identification in multi-armed bandits with delayed feedback., , , , , , , , , and 1 other author(s). AISTATS, volume 84 of Proceedings of Machine Learning Research, pp. 833-842. PMLR, (2018)

Controllable Generative Modeling via Causal Reasoning., , and . Trans. Mach. Learn. Res., (2022)

JUMBO: Scalable Multi-task Bayesian Optimization using Offline Data., , , and . CoRR, (2021)

Boosted Generative Models., and . AAAI, pp. 3077-3084. AAAI Press, (2018)

The Open Catalyst Challenge 2021: Competition Report., , , , , , , , , and 15 other author(s). NeurIPS (Competition and Demos), volume 176 of Proceedings of Machine Learning Research, pp. 29-40. PMLR, (2021)

Decision Transformer: Reinforcement Learning via Sequence Modeling., , , , , , , , and . NeurIPS, pp. 15084-15097. (2021)

BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery., , and . NeurIPS, pp. 7095-7110. (2021)