Author of the publication

On the Design of Estimators for Bandit Off-Policy Evaluation.

, , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 6468-6476. PMLR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Concise Introduction to Multiagent Systems and Distributed AI. Informatics Institute, University of Amsterdam, (September 2003)Non-communicative multi-robot coordination in dynamic environments, , and . Robotics and Autonomous Systems, 50 (2-3): 99 - 114 (2005)fastGapFill : Efficient gap filling in metabolic networks, , and . Bioinformatics, (May 7, 2014)NP-hardness of polytope M-matrix testing and related problems. CoRR, (2012)On the Design of Estimators for Bandit Off-Policy Evaluation., , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 6468-6476. PMLR, (2019)Optimal and approximate Q-value functions for decentralized POMDPs, , and . Journal of Artificial Intelligence Research, (May 28, 2008)Supervised Dimension Reduction of Intrinsically Low-Dimensional Data., , and . Neural Comput., 14 (1): 191-215 (2002)Non-linear CCA and PCA by Alignment of Local Models., , and . NIPS, page 297-304. MIT Press, (2003)Coordinating Principal Component Analyzers., , and . ICANN, volume 2415 of Lecture Notes in Computer Science, page 914-919. Springer, (2002)Towards an Optimal Scoring Policy for Simulated Soccer Agents., , , and . RoboCup, volume 2752 of Lecture Notes in Computer Science, page 296-303. Springer, (2002)