
Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Online tanulás nemstacionárius Markov döntési folyamatokban [Online learning in non-stationary Markov decision processes]. Budapest University of Technology and Economics, Hungary, (2013)

Offline Primal-Dual Reinforcement Learning for Linear MDPs. AISTATS, volume 238 of Proceedings of Machine Learning Research, pp. 3169-3177. PMLR, (2024)

Adversarial Contextual Bandits Go Kernelized. ALT, volume 237 of Proceedings of Machine Learning Research, pp. 907-929. PMLR, (2024)

Offline RL via Feature-Occupancy Gradient Ascent. CoRR, (2024)

Prediction by random-walk perturbation. COLT, volume 30 of JMLR Workshop and Conference Proceedings, pp. 460-473. JMLR.org, (2013)

First-order regret bounds for combinatorial semi-bandits. COLT, volume 40 of JMLR Workshop and Conference Proceedings, pp. 1360-1375. JMLR.org, (2015)

Online Influence Maximization with Local Observations. ALT, volume 98 of Proceedings of Machine Learning Research, pp. 557-580. PMLR, (2019)

Iterate averaging as regularization for stochastic gradient descent. arXiv:1802.08009, (2018)

Efficient and robust algorithms for adversarial linear contextual bandits. COLT, volume 125 of Proceedings of Machine Learning Research, pp. 3049-3068. PMLR, (2020)

Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods. CoRR, (2012)