From post

Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs.

, , , , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 7447-7458. PMLR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing., , и . CoRR, (2022)Hedging the Drift: Learning to Optimize under Non-Stationarity., , и . CoRR, (2019)Risk-Aware Linear Bandits with Application in Smart Order Routing., , и . ICAIF, стр. 334-342. ACM, (2022)Coresets for differentially private k-means clustering and applications to privacy in mobile sensor networks., , , и . IPSN, стр. 3-15. ACM, (2017)Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs., , , , и . CoRR, (2020)User Experience Design Professionals' Perceptions of Generative Artificial Intelligence., , , , , и . CoRR, (2023)Learning to Price Supply Chain Contracts against a Learning Retailer., , и . CoRR, (2022)Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs., , , , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 7447-7458. PMLR, (2021)Trigger Warning Labeling with RoBERTa and Resampling for Distressing Content Detection., , , , , , , и . CLEF (Working Notes), том 3497 из CEUR Workshop Proceedings, стр. 2557-2561. CEUR-WS.org, (2023)Learning to Optimize under Non-Stationarity., , и . AISTATS, том 89 из Proceedings of Machine Learning Research, стр. 1079-1087. PMLR, (2019)