Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Optimistic Policy Optimization via Multiple Importance Sampling.

M. Papini, A. Metelli, L. Lupo, and M. Restelli. ICML, volume 97 of Proceedings of Machine Learning Research, page 4989-4999. PMLR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Alfredo Marcello Marcello

Guglielmo Restelli

Marcello Andrea

Marcello Mariucci

Marcello Bisotti

Other publications of authors with the same name

Risk-Averse Trust Region Optimization for Reward-Volatility Reduction.L. Bisi, L. Sabbioni, E. Vittori, M. Papini, and M. Restelli. IJCAI, page 4583-4589. ijcai.org, (2020)Special Track on AI in FinTech.Inverse Reinforcement Learning with Sub-optimal Experts.R. Poiani, G. Curti, A. Metelli, and M. Restelli. CoRR, (2024)A Practical Guide to Multi-Objective Reinforcement Learning and Planning.C. Hayes, R. Radulescu, E. Bargiacchi, J. Källström, M. Macfarlane, M. Reymond, T. Verstraeten, L. Zintgraf, R. Dazeley, F. Heintz and 8 other author(s). CoRR, (2021)Simultaneously Updating All Persistence Values in Reinforcement Learning.L. Sabbioni, L. Daire, L. Bisi, A. Metelli, and M. Restelli. CoRR, (2022)Best Arm Identification for Stochastic Rising Bandits.M. Mussi, A. Montenegro, F. Trovò, M. Restelli, and A. Metelli. CoRR, (2023)Tight Performance Guarantees of Imitator Policies with Continuous Actions.D. Maran, A. Metelli, and M. Restelli. CoRR, (2022)Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation.S. Parisi, M. Pirotta, and M. Restelli. J. Artif. Intell. Res., (2016)Policy gradient approaches for multi-objective sequential decision making.S. Parisi, M. Pirotta, N. Smacchia, L. Bascetta, and M. Restelli. IJCNN, page 2323-2330. IEEE, (2014)Piecewise constant reinforcement learning for robotic applications.A. Bonarini, A. Lazaric, and M. Restelli. ICINCO-ICSO, page 214-221. INSTICC Press, (2007)978-972-8865-82-5.Extensive-form games with heterogeneous populations: solution concepts, equilibria characterization, learning dynamics.N. Gatti, F. Panozzo, and M. Restelli. Intelligenza Artificiale, 10 (1): 19-31 (2016)

BibSonomy

Disambiguation of "Restelli, Marcello"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Optimistic Policy Optimization via Multiple Importance Sampling.

Please choose a person to relate this publication to

Alfredo Marcello Marcello

Guglielmo Restelli

Marcello Andrea

Marcello Mariucci

Marcello Bisotti

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Restelli, Marcello"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Optimistic Policy Optimization via Multiple Importance Sampling.

Please choose a person to relate this publication to

Alfredo Marcello Marcello

Guglielmo Restelli

Marcello Andrea

Marcello Mariucci

Marcello Bisotti

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Optimistic Policy Optimization via Multiple Importance Sampling.