Author of the publication

Model-based Policy Optimization under Approximate Bayesian Inference.

, , and . AISTATS, volume 238 of Proceedings of Machine Learning Research, page 3250-3258. PMLR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

On the Minimax Capacity Loss Under Sub-Nyquist Universal Sampling., , and . IEEE Trans. Inf. Theory, 63 (6): 3348-3367 (2017)Interactive learning of words and objects for a humanoid robot. (Apprentissage interactif de mots et d'objets pour un robot humanoïde).. University of Paris-Saclay, France, (2017)Context-Dependent Preferences and Innovation Strategy., and . Manag. Sci., 59 (12): 2747-2765 (2013)Sequential Search with Refinement: Model and Application with Click-Stream Data., and . Manag. Sci., 63 (12): 4345-4365 (2017)Implicit Regularization in Nonconvex Statistical Estimation: Gradient Descent Converges Linearly for Phase Retrieval and Matrix Completion., , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 3351-3360. PMLR, (2018)Shannon meets Nyquist: Capacity limits of sampled analog channels., , and . ICASSP, page 3104-3107. IEEE, (2011)The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model., , , , , and . CoRR, (2023)PIC 4th Challenge: Semantic-Assisted Multi-Feature Encoding and Multi-Head Decoding for Dense Video Captioning., , , , , and . CoRR, (2022)Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence., , , , , and . CoRR, (2021)Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale., , , , , , , , , and 12 other author(s). CoRR, (2023)