Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Empirical Algorithms for General Stochastic Systems with Continuous States and Actions., , and . CDC, page 6344-6349. IEEE, (2019)Approximate Relative Value Learning for Average-reward Continuous State MDPs., , and . UAI, volume 115 of Proceedings of Machine Learning Research, page 956-964. AUAI Press, (2019)Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes., , , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 10170-10180. PMLR, (2020)Optimal Spectrum Sensing for Cognitive Radio with Imperfect Detector., , , and . VTC Spring, page 1-5. IEEE, (2014)Self-Exploring Language Models: Active Preference Elicitation for Online Alignment., , , , , , and . CoRR, (2024)Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning., , , , , , , , and . CoRR, (2024)An Empirical Relative Value Learning Algorithm for Non-parametric MDPs with Continuous State Space., , and . ECC, page 1368-1373. IEEE, (2019)Finite Time Guarantees for Continuous State MDPs with Generative Model., and . CDC, page 3617-3622. IEEE, (2020)Randomized function fitting-based empirical value iteration., , , and . CDC, page 2467-2472. IEEE, (2017)Language Models can be Logical Solvers., , , , , , and . CoRR, (2023)