

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Diffusion Model Alignment Using Direct Preference Optimization. CoRR, (2023)

Direct Preference Optimization: Your Language Model is Secretly a Reward Model. (2023)

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? CoRR, (2024)

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents. CoRR, (2024)

Disentangling Length from Quality in Direct Preference Optimization. ACL (Findings), pp. 4998-5017. Association for Computational Linguistics, (2024)

MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning. CoRL, vol. 229 of Proceedings of Machine Learning Research, pp. 3654-3671. PMLR, (2023)

Visual Adversarial Imitation Learning using Variational Models. NeurIPS, pp. 3016-3028. (2021)

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback. EMNLP, pp. 5433-5442. Association for Computational Linguistics, (2023)

OpenVLA: An Open-Source Vision-Language-Action Model. CoRR, (2024)

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration. ICRA, pp. 6892-6903. IEEE, (2024)