Author of the publication

Preference Transformer: Modeling Human Preferences using Transformers for RL.

, , , , , and . ICLR, OpenReview.net, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

The Layer 2 Handoff Scheme for Mobile IP over IEEE 802.11 Wireless LAN., and . ICCSA (1), volume 3043 of Lecture Notes in Computer Science, page 1144-1150. Springer, (2004)The Modeling and Traffic Feedback Control for QoS Management on Local Network., , , and . ICCSA (2), volume 2668 of Lecture Notes in Computer Science, page 463-471. Springer, (2003)Regularizing Class-Wise Predictions via Self-Knowledge Distillation., , , and . CVPR, page 13873-13882. Computer Vision Foundation / IEEE, (2020)Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning., , , , , , and . NeurIPS, page 3029-3042. (2021)SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning., , , , , and . ICLR, OpenReview.net, (2022)SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs., , , , , , , and . CoRR, (2024)Meta-Learning with Self-Improving Momentum Target., , , , and . NeurIPS, (2022)Luminance and gamma optimization for mobile display in low ambient conditions., , , , , , , and . IQSP, volume 9396 of SPIE Proceedings, page 93960B. SPIE, (2015)Evaluation of High Dynamic Range TVs using Actual HDR Content., , , , , , and . CIC, page 219-224. Society for Imaging Science and Technology, (2018)Statistical analysis of upper ocean temperature response to typhoons from ARGO floats and satellite data., , , and . IGARSS, page 2564-2567. IEEE, (2005)