Author of the publication

Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer.

, , , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Life Assistants for the Elderly Based on Mobile Devices., , , , , and . DASC/PiCom/DataCom/CyberSciTech, page 537-542. IEEE, (2019)Behavior Contrastive Learning for Unsupervised Skill Discovery., , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 39183-39204. PMLR, (2023)Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games., , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 3899-3909. PMLR, (2021)Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes., , , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 8016-8038. PMLR, (2022)Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning., , , , and . Inf. Sci., (2024)Landslide Hazard Prediction Based on Small Baseline Subset-Interferometric Synthetic-Aperture Radar Technology Combined with Land-Use Dynamic Change and Hydrological Conditions (Sichuan, China)., and . Remote. Sens., 16 (15): 2715 (August 2024)Policy Learning Using Weak Supervision., , , and . NeurIPS, page 19960-19973. (2021)Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards., , , , , and . CoRR, (2024)Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer., , , , , , , and . CoRR, (2024)Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting., , , , , and . CoRR, (2024)