Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning. AAAI, pages 8799-8806. AAAI Press, (2023)
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds. ICLR, OpenReview.net, (2021)
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. ICLR, OpenReview.net, (2024)
HIVE: Harnessing Human Feedback for Instructional Visual Editing. CoRR, (2023)
Demand Prediction by Incorporating Internet-of-Things Data: A Case of Automobile Repair and Maintenance Service. HICSS, pages 5017-5026. ScholarSpace, (2024)
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward. CoRR, (2024)
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System. NAACL-HLT, pages 1351-1360. Association for Computational Linguistics, (2021)
A Kernel Loss for Solving the Bellman Equation. NeurIPS, pages 15430-15441. (2019)
A Unified Framework for Alternating Offline Model Training and Policy Learning. NeurIPS, (2022)
Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning. CoRR, (2022)