Optimistic Model Rollouts for Pessimistic Offline Policy Optimization.

AAAI, pages 16678-16686. AAAI Press, (2024)

Other publications of authors with the same name

Focus on Hard Categories and Hard Examples: Remote Sensing Image Scene Classification via Expert Model and Hard Example Mining. IEEE Geosci. Remote. Sens. Lett., (2022)
Enhanced Cross-Modal Transformer Model for Video Semantic Similarity Measurement. IEEE Trans. Circuits Syst. II Express Briefs, 71 (1): 475-479 (January 2024)
Fden: Mining Effective Information of Features in Detecting Network Anomalies. ICASSP, pages 8553-8557. IEEE, (2021)
NTIRE 2022 Challenge on Perceptual Image Quality Assessment. CVPR Workshops, pages 950-966. IEEE, (2022)
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report. ECCV Workshops (3), volume 13803 of Lecture Notes in Computer Science, pages 130-152. Springer, (2022)
Understanding and Predicting Docker Build Duration: An Empirical Study of Containerized Workflow of OSS Projects. ASE, pages 111:1-111:13. ACM, (2022)
Quantification of Transducer Misalignment in Ultrasound Tongue Imaging. INTERSPEECH, pages 3735-3739. ISCA, (2020)
Multimodal Feature Fusion for Video Advertisements Tagging Via Stacking Ensemble. CoRR, (2021)
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles. CoRR, (2024)
Nuclear Norm Maximization Based Curiosity-Driven Learning. CoRR, (2022)