Author of the publication

Stable Training of Bellman Error in Reinforcement Learning.

, , , and . ICONIP (5), volume 1333 of Communications in Computer and Information Science, page 439-448. Springer, (2020)


Other publications of authors with the same name

A pooled subspace mixture density model for pattern classification in high-dimensional spaces., , and . IJCNN, page 2466-2471. IEEE, (2008)

Highway Transformer: Self-Gating Enhanced Self-Attentive Networks, , and . Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, page 6887-6900. Online, Association for Computational Linguistics, (July 2020)

Exploiting coarse-to-fine mechanism for fine-grained recognition., , , , and . ICIP, page 649-653. IEEE, (2016)

Transductive Learning on Adaptive Graphs., , , , and . AAAI, page 661-666. AAAI Press, (2010)

POPO: Pessimistic Offline Policy Optimization., and . CoRR, (2020)

WD3: Taming the Estimation Bias in Deep Reinforcement Learning., and . ICTAI, page 391-398. IEEE, (2020)

Combing Policy Evaluation and Policy Improvement in a Unified f-Divergence Framework., , , , , , and . CoRR, (2021)

DAS3D: Dual-Modality Anomaly Synthesis for 3D Anomaly Detection., , , and . ECCV Workshops (4), volume 15626 of Lecture Notes in Computer Science, page 148-165. Springer, (2024)

Baffle: Hiding Backdoors in Offline Reinforcement Learning Datasets., , , , , , , , , and 1 other author(s). SP, page 2086-2104. IEEE, (2024)

Meticulously Selecting 1% of the Dataset for Pre-training! Generating Differentially Private Images Data with Semantics Query., , , , , and . CoRR, (2023)