Author of the publication

A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning.

, , , , and . CoRR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy., , , and . CoRR, (2023)Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs., , , , , , , and . CoRR, (2024)Unified Hallucination Detection for Multimodal Large Language Models., , , , , , , , , and . CoRR, (2024)Multiple Instance Learning for Uplift Modeling., , , , , and . CoRR, (2023)ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference., , , , , and . CoRR, (2023)Token-free LLMs Can Generate Chinese Classical Poetry with More Accurate Format., , , , and . CoRR, (2024)Intent Mining: A Social and Semantic Enhanced Topic Model for Operation-Friendly Digital Marketing., , , , , , , , , and 1 other author(s). ICDE, page 3254-3267. IEEE, (2022)Non-stationary Time-aware Kernelized Attention for Temporal Event Prediction., , , , , , and . KDD, page 1224-1232. ACM, (2022)Towards Fine-Grained Temporal Network Representation via Time-Reinforced Random Walk., , , , and . AAAI, page 4973-4980. AAAI Press, (2020)Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses., , , , , , , and . NeurIPS, (2022)