Author of the publication

From Importance Sampling to Doubly Robust Policy Gradient

(2019). arXiv:1910.09066. Comment: 26 pages; 2 figures.


Other publications of authors with the same name

Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches. (2018). arXiv:1811.08540. Comment: COLT 2019.

Hand segmentation for hand-object interaction from depth map. GlobalSIP, pages 259-263. IEEE (2017).

A Decoupled Learning Strategy for Massive Access Optimization in Cellular IoT Networks. CoRR (2020).

Layer-wise learning based stochastic gradient descent method for the optimization of deep convolutional neural network. J. Intell. Fuzzy Syst., 37 (4): 5641-5654 (2019).

ACO Based Traffic Classified Routing Algorithm in Distributed Satellite Cluster Network. 计算机科学 (Computer Science), 42 (10): 95-100 (2015).

Layered Space-time Architecture for LAS-MIMO CDMA System. Pages 2015-2024 (2005).

Trust-aware generative adversarial network with recurrent neural network for recommender systems. Int. J. Intell. Syst., 36 (2): 778-795 (2021).

Multi-scale feature learning and temporal probing strategy for one-stage temporal action localization. Int. J. Intell. Syst., 37 (7): 4092-4112 (2022).

SAN: Attention-based social aggregation neural networks for recommendation system. Int. J. Intell. Syst., 37 (6): 3373-3393 (2022).

Distributionally Favorable Optimization: A Framework for Data-Driven Decision-Making with Endogenous Outliers. SIAM J. Optim., 34 (1): 419-458 (March 2024).