Author of the publication

Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.

, , , , , , and . PRICAI (3), volume 13033 of Lecture Notes in Computer Science, page 46-59. Springer, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Sub-captions Semantic-Guided Network for Image Captioning., , , , , and . ICIC (3), volume 13395 of Lecture Notes in Computer Science, page 367-379. Springer, (2022)A novel representative image selection method in lager-scale image dataset., , and . ICIMCS, page 331-334. ACM, (2013)Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information., , , , and . CoRR, (2018)Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors., , and . CoRR, (2017)Dynamic hierarchical Markov random fields and their application to web data extraction., , , and . ICML, volume 227 of ACM International Conference Proceeding Series, page 1175-1182. ACM, (2007)Learning from crowds in the presence of schools of thought., and . KDD, page 226-234. ACM, (2012)StatSnowball: a statistical approach to extracting entity relationships., , , , and . WWW, page 101-110. ACM, (2009)Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors., , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 5995-6003. PMLR, (2018)Discriminative infinite latent feature models., and . ChinaSIP, page 184-188. IEEE, (2013)Robust Learning of Deep Time Series Anomaly Detection Models with Contaminated Training Data., , , and . CoRR, (2022)