Author of the publication

Trust region policy optimization via entropy regularization for Kullback-Leibler divergence constraint.

Neurocomputing, (2024)


Other publications of authors with the same name

A Seasonal Auto-Regressive Model Based Support Vector Regression Prediction Method for H5N1 avian Influenza Animal Events. International Journal of Computational Intelligence and Applications, 10 (2): 199-230 (2011)

A Knowledge-based Efficiency Assessment System for Distribution Network using Data Envelopment Analysis. RCIS, page 331-336. IEEE, (2010)

Personalized Multi-Stage Decision Support in Reverse Logistics Management. Intelligent Data Mining, volume 5 of Studies in Computational Intelligence, Springer, (2005)

Multi-follower linear bilevel programming: model and Kuhn-Tucker approach. IADIS AC, page 81-88. IADIS, (2005)

A particle swarm optimization based algorithm for fuzzy bilevel decision making with constraints-shared followers. SAC, page 1075-1079. ACM, (2009)

A Bilevel Optimization Model and a PSO-based Algorithm in Day-ahead Electricity Markets. SMC, page 611-616. IEEE, (2009)

A Framework of Hybrid Recommender System for Personalized Clinical Prescription. ISKE, page 189-195. IEEE, (2015)

A knowledge-based multi-role decision support system for ore blending cost optimization of blast furnaces. Eur. J. Oper. Res., 215 (1): 194-203 (2011)

Model Checking for Asynchronous Web Service Composition Based on XYZ/ADL. WISM (2), volume 6988 of Lecture Notes in Computer Science, page 428-435. Springer, (2011)

Learning under Concept Drift: A Review. CoRR, (2020)