From post

Improving Sample Efficiency of Multiagent Reinforcement Learning With Nonexpert Policy for Flocking Control.

, , , , , и . IEEE Internet Things J., 10 (16): 14014-14027 (августа 2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

A Hierarchical Representation Policy Iteration Algorithm for Reinforcement Learning., , , , и . IScIDE, том 7751 из Lecture Notes in Computer Science, стр. 735-742. Springer, (2012)Research on the Method of Simulating Knowledge Structure of the Information Searchers - Illustrated by the Case of Pomology Information Retrieval., , , и . CCTA (1), том 419 из IFIP Advances in Information and Communication Technology, стр. 450-457. Springer, (2013)Tuning thermal transport in nanotubes with topological defects, , и . Applied Physics Letters, 99 (9): 091905 (2011)Constructing an EJB Application in a WFMS., , , и . COMPSAC, стр. 284-286. IEEE Computer Society, (2002)Satellite-Relayed Intercontinental Quantum Network, , , , , , , , , и 26 other автор(ы). Phys. Rev. Lett., (января 2018)Dynamic activities on an agent-based workflow management system., , и . AICCSA, стр. 122. IEEE Computer Society, (2005)The Global Precipitation Climatology Project (GPCP) Monthly Analysis (New Version 2.3) and a Review of 2017 Global Precipitation, , , , , , , , , и 3 other автор(ы). Atmosphere, 9 (4): 138+ (07.04.2018)PPLN-Based Flexible Optical Logic <emphasis emphasistype="smcaps">and</emphasis> Gate, , , , , , и . (2008)Examining ERP Committee Beliefs: A Comparison of Alternative Models., , , и . ICIS, стр. 62. Association for Information Systems, (2008)The sequence and de novo assembly of the giant panda genome, , , , , , , , , и 113 other автор(ы). Nature, 463 (7279): 311--317 (января 2010)