Average Reward Reinforcement Learning for Semi-Markov Decision Processes.

ICONIP (1), volume 10634 of Lecture Notes in Computer Science, pp. 768-777. Springer, (2017)

Other publications by persons with the same name

A unified approach to time-aggregated Markov decision processes. Autom., (2016)
Visual Grasping for a Lightweight Aerial Manipulator Based on NSGA-II and Kinematic Compensation. ICRA, pp. 1-6. IEEE, (2018)
Singularity-Robust Hybrid Visual Servoing Control for Aerial Manipulator. ROBIO, pp. 562-568. IEEE, (2018)
Exploration via Distributional Reinforcement Learning with Epistemic and Aleatoric Uncertainty Estimation. CASE, pp. 2256-2261. IEEE, (2021)
The realization of ZrOxNy temperature sensors with good sensitivity and stability in the temperature range above 150K. NEMS, pp. 1165-1168. IEEE, (2021)
A Deep Safe Reinforcement Learning Approach for Mapless Navigation. ROBIO, pp. 1520-1525. IEEE, (2021)
Multi-Robot Real-time Game Strategy Learning based on Deep Reinforcement Learning. ROBIO, pp. 1192-1197. IEEE, (2022)
Decision Making for Autonomous Driving Via Multimodal Transformer and Deep Reinforcement Learning*. RCAR, pp. 481-486. IEEE, (2022)
Hydrogen abstraction reactions of OH radicals with CH3CH2CH2Cl and CH3CHClCH3: A mechanistic and kinetic study. J. Comput. Chem., 33 (1): 66-75 (2012)
Face recognition based on convolutional neural network and support vector machine. ICIA, pp. 1787-1792. IEEE, (2016)