Article,

Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks

D. Harnack, J. Pivin-Bachler, and N. Navarro-Guerrero.
Neural Computing and Applications, 35 (23): 16931–16943 (August 2023)Special Issue on Human-aligned Reinforcement Learning for Autonomous Agents and Robots.
DOI: 10.1007/s00521-022-07949-0

Abstract

Reinforcement learning (RL) has become widely adopted in robot control. Despite many successes, one major persisting problem can be very low data efficiency. One solution is interactive feedback, which has been shown to speed up RL considerably. As a result, there is an abundance of different strategies, which are, however, primarily tested on discrete grid-world and small scale optimal control scenarios. In the literature, there is no consensus about which feedback frequency is optimal or at which time the feedback is most beneficial. To resolve these discrepancies we isolate and quantify the effect of feedback frequency in robotic tasks with continuous state and action spaces. The experiments encompass inverse kinematics learning for robotic manipulator arms of different complexity. We show that seemingly contradictory reported phenomena occur at different complexity levels. Furthermore, our results suggest that no single ideal feedback frequency exists. Rather that feedback frequency should be changed as the agent’s proficiency in the task increases.

BibTeX key: Harnack_2022
entry type: article
year: 2023
month: aug
journal: Neural Computing and Applications
number: 23
pages: 16931–16943
publisher: Springer Science and Business Media LLC
volume: 35
DOI: 10.1007/s00521-022-07949-0
url: https://doi.org/10.1007%2Fs00521-022-07949-0
note: Special Issue on Human-aligned Reinforcement Learning for Autonomous Agents and Robots

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{Harnack_2022, abstract = {Reinforcement learning (RL) has become widely adopted in robot control. Despite many successes, one major persisting problem can be very low data efficiency. One solution is interactive feedback, which has been shown to speed up RL considerably. As a result, there is an abundance of different strategies, which are, however, primarily tested on discrete grid-world and small scale optimal control scenarios. In the literature, there is no consensus about which feedback frequency is optimal or at which time the feedback is most beneficial. To resolve these discrepancies we isolate and quantify the effect of feedback frequency in robotic tasks with continuous state and action spaces. The experiments encompass inverse kinematics learning for robotic manipulator arms of different complexity. We show that seemingly contradictory reported phenomena occur at different complexity levels. Furthermore, our results suggest that no single ideal feedback frequency exists. Rather that feedback frequency should be changed as the agent’s proficiency in the task increases.}, added-at = {2023-02-09T12:35:51.000+0100}, author = {Harnack, Daniel and Pivin-Bachler, Julie and Navarro-Guerrero, Nicol{\'{a}}s}, biburl = {https://www.bibsonomy.org/bibtex/209d76849c1697675f02442608fc4d9e7/nng}, doi = {10.1007/s00521-022-07949-0}, interhash = {2d7a4e13a1cedb28de7a8a24bad9fab5}, intrahash = {09d76849c1697675f02442608fc4d9e7}, journal = {Neural Computing and Applications}, keywords = {#rank1 myown}, month = aug, note = {Special Issue on Human-aligned Reinforcement Learning for Autonomous Agents and Robots}, number = 23, pages = {16931–16943}, publisher = {Springer Science and Business Media {LLC}}, timestamp = {2024-02-05T14:56:32.000+0100}, title = {Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks}, url = {https://doi.org/10.1007%2Fs00521-022-07949-0}, volume = 35, year = 2023 }

BibSonomy

Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on