Article,

PERL: Parameter Efficient Reinforcement Learning from Human Feedback.

H. Sidahmed, S. Phatale, A. Hutcheson, Z. Lin, Z. Chen, Z. Yu, J. Jin, R. Komarytsia, C. Ahlheim, Y. Zhu, S. Chaudhary, B. Li, S. Ganesh, B. Byrne, J. Hoffmann, H. Mansoor, W. Li, A. Rastogi, and L. Dixon.
CoRR, (2024)

Meta data

BibTeX key: journals/corr/abs-2403-10704
entry type: article
year: 2024
journal: CoRR
volume: abs/2403.10704
ee: https://doi.org/10.48550/arXiv.2403.10704
url: http://dblp.uni-trier.de/db/journals/corr/corr2403.html#abs-2403-10704

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on