Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards.

H. Hwang, D. Kim, S. Kim, S. Ye, and M. Seo. CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Eunyoung Seo

Ean-Jeong Seo

Ki-Chang Seong

Bong-Seock Seo

Paek Pyung Seon

Other publications of authors with the same name

Syntactic Question Abstraction and Retrieval for Data-Scarce Semantic Parsing.W. Hwang, J. Yim, S. Park, and M. Seo. AKBC, (2020)In-Context Instruction Learning.S. Ye, H. Hwang, S. Yang, H. Yun, Y. Kim, and M. Seo. CoRR, (2023)KTRL+F: Knowledge-Augmented In-Document Search.H. Oh, H. Shin, M. Ko, H. Lee, and M. Seo. CoRR, (2023)TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models.J. Jang, S. Ye, C. Lee, S. Yang, J. Shin, J. Han, G. Kim, and M. Seo. EMNLP, page 6237-6250. Association for Computational Linguistics, (2022)Generative Multi-hop Retrieval.H. Lee, S. Yang, H. Oh, and M. Seo. EMNLP, page 1417-1436. Association for Computational Linguistics, (2022)Towards Continual Knowledge Learning of Language Models.J. Jang, S. Ye, S. Yang, J. Shin, J. Han, G. Kim, S. Choi, and M. Seo. ICLR, OpenReview.net, (2022)Gradient Ascent Post-training Enhances Language Model Generalization.D. Yoon, J. Jang, S. Kim, and M. Seo. ACL (2), page 851-864. Association for Computational Linguistics, (2023)Two Examples are Better than One: Context Regularization for Gradient-based Prompt Tuning.H. Ha, S. Jung, J. Park, M. Seo, S. won Hwang, and B. Chun. ACL (Findings), page 3335-3350. Association for Computational Linguistics, (2023)Aligning Large Language Models through Synthetic Feedback.S. Kim, S. Bae, J. Shin, S. Kang, D. Kwak, K. Yoo, and M. Seo. EMNLP, page 13677-13700. Association for Computational Linguistics, (2023)Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning.M. Ko, S. Park, J. Park, and M. Seo. CoRR, (2024)

BibSonomy

Disambiguation of "Seo, Minjoon"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards.

Please choose a person to relate this publication to

Eunyoung Seo

Ean-Jeong Seo

Ki-Chang Seong

Bong-Seock Seo

Paek Pyung Seon

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Seo, Minjoon"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards.

Please choose a person to relate this publication to

Eunyoung Seo

Ean-Jeong Seo

Ki-Chang Seong

Bong-Seock Seo

Paek Pyung Seon

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards.