Author of the publication

Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards.

CoRR, (2024)

Other publications of authors with the same name

Syntactic Question Abstraction and Retrieval for Data-Scarce Semantic Parsing. AKBC, (2020)
In-Context Instruction Learning. CoRR, (2023)
KTRL+F: Knowledge-Augmented In-Document Search. CoRR, (2023)
TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models. EMNLP, page 6237-6250. Association for Computational Linguistics, (2022)
Generative Multi-hop Retrieval. EMNLP, page 1417-1436. Association for Computational Linguistics, (2022)
Towards Continual Knowledge Learning of Language Models. ICLR, OpenReview.net, (2022)
Gradient Ascent Post-training Enhances Language Model Generalization. ACL (2), page 851-864. Association for Computational Linguistics, (2023)
Two Examples are Better than One: Context Regularization for Gradient-based Prompt Tuning. ACL (Findings), page 3335-3350. Association for Computational Linguistics, (2023)
Aligning Large Language Models through Synthetic Feedback. EMNLP, page 13677-13700. Association for Computational Linguistics, (2023)
Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning. CoRR, (2024)