Author of the publication

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.

, , , , , and . CoRR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization., , , , , , , , , and . CoRR, (2024)Neural Knowledge Bank for Pretrained Transformers., , , , , and . CoRR, (2022)Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions., , , , , , and . RepL4NLP@ACL-IJCNLP, page 83-89. Association for Computational Linguistics, (2021)Large Language Models Are Unconscious of Unreasonability in Math Problems., , and . CoRR, (2024)Live Video Comment Generation Based on Surrounding Frames and Live Comments.. CoRR, (2018)Incorporating Connections Beyond Knowledge Embeddings: A Plug-and-Play Module to Enhance Commonsense Reasoning in Machine Reading Comprehension., , , and . CoRR, (2021)Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization., , , , , , and . CoRR, (2023)On the Representation Collapse of Sparse Mixture of Experts., , , , , , , , , and . CoRR, (2022)Coarse-to-Fine Entity Representations for Document-level Relation Extraction., , , , and . CoRR, (2020)LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts., , , , and . AAAI, page 6810-6817. AAAI Press, (2019)