Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.

D. Dai, Y. Sun, L. Dong, Y. Hao, Z. Sui, and F. Wei. CoRR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Jing Dai

Huangdong Dai

Yangguang Dai

Min Dai

Vodang Dai

Other publications of authors with the same name

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization.X. Meng, D. Dai, W. Luo, Z. Yang, S. Wu, X. Wang, P. Wang, Q. Dong, L. Chen, and Z. Sui. CoRR, (2024)Neural Knowledge Bank for Pretrained Transformers.D. Dai, W. Jiang, Q. Dong, Y. Lyu, Q. She, and Z. Sui. CoRR, (2022)Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions.D. Dai, H. Zheng, F. Luo, P. Yang, T. Liu, Z. Sui, and B. Chang. RepL4NLP@ACL-IJCNLP, page 83-89. Association for Computational Linguistics, (2021)Large Language Models Are Unconscious of Unreasonability in Math Problems.J. Ma, D. Dai, and Z. Sui. CoRR, (2024)Live Video Comment Generation Based on Surrounding Frames and Live Comments.D. Dai. CoRR, (2018)Incorporating Connections Beyond Knowledge Embeddings: A Plug-and-Play Module to Enhance Commonsense Reasoning in Machine Reading Comprehension.D. Dai, H. Zheng, Z. Sui, and B. Chang. CoRR, (2021)Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization.S. Tong, H. Xia, D. Dai, T. Liu, B. Lin, Y. Cao, and Z. Sui. CoRR, (2023)On the Representation Collapse of Sparse Mixture of Experts.Z. Chi, L. Dong, S. Huang, D. Dai, S. Ma, B. Patra, S. Singhal, P. Bajaj, X. Song, and F. Wei. CoRR, (2022)Coarse-to-Fine Entity Representations for Document-level Relation Extraction.D. Dai, J. Ren, S. Zeng, B. Chang, and Z. Sui. CoRR, (2020)LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts.S. Ma, L. Cui, D. Dai, F. Wei, and X. Sun. AAAI, page 6810-6817. AAAI Press, (2019)

BibSonomy

Disambiguation of "Dai, Damai"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.

Please choose a person to relate this publication to

Jing Dai

Huangdong Dai

Yangguang Dai

Min Dai

Vodang Dai

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Dai, Damai"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.

Please choose a person to relate this publication to

Jing Dai

Huangdong Dai

Yangguang Dai

Min Dai

Vodang Dai

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.