
Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models.

, , , , , and . ECCV (23), volume 13683 of Lecture Notes in Computer Science, page 389-405. Springer, (2022)


Other publications of authors with the same name

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining., , , , , , , , , and 2 other author(s). CoRR, (2020)

XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual Knowledge., , , and . AAAI, page 10840-10848. AAAI Press, (2022)

Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation., , , , , , , , and . EMNLP (Findings), page 118-129. Association for Computational Linguistics, (2022)

Large-scale L-BFGS using MapReduce., , and . NIPS, page 1332-1340. (2014)

Reasoning Like Program Executors., , , , , , , , and . EMNLP, page 761-779. Association for Computational Linguistics, (2022)

Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models., , , , , and . ECCV (23), volume 13683 of Lecture Notes in Computer Science, page 389-405. Springer, (2022)

Transfer Understanding from Head Queries to Tail Queries., , , and . CIKM, page 1299-1308. ACM, (2014)

MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation., , , , , and . NAACL-HLT, page 1610-1623. Association for Computational Linguistics, (2022)

OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering., , , , and . NAACL-HLT, page 932-942. Association for Computational Linguistics, (2022)

Adversarial Retriever-Ranker for Dense Text Retrieval., , , , , and . ICLR, OpenReview.net, (2022)