Author of the publication

Brainformers: Trading Simplicity for Efficiency.

, , , , , , , , , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 42531-42542. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Flow Contrastive Estimation of Energy-Based Models., , , , , and . CoRR, (2019)Training Socially Aligned Language Models in Simulated Human Society., , , , , , , and . CoRR, (2023)Flow Contrastive Estimation of Energy-Based Models., , , , , and . CVPR, page 7515-7525. Computer Vision Foundation / IEEE, (2020)Semi-supervised Sequence Learning., and . NIPS, page 3079-3087. (2015)Finetuned Language Models are Zero-Shot Learners., , , , , , , , and . ICLR, OpenReview.net, (2022)GLaM: Efficient Scaling of Language Models with Mixture-of-Experts., , , , , , , , , and 17 other author(s). ICML, volume 162 of Proceedings of Machine Learning Research, page 5547-5569. PMLR, (2022)AirDialogue: An Environment for Goal-Oriented Dialogue Research., , , and . EMNLP, page 3844-3854. Association for Computational Linguistics, (2018)Adversarial Training Methods for Semi-Supervised Text Classification, , and . (2016)cite arxiv:1605.07725Comment: Published as a conference paper at ICLR 2017.Learning Longer-term Dependencies in RNNs with Auxiliary Losses., , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 4972-4981. PMLR, (2018)Explaining an increase in predicted risk for clinical alerts., , , , , , , and . CHIL, page 80-89. ACM, (2020)