Author of the publication

Brainformers: Trading Simplicity for Efficiency.

, , , , , , , , , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 42531-42542. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Transferable Graph Optimizers for ML Compilers., , , , , , , , , and 2 other author(s). NeurIPS, (2020)An Image Compression Encryption Algorithm Based on Chaos and ZUC Stream Cipher., , , and . Entropy, 24 (5): 742 (2022)Brainformers: Trading Simplicity for Efficiency., , , , , , , , , and 5 other author(s). ICML, volume 202 of Proceedings of Machine Learning Research, page 42531-42542. PMLR, (2023)Atomic In-place Updates for Non-volatile Main Memories with Kamino-Tx., , , , , , and . EuroSys, page 499-512. ACM, (2017)Piton: A 25-core academic manycore research processor., , , , , , , , and . Hot Chips Symposium, page 1-38. IEEE, (2016)Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference., , , , , , , , , and 2 other author(s). NeurIPS, (2023)Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models., , , , , , , , , and 10 other author(s). ICLR, OpenReview.net, (2024)Learning Large Graph Property Prediction via Graph Segment Training., , , , , , , and . CoRR, (2023)GLaM: Efficient Scaling of Language Models with Mixture-of-Experts., , , , , , , , , and 17 other author(s). ICML, volume 162 of Proceedings of Machine Learning Research, page 5547-5569. PMLR, (2022)Power and Energy Characterization of an Open Source 25-Core Manycore Processor., , , , , , , , , and . HPCA, page 762-775. IEEE Computer Society, (2018)