Other publications by persons with the same name

Mesh-tensorflow: Deep learning for supercomputers, , , , , , , , , and 1 other author(s). Advances in Neural Information Processing Systems, pp. 10435-10444. (2018)
GSPMD: General and Scalable Parallelization for ML Computation Graphs., , , , , , , , , and 6 other author(s). CoRR, (2021)
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost., and . ICML, volume 80 of Proceedings of Machine Learning Research, pp. 4603-4611. PMLR, (2018)
Music Transformer: Generating Music with Long-Term Structure, , , , , , , , , and . 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019, OpenReview.net, (2019)
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding., , , , , , , , and . ICLR, OpenReview.net, (2021)
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation., , , , , , , , , and 6 other author(s). ACL (1), pp. 76-86. Association for Computational Linguistics, (2018)
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity., , and . J. Mach. Learn. Res., (2022)
Searching for Efficient Transformers for Language Modeling., , , , , and . NeurIPS, pp. 6010-6022. (2021)
Blockwise Parallel Decoding for Deep Autoregressive Models., , and . NeurIPS, pp. 10107-10116. (2018)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer, , , , , , , , and . (October 2019) arXiv:1910.10683.