From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Mesh-tensorflow: Deep learning for supercomputers. Advances in Neural Information Processing Systems, pp. 10435-10444. (2018)

Adafactor: Adaptive Learning Rates with Sublinear Memory Cost. ICML, volume 80 of Proceedings of Machine Learning Research, pp. 4603-4611. PMLR, (2018)

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding. CoRR, (2020)

Sparse Non-negative Matrix Language Modeling. Trans. Assoc. Comput. Linguistics, (2016)

GSPMD: General and Scalable Parallelization for ML Computation Graphs. CoRR, (2021)

Music Transformer. (2018). cite arxiv:1809.04281. Comment: Improved skewing section and accompanying figures. Previous titles are "An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation" and "Music Transformer".

Sparse non-negative matrix language modeling for geo-annotated query session data. ASRU, pp. 8-14. IEEE, (2015)

Sparse non-negative matrix language modeling for skip-grams. INTERSPEECH, pp. 1428-1432. ISCA, (2015)

NN-Grams: Unifying Neural Network and n-Gram Language Models for Speech Recognition. INTERSPEECH, pp. 3499-3503. ISCA, (2016)

Talking-Heads Attention. CoRR, (2020)