From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Other publications of persons with the same name

meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting. ICML, volume 70 of Proceedings of Machine Learning Research, pp. 3299-3308. PMLR, (2017)

LongNet: Scaling Transformers to 1,000,000,000 Tokens. (2023). arXiv:2307.02486. Comment: Work in progress.

DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders. CoRR, (2021)

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation. IEEE ACM Trans. Audio Speech Lang. Process., (2023)

BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation. NAACL-HLT, pp. 1550-1565. Association for Computational Linguistics, (2022)

Multimodal Matching Transformer for Live Commenting. ECAI, volume 325 of Frontiers in Artificial Intelligence and Applications, pp. 1998-2005. IOS Press, (2020)

Accelerating Graph-Based Dependency Parsing with Lock-Free Parallel Perceptron. NLPCC (1), volume 11108 of Lecture Notes in Computer Science, pp. 260-268. Springer, (2018)

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated. CoRR, (2024)

mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs. EMNLP (1), pp. 1671-1683. Association for Computational Linguistics, (2021)

A Length-Extrapolatable Transformer. ACL (1), pp. 14590-14604. Association for Computational Linguistics, (2023)