
Other publications by persons with the same name

8-bit Optimizers via Block-wise Quantization. ICLR, OpenReview.net, (2022)
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models. CoRR, (2022)
Training Transformers Together. CoRR, (2022)
High Performance Natural Language Processing. EMNLP (Tutorial Abstracts), pp. 24-27. Association for Computational Linguistics, (2020)
The case for 4-bit precision: k-bit Inference Scaling Laws. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 7750-7774. PMLR, (2023)
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale. CoRR, (2022)
Training Transformers Together. NeurIPS (Competition and Demos), volume 176 of Proceedings of Machine Learning Research, pp. 335-342. PMLR, (2021)
Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model. EMNLP, pp. 15038-15061. Association for Computational Linguistics, (2023)
The case for 4-bit precision: k-bit Inference Scaling Laws. (2022) cite arxiv:2212.09720
QLoRA: Efficient Finetuning of Quantized LLMs. (2023) cite arxiv:2305.14314. Comment: Extended NeurIPS submission.