Language models pretrained on text from a wide variety of sources form the
foundation of today's NLP. In light of the success of these broad-coverage
models, we investigate whether it is still helpful to tailor a pretrained model
to the domain of a target task. We present a study across four domains
(biomedical and computer science publications, news, and reviews) and eight
classification tasks, showing that a second phase of pretraining in-domain
(domain-adaptive pretraining) leads to performance gains, under both high- and
low-resource settings. Moreover, adapting to the task's unlabeled data
(task-adaptive pretraining) improves performance even after domain-adaptive
pretraining. Finally, we show that adapting to a task corpus augmented using
simple data selection strategies is an effective alternative, especially when
resources for domain-adaptive pretraining might be unavailable. Overall, we
consistently find that multi-phase adaptive pretraining offers large gains in
task performance.
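The second-phase pretraining described above is, at its core, continued masked-language-model training of an already-pretrained encoder on unlabeled domain or task text before fine-tuning on the labeled task. The sketch below illustrates that idea with the Hugging Face transformers and datasets libraries; it is not the authors' original training setup, and the corpus path, batch size, learning rate, and epoch count are placeholder assumptions.

# Minimal sketch of domain-/task-adaptive pretraining: continue masked-LM
# training of a pretrained RoBERTa checkpoint on unlabeled in-domain text.
# "domain_corpus.txt" and all hyperparameters below are placeholder assumptions.
from datasets import load_dataset
from transformers import (AutoModelForMaskedLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForMaskedLM.from_pretrained("roberta-base")

# One document (or sentence) per line of unlabeled domain/task text.
corpus = load_dataset("text", data_files={"train": "domain_corpus.txt"})["train"]
corpus = corpus.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True,
    remove_columns=["text"],
)

# Dynamic masking, as in RoBERTa-style MLM pretraining.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="roberta-dapt",          # adapted checkpoint, later fine-tuned on the task
    per_device_train_batch_size=8,
    num_train_epochs=1,
    learning_rate=5e-5,
)
Trainer(model=model, args=args, train_dataset=corpus, data_collator=collator).train()

After this step, the adapted checkpoint would be fine-tuned on the labeled classification task; task-adaptive pretraining is the same loop run on the task's own unlabeled text.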
@misc{gururangan2020pretraining,
author = {Gururangan, Suchin and Marasović, Ana and Swayamdipta, Swabha and Lo, Kyle and Beltagy, Iz and Downey, Doug and Smith, Noah A.},
keywords = {language models pretraining},
eprint = {2004.10964},
archiveprefix = {arXiv},
note = {ACL 2020},
title = {Don't Stop Pretraining: Adapt Language Models to Domains and Tasks},
year = 2020
}