We study the problem of injecting knowledge into large pre-trained models such as BERT and RoBERTa. Existing methods typically update the original parameters of the pre-trained model when injecting knowledge, so when multiple kinds of knowledge are injected they may suffer from catastrophic forgetting. To address this, we propose K-Adapter, which keeps the original parameters of the pre-trained model fixed and supports continual knowledge infusion. Taking RoBERTa as the pre-trained model, K-Adapter attaches a neural adapter for each kind of infused knowledge, like a plug-in connected to RoBERTa. Because there is no information flow between different adapters, they can be trained efficiently in a distributed way. We inject two kinds of knowledge: factual knowledge obtained from automatically aligned text–triple pairs from Wikipedia and Wikidata, and linguistic knowledge obtained from dependency parsing. Results on three knowledge-driven tasks (six datasets in total), namely relation classification, entity typing, and question answering, demonstrate that each adapter improves performance and that combining both adapters brings further improvements. Probing experiments further show that K-Adapter captures richer factual and commonsense knowledge than RoBERTa.
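The abstract only describes the architecture at a high level. As a rough illustration of the idea, namely frozen pre-trained parameters, one independent adapter per kind of knowledge, and the adapter outputs combined with the RoBERTa representation for downstream tasks, the following minimal PyTorch sketch may help. The tapped layers, bottleneck size, and the final concatenation are illustrative assumptions, not the authors' released implementation.

import torch
import torch.nn as nn
from transformers import RobertaModel


class KnowledgeAdapter(nn.Module):
    """One adapter per kind of knowledge; adapters never exchange information.
    (Sketch: layer taps and bottleneck size are assumptions, not the paper's exact setup.)"""

    def __init__(self, hidden_size=768, bottleneck=128, tap_layers=(0, 6, 12)):
        super().__init__()
        self.tap_layers = tap_layers  # which frozen encoder layers this adapter reads
        self.blocks = nn.ModuleList([
            nn.Sequential(
                nn.Linear(hidden_size, bottleneck),
                nn.GELU(),
                nn.Linear(bottleneck, hidden_size),
            )
            for _ in tap_layers
        ])

    def forward(self, hidden_states):
        # hidden_states: tuple of per-layer outputs from the frozen encoder
        out = 0
        for block, layer in zip(self.blocks, self.tap_layers):
            # chain the adapter blocks over the tapped hidden states
            out = block(hidden_states[layer] + out)
        return out


class KAdapterSketch(nn.Module):
    """Frozen RoBERTa plus one independent adapter per knowledge kind."""

    def __init__(self, adapter_names=("factual", "linguistic")):
        super().__init__()
        self.encoder = RobertaModel.from_pretrained("roberta-base")
        for p in self.encoder.parameters():
            p.requires_grad = False  # the original pre-trained parameters stay fixed
        self.adapters = nn.ModuleDict(
            {name: KnowledgeAdapter() for name in adapter_names}
        )

    def forward(self, input_ids, attention_mask=None):
        enc = self.encoder(
            input_ids, attention_mask=attention_mask, output_hidden_states=True
        )
        # Each adapter reads the frozen hidden states independently (no
        # information flow between adapters); their outputs are concatenated
        # with the RoBERTa representation and fed to a task-specific head.
        adapter_outs = [adapter(enc.hidden_states) for adapter in self.adapters.values()]
        return torch.cat([enc.last_hidden_state, *adapter_outs], dim=-1)

Because no gradients reach the frozen encoder and the adapters never read each other's outputs, each adapter can in principle be trained separately on its own knowledge source and plugged in later without retraining the others, which is what makes the distributed training described in the abstract possible.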
Description: K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
@misc{wang2020kadapter,
added-at = {2020-05-06T10:27:29.000+0200},
author = {Wang, Ruize and Tang, Duyu and Duan, Nan and Wei, Zhongyu and Huang, Xuanjing and Ji, Jianshu and Cao, Guihong and Jiang, Daxin and Zhou, Ming},
biburl = {https://www.bibsonomy.org/bibtex/235d80999c453f2f984bbe85ba387d2ab/hotho},
description = {K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters},
interhash = {42b3154aea4b5ed945ec9c65869c2128},
intrahash = {35d80999c453f2f984bbe85ba387d2ab},
keywords = {deep forgetting knowledge learning nlp toread},
note = {cite arxiv:2002.01808},
timestamp = {2020-05-06T10:27:29.000+0200},
title = {K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters},
url = {http://arxiv.org/abs/2002.01808},
year = 2020
}