copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Contrastive Learning of Medical Visual Representations from Paired Images and Text

Y. Zhang, H. Jiang, Y. Miura, C. Manning, and C. Langlotz. (2020)cite arxiv:2010.00747.

Abstract

Learning visual representations of medical images is core to medical image understanding but its progress has been held back by the small size of hand-labeled datasets. Existing work commonly relies on transferring weights from ImageNet pretraining, which is suboptimal due to drastically different image characteristics, or rule-based label extraction from the textual report data paired with medical images, which is inaccurate and hard to generalize. We propose an alternative unsupervised strategy to learn medical visual representations directly from the naturally occurring pairing of images and textual data. Our method of pretraining medical image encoders with the paired text data via a bidirectional contrastive objective between the two modalities is domain-agnostic, and requires no additional expert input. We test our method by transferring our pretrained weights to 4 medical image classification tasks and 2 zero-shot retrieval tasks, and show that our method leads to image representations that considerably outperform strong baselines in most settings. Notably, in all 4 classification tasks, our method requires only 10% as much labeled training data as an ImageNet initialized counterpart to achieve better or comparable performance, demonstrating superior data efficiency.

Description

Contrastive Learning of Medical Visual Representations from Paired Images and Text

Links and resources

BibTeX key: zhang2020contrastive
entry type: misc
year: 2020
url: http://arxiv.org/abs/2010.00747
note: cite arxiv:2010.00747

Cite this publication

@misc{zhang2020contrastive, abstract = {Learning visual representations of medical images is core to medical image understanding but its progress has been held back by the small size of hand-labeled datasets. Existing work commonly relies on transferring weights from ImageNet pretraining, which is suboptimal due to drastically different image characteristics, or rule-based label extraction from the textual report data paired with medical images, which is inaccurate and hard to generalize. We propose an alternative unsupervised strategy to learn medical visual representations directly from the naturally occurring pairing of images and textual data. Our method of pretraining medical image encoders with the paired text data via a bidirectional contrastive objective between the two modalities is domain-agnostic, and requires no additional expert input. We test our method by transferring our pretrained weights to 4 medical image classification tasks and 2 zero-shot retrieval tasks, and show that our method leads to image representations that considerably outperform strong baselines in most settings. Notably, in all 4 classification tasks, our method requires only 10% as much labeled training data as an ImageNet initialized counterpart to achieve better or comparable performance, demonstrating superior data efficiency.}, added-at = {2020-10-18T16:17:07.000+0200}, author = {Zhang, Yuhao and Jiang, Hang and Miura, Yasuhide and Manning, Christopher D. and Langlotz, Curtis P.}, biburl = {https://www.bibsonomy.org/bibtex/2dd23a5be233cbae7c884108cb383baea/nosebrain}, description = {Contrastive Learning of Medical Visual Representations from Paired Images and Text}, interhash = {ad3ae0338d1fc390710e33825a584b82}, intrahash = {dd23a5be233cbae7c884108cb383baea}, keywords = {contrastive encoder image learning medical text}, note = {cite arxiv:2010.00747}, timestamp = {2020-10-18T16:17:07.000+0200}, title = {Contrastive Learning of Medical Visual Representations from Paired Images and Text}, url = {http://arxiv.org/abs/2010.00747}, year = 2020 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Contrastive Learning of Medical Visual Representations from Paired Images and Text

Abstract

Description

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Contrastive Learning of Medical Visual Representations from Paired Images and Text

Abstract

Description

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Contrastive Learning of Medical Visual Representations from Paired Images and Text

Comments and Reviews
(0)