Standard image captioning tasks such as COCO and Flickr30k are factual,
neutral in tone and (to a human) state the obvious (e.g., "a man playing a
guitar"). While such tasks are useful to verify that a machine understands the
content of an image, they are not engaging to humans as captions. With this in
mind we define a new task, Personality-Captions, where the goal is to be as
engaging to humans as possible by incorporating controllable style and
personality traits. We collect and release a large dataset of 201,858 of such
captions conditioned over 215 possible traits. We build models that combine
existing work from (i) sentence representations (Mazare et al., 2018) with
Transformers trained on 1.7 billion dialogue examples; and (ii) image
representations (Mahajan et al., 2018) with ResNets trained on 3.5 billion
social media images. We obtain state-of-the-art performance on Flickr30k and
COCO, and strong performance on our new task. Finally, online evaluations
validate that our task and models are engaging to humans, with our best model
close to human performance.
%0 Generic
%1 citeulike:14649646
%A Shuster, Kurt
%A Humeau, Samuel
%A Hu, Hexiang
%A Bordes, Antoine
%A Weston, Jason
%D 2018
%K arch tags
%T Engaging Image Captioning Via Personality
%U http://arxiv.org/abs/1810.10665
%X Standard image captioning tasks such as COCO and Flickr30k are factual,
neutral in tone and (to a human) state the obvious (e.g., "a man playing a
guitar"). While such tasks are useful to verify that a machine understands the
content of an image, they are not engaging to humans as captions. With this in
mind we define a new task, Personality-Captions, where the goal is to be as
engaging to humans as possible by incorporating controllable style and
personality traits. We collect and release a large dataset of 201,858 of such
captions conditioned over 215 possible traits. We build models that combine
existing work from (i) sentence representations (Mazare et al., 2018) with
Transformers trained on 1.7 billion dialogue examples; and (ii) image
representations (Mahajan et al., 2018) with ResNets trained on 3.5 billion
social media images. We obtain state-of-the-art performance on Flickr30k and
COCO, and strong performance on our new task. Finally, online evaluations
validate that our task and models are engaging to humans, with our best model
close to human performance.
@misc{citeulike:14649646,
  abstract                = {Standard image captioning tasks such as COCO and Flickr30k are factual,
neutral in tone and (to a human) state the obvious (e.g., ``a man playing a
guitar''). While such tasks are useful to verify that a machine understands the
content of an image, they are not engaging to humans as captions. With this in
mind we define a new task, Personality-Captions, where the goal is to be as
engaging to humans as possible by incorporating controllable style and
personality traits. We collect and release a large dataset of 201,858 of such
captions conditioned over 215 possible traits. We build models that combine
existing work from (i) sentence representations (Mazare et al., 2018) with
Transformers trained on 1.7 billion dialogue examples; and (ii) image
representations (Mahajan et al., 2018) with ResNets trained on 3.5 billion
social media images. We obtain state-of-the-art performance on Flickr30k and
COCO, and strong performance on our new task. Finally, online evaluations
validate that our task and models are engaging to humans, with our best model
close to human performance.},
  added-at                = {2019-02-27T22:23:29.000+0100},
  archiveprefix           = {arXiv},
  author                  = {Shuster, Kurt and Humeau, Samuel and Hu, Hexiang and Bordes, Antoine and Weston, Jason},
  biburl                  = {https://www.bibsonomy.org/bibtex/20b2a615b016e264aaaf85a2b7cd20fa1/nmatsuk},
  citeulike-article-id    = {14649646},
  citeulike-linkout-0     = {http://arxiv.org/abs/1810.10665},
  citeulike-linkout-1     = {http://arxiv.org/pdf/1810.10665},
  day                     = 25,
  eprint                  = {1810.10665},
  interhash               = {76279c1d110bf5d8062948b79d29de9b},
  intrahash               = {0b2a615b016e264aaaf85a2b7cd20fa1},
  internal-note           = {author names taken from the arXiv abstract page for 1810.10665; original export had a placeholder},
  keywords                = {arch tags},
  month                   = oct,
  posted-at               = {2018-10-31 10:43:29},
  primaryclass            = {cs.CL},
  priority                = {4},
  timestamp               = {2019-02-27T22:23:29.000+0100},
  title                   = {Engaging Image Captioning Via Personality},
  url                     = {http://arxiv.org/abs/1810.10665},
  year                    = 2018
}