Captioning Images Using Different Styles

Cited by: 3

Authors:

Mathews, Alexander [1]

Affiliations:

[1] Australian Natl Univ, Canberra, ACT, Australia

Source:

MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE | 2015

Keywords:

caption generation; image description; object naming; sentiment

DOI:

10.1145/2733373.2807998

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

I develop techniques for incorporating stylistic objectives into existing image captioning systems. Style is difficult to define in general, so I concentrate on two specific components of style. First, I develop a technique for predicting how people will name visual objects, and I demonstrate that this technique could be used to generate captions with human-like naming conventions. Full details are available in a recent publication [16]. Second, I outline a system for generating sentences that express a strong positive or negative sentiment. Finally, I present two possible future directions aimed at modelling style more generally: learning to imitate an individual's captioning style, and generating a diverse set of captions for a single image.


Pages: 665-668

Number of pages: 4

50 items in total

[1] Vision to Language: Captioning Images using Deep Learning
Charu, Shreyasi
Mishra, S. P.
Gandhi, Tapan
[J]. 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
[2] Captioning Remote Sensing Images Using Transformer Architecture
Nanal, Wrucha
Hajiarbabi, Mohammadreza
[J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 413 - 418
[3] Arabic Captioning for Images of Clothing Using Deep Learning
Al-Malki, Rasha Saleh
Al-Aama, Arwa Yousuf
[J]. SENSORS, 2023, 23 (08)
[4] Captioning Ultrasound Images Automatically
Alsharid, Mohammad
Sharma, Harshita
Drukker, Lior
Chatelain, Pierre
Papageorghiou, Aris T.
Noble, J. Alison
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 338 - 346
[5] Captioning the Images: A Deep Analysis
Chaudhari, Chaitrali P.
Devane, Satish
[J]. COMPUTING, COMMUNICATION AND SIGNAL PROCESSING, ICCASP 2018, 2019, 810 : 987 - 999
[6] Captioning Images with Diverse Objects
Venugopalan, Subhashini
Mooney, Raymond
Hendricks, Lisa Anne
Darrell, Trevor
Rohrbach, Marcus
Saenko, Kate
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1170 - 1178
[7] Captioning Images on Mobile Devices Using Semi-Statistical Extraction
Castellanos, Ari Ernesto Ortiz
Avalos, Jorge Enrique Roman
[J]. ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 157 - 161
[8] Retrieved Generative Captioning for Medical Images
Beddiar, Djamila Romaissa
Oussalah, Mourad
Seppanen, Tapio
[J]. 20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 48 - 54
[9] Weakly Supervised Captioning of Ultrasound Images
Alsharid, Mohammad
Sharma, Harshita
Drukker, Lior
Papageorgiou, Aris T.
Noble, J. Alison
[J]. MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 187 - 198
[10] VERBATIM, STANDARD, OR EDITED? READING PATTERNS OF DIFFERENT CAPTIONING STYLES AMONG DEAF, HARD OF HEARING, AND HEARING VIEWERS
Szarkowska, Agnieszka
Krejtz, Izabela
Klyszejko, Zuzanna
Wieczorek, Anna
[J]. AMERICAN ANNALS OF THE DEAF, 2011, 156 (04) : 363 - 378
