Captioning Images Using Different Styles

被引:3
|
作者
Mathews, Alexander [1 ]
机构
[1] Australian Natl Univ, Canberra, ACT, Australia
关键词
caption generation; image description; object naming; sentiment;
D O I
10.1145/2733373.2807998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
I develop techniques that can be used to incorporate stylistic objectives into existing image captioning systems. Style is generally a very tricky concept to define, thus I concentrate on two specific components of style. First I develop a technique for predicting how people will name visual objects. I demonstrate that this technique could be used to generate captions with human like naming conventions. Full details are available in a recent publication [16]. Second I outline a system for generating sentences which express a strong positive or negative sentiment. Finally I present two possible future directions which are aimed at modelling style more generally. These are learning to imitate an individuals captioning style and generating a diverse set of captions for a single image.
引用
收藏
页码:665 / 668
页数:4
相关论文
共 50 条
  • [1] Vision to Language: Captioning Images using Deep Learning
    Charu, Shreyasi
    Mishra, S. P.
    Gandhi, Tapan
    [J]. 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
  • [2] Captioning Remote Sensing Images Using Transformer Architecture
    Nanal, Wrucha
    Hajiarbabi, Mohammadreza
    [J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 413 - 418
  • [3] Arabic Captioning for Images of Clothing Using Deep Learning
    Al-Malki, Rasha Saleh
    Al-Aama, Arwa Yousuf
    [J]. SENSORS, 2023, 23 (08)
  • [4] Captioning Ultrasound Images Automatically
    Alsharid, Mohammad
    Sharma, Harshita
    Drukker, Lior
    Chatelain, Pierre
    Papageorghiou, Aris T.
    Noble, J. Alison
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 338 - 346
  • [5] Captioning the Images: A Deep Analysis
    Chaudhari, Chaitrali P.
    Devane, Satish
    [J]. COMPUTING, COMMUNICATION AND SIGNAL PROCESSING, ICCASP 2018, 2019, 810 : 987 - 999
  • [6] Captioning Images with Diverse Objects
    Venugopalan, Subhashini
    Mooney, Raymond
    Hendricks, Lisa Anne
    Darrell, Trevor
    Rohrbach, Marcus
    Saenko, Kate
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1170 - 1178
  • [7] Captioning Images on Mobile Devices Using Semi-Statistical Extraction
    Castellanos, Ari Ernesto Ortiz
    Avalos, Jorge Enrique Roman
    [J]. ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 157 - 161
  • [8] Retrieved Generative Captioning for Medical Images
    Beddiar, Djamila Romaissa
    Oussalah, Mourad
    Seppanen, Tapio
    [J]. 20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 48 - 54
  • [9] Weakly Supervised Captioning of Ultrasound Images
    Alsharid, Mohammad
    Sharma, Harshita
    Drukker, Lior
    Papageorgiou, Aris T.
    Noble, J. Alison
    [J]. MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 187 - 198
  • [10] VERBATIM, STANDARD, OR EDITED? READING PATTERNS OF DIFFERENT CAPTIONING STYLES AMONG DEAF, HARD OF HEARING, AND HEARING VIEWERS
    Szarkowska, Agnieszka
    Krejtz, Izabela
    Klyszejko, Zuzanna
    Wieczorek, Anna
    [J]. AMERICAN ANNALS OF THE DEAF, 2011, 156 (04) : 363 - 378