Human-Centric Image Captioning

被引:0
|
作者
Yang, Zuopeng [1 ]
Wang, Pengbo [1 ]
Chu, Tianshu [1 ]
Yang, Jie [1 ,2 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai 200240, Peoples R China
[3] MOE Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
关键词
Human-centric; Image captioning; Feature hierarchization;
D O I
10.1016/j.patcog.2022.108545
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new topic, Human-Centric Captioning, to mainly describe the human behavior in an image. Human activities and relationships are the primary objectives of visual understanding in daily applications. However, existing image captioning systems cannot differently treat humans and other objects, which limits the ability to understand and describe diverse human activities. As the first explorer of this new task, we build a novel Human-Centric COCO dataset concentrating on humans. Accordingly, we propose a novel Human-Centric Captioning Model (HCCM) that focuses on human-centric feature hierarchization and sentence generation. Specifically, our model first utilizes human body part level knowledge to hierarchize the image features and then applies a novel three-branch captioning model to process these hierarchical features independently to calibrate the descriptions of human actions. Comprehensive experiments demonstrate that our HCCM achieves the state-of-the-art performance with BLEU-4, CIDEr and SPICE scores of 41.5, 127.3, 23.5 respectively. Dataset and code are publicly available at https://github.com/JohnDreamer/HCCM/. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Mind the gap: Modelling the human in human-centric computing
    Fitzpatrick, Geraldine
    [J]. 2018 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING (VL/HCC), 2018, : 3 - 3
  • [42] Human-centric Image Rendering for Natural and Comfortable Viewing—Image Optimization Based on Human Visual Information Processing Models
    Fukiage, Taiki
    [J]. NTT Technical Review, 2024, 22 (11): : 29 - 35
  • [43] Human-Centric AI: The Symbiosis of Human and Artificial Intelligence
    Horvatic, Davor
    Lipic, Tomislav
    [J]. ENTROPY, 2021, 23 (03)
  • [44] Human-centric approach for human-robot interaction
    Narumi, M
    Imai, M
    [J]. PRICAI 2004: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3157 : 993 - 994
  • [45] Introduction of human-centric AI assistant to aid radiologists for multimodal breast image classification
    Calisto, Francisco Maria
    Santiago, Carlos
    Nunes, Nuno
    Nascimento, Jacinto C.
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2021, 150
  • [46] A unified efficient deep image compression framework and its application on human-centric Task
    Chen, Xueyuan
    Hu, Zhihao
    Lu, Guo
    Liu, Jiaheng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (29) : 73407 - 73425
  • [47] An Image Authentication Method for Secure Internet-Based Communication in Human-Centric Computing
    Chen, Yung-Yao
    Hsia, Chih-Hsien
    Kao, Hsiang-Yun
    Wang, You-An
    Hu, Yu-Chen
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (07): : 1893 - 1903
  • [48] Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
    Baldrati, Alberto
    Morelli, Davide
    Cartella, Giuseppe
    Cornia, Marcella
    Bertini, Marco
    Cucchiara, Rita
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23336 - 23345
  • [49] Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features
    Zhang, Bo
    Niu, Li
    Zhao, Xing
    Zhang, Liqing
    [J]. COMPUTER VISION, ECCV 2022, PT VII, 2022, 13667 : 181 - 197
  • [50] Human-Centric Technology Based on Orange Computing
    Wang, Hung-Yi
    Chen, Bo-Wei
    Bharanitharan, K.
    Wu, Jaw-Shyang
    Tseng, Shih-Pang
    Wang, Jhing-Fa
    [J]. 1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 250 - 251