Human-Centric Image Captioning

Authors
Yang, Zuopeng [1]
Wang, Pengbo [1]
Chu, Tianshu [1]
Yang, Jie [1, 2, 3]
Affiliations
[1] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai 200240, Peoples R China
[3] MOE Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
Keywords
Human-centric; Image captioning; Feature hierarchization
DOI
10.1016/j.patcog.2022.108545
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a new topic, Human-Centric Captioning, which focuses on describing human behavior in an image. Human activities and relationships are primary objectives of visual understanding in daily applications. However, existing image captioning systems do not treat humans differently from other objects, which limits their ability to understand and describe diverse human activities. As the first to explore this new task, we build a novel Human-Centric COCO dataset that concentrates on humans. Accordingly, we propose a novel Human-Centric Captioning Model (HCCM) that focuses on human-centric feature hierarchization and sentence generation. Specifically, our model first uses human body-part-level knowledge to hierarchize the image features and then applies a novel three-branch captioning model that processes these hierarchical features independently to calibrate the descriptions of human actions. Comprehensive experiments demonstrate that HCCM achieves state-of-the-art performance, with BLEU-4, CIDEr, and SPICE scores of 41.5, 127.3, and 23.5, respectively. The dataset and code are publicly available at https://github.com/JohnDreamer/HCCM/. (c) 2022 Elsevier Ltd. All rights reserved.
Pages: 11
Source
Pattern Recognition, 2022, 126