Image-Collection Summarization Using Scene-Graph Generation With External Knowledge

被引:0
|
作者
Phueaksri, Itthisak [1 ,2 ]
Kastner, Marc A. [3 ]
Kawanishi, Yasutomo [1 ,2 ]
Komamizu, Takahiro [1 ,4 ]
Ide, Ichiro [1 ,4 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4648601, Japan
[2] RIKEN, Informat Res & Dev & Strategy Headquarters, Guardian Robot Project, Kyoto 6190288, Japan
[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[4] Nagoya Univ, Math & Data Sci Ctr, Nagoya, Aichi 4648601, Japan
关键词
Object detection; Knowledge graphs; Semantics; Visualization; Image analysis; Market research; Image collection summarization; multiple-image summarization; semantic images summarization; scene-graph generation; scene-graph summarization; SIMILARITY; LANGUAGE;
D O I
10.1109/ACCESS.2024.3360113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Summarization tasks aim to summarize multiple pieces of information into a short description or representative information. A text summarization task summarizes textual information into a short description, whereas an image collection summarization task summarizes an image collection into images or textual representation in which the challenge is to understand the relationship between images. In recent years, scene-graph generation has shown the advantage of describing the visual contexts of a single-image, and incorporating external knowledge into the scene-graph generation model has also given effective directions for unseen single-image scene-graph generation. While external knowledge has been implemented in related work, it is still challenging to use this information efficiently for relationship estimation during the summarization. Following this trend, in this paper, we propose a novel scene-graph-based image-collection summarization model that aims to generate a summarized scene-graph of an image collection. The key idea of the proposed method is to enhance the relation predictor toward relationships between images in an image collection incorporating knowledge graphs as external knowledge for training a model. With this approach, we build an end-to-end framework that can generate a summarized scene graph of an image collection. To evaluate the proposed method, we also build an extended annotated MS-COCO dataset for this task and introduce an evaluation process that focuses on estimating the similarity between a summarized scene graph and ground-truth scene graphs. Traditional evaluation focuses on calculating precision and recall scores, which involve true positive predictions without balancing precision and recall. Meanwhile, the proposed evaluation process focuses on calculating the F-score of the similarity between a summarized scene graph and ground-truth scene graphs, which aims to balance both false positives and false negatives. Experimental results show that using external knowledge to enhance the relation predictor achieves better results than existing methods.
引用
收藏
页码:17499 / 17512
页数:14
相关论文
共 50 条
  • [21] Knowledge-Embedded Routing Network for Scene Graph Generation
    Chen, Tianshui
    Yu, Weihao
    Chen, Riquan
    Lin, Liang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6156 - 6164
  • [22] Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation
    Li, Lin
    Xiao, Jun
    Shi, Hanrong
    Wang, Wenxiao
    Shao, Jian
    Liu, An-An
    Yang, Yi
    Chen, Long
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 195 - 206
  • [23] Video Scene Graph Generation with Spatial-Temporal Knowledge
    Pu, Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
  • [24] A Novel Framework for Scene Graph Generation via Prior Knowledge
    Wang, Zhenghao
    Lian, Jing
    Li, Linhui
    Zhao, Jian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3768 - 3781
  • [25] SCENE GRAPH TO IMAGE GENERATION WITH CONTEXTUALIZED OBJECT LAYOUT REFINEMENT
    Ivgi, Maor
    Benny, Yaniv
    Ben-David, Avichai
    Berant, Jonathan
    Wolf, Lior
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2428 - 2432
  • [26] Text summarization using concept graph and BabelNet knowledge base
    Rashidghalam, Haniyeh
    Taherkhani, Mina
    Mahmoudi, Fariborz
    2016 ARTIFICIAL INTELLIGENCE AND ROBOTICS (IRANOPEN), 2016, : 115 - 119
  • [27] RSSGG_CS: Remote Sensing Image Scene Graph Generation by Fusing Contextual Information and Statistical Knowledge
    Lin, Zhiyuan
    Zhu, Feng
    Wang, Qun
    Kong, Yanzi
    Wang, Jianyu
    Huang, Liang
    Hao, Yingming
    REMOTE SENSING, 2022, 14 (13)
  • [28] Towards Captioning an Image Collection from a Combined Scene Graph Representation Approach
    Phueaksri, Itthisak
    Kastner, Marc A.
    Kawanishi, Yasutomo
    Komamizu, Takahiro
    Ide, Ichiro
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 178 - 190
  • [29] Knowledge-Based Scene Graph Generation with Visual Contextual Dependency
    Zhang, Lizong
    Yin, Haojun
    Hui, Bei
    Liu, Sijuan
    Zhang, Wei
    MATHEMATICS, 2022, 10 (14)
  • [30] Knowledge-Enhanced Context Representation for Unbiased Scene Graph Generation
    Wang, Yuanlong
    Liu, Zhenqi
    Zhang, Hu
    Li, Ru
    WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 248 - 263