Image-Collection Summarization Using Scene-Graph Generation With External Knowledge

被引：0

作者：

Phueaksri, Itthisak ^{[1
,2
]}

Kastner, Marc A. ^{[3
]}

Kawanishi, Yasutomo ^{[1
,2
]}

Komamizu, Takahiro ^{[1
,4
]}

Ide, Ichiro ^{[1
,4
]}

机构：

[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4648601, Japan

[2] RIKEN, Informat Res & Dev & Strategy Headquarters, Guardian Robot Project, Kyoto 6190288, Japan

[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan

[4] Nagoya Univ, Math & Data Sci Ctr, Nagoya, Aichi 4648601, Japan

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Object detection; Knowledge graphs; Semantics; Visualization; Image analysis; Market research; Image collection summarization; multiple-image summarization; semantic images summarization; scene-graph generation; scene-graph summarization; SIMILARITY; LANGUAGE;

D O I：

10.1109/ACCESS.2024.3360113

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Summarization tasks aim to summarize multiple pieces of information into a short description or representative information. A text summarization task summarizes textual information into a short description, whereas an image collection summarization task summarizes an image collection into images or textual representation in which the challenge is to understand the relationship between images. In recent years, scene-graph generation has shown the advantage of describing the visual contexts of a single-image, and incorporating external knowledge into the scene-graph generation model has also given effective directions for unseen single-image scene-graph generation. While external knowledge has been implemented in related work, it is still challenging to use this information efficiently for relationship estimation during the summarization. Following this trend, in this paper, we propose a novel scene-graph-based image-collection summarization model that aims to generate a summarized scene-graph of an image collection. The key idea of the proposed method is to enhance the relation predictor toward relationships between images in an image collection incorporating knowledge graphs as external knowledge for training a model. With this approach, we build an end-to-end framework that can generate a summarized scene graph of an image collection. To evaluate the proposed method, we also build an extended annotated MS-COCO dataset for this task and introduce an evaluation process that focuses on estimating the similarity between a summarized scene graph and ground-truth scene graphs. Traditional evaluation focuses on calculating precision and recall scores, which involve true positive predictions without balancing precision and recall. Meanwhile, the proposed evaluation process focuses on calculating the F-score of the similarity between a summarized scene graph and ground-truth scene graphs, which aims to balance both false positives and false negatives. Experimental results show that using external knowledge to enhance the relation predictor achieves better results than existing methods.

引用

页码：17499 / 17512

页数：14

共 50 条

[1] Scene Graph Generation with External Knowledge and Image Reconstruction
Gu, Jiuxiang
Zhao, Handong
Lin, Zhe
Li, Sheng
Cai, Jianfei
Ling, Mingyang
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1969 - 1978
[2] Enriching Scene-Graph Generation with Prior Knowledge from Work Instruction
Jesko, Zoltan
Tuan-Anh
Halaszl, Gergely
Abonyi, Janos
Ruppert, Minas
ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS-PRODUCTION MANAGEMENT SYSTEMS FOR VOLATILE, UNCERTAIN, COMPLEX, AND AMBIGUOUS ENVIRONMENTS, PT II, APMS 2024, 2024, 729 : 290 - 302
[3] An Approach to Generate a Caption for an Image Collection Using Scene Graph Generation
Phueaksri, Itthisak
Kastner, Marc A.
Kawanishi, Yasutomo
Komamizu, Takahiro
Ide, Ichiro
IEEE ACCESS, 2023, 11 : 128245 - 128260
[4] Urban scene representation and summarization using knowledge graph
Pandey, Sourav
Patel, Ashish Singh
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
[5] Image Captioning with Scene-graph Based Semantic Concepts
Gao, Lizhao
Wang, Bo
Wang, Wenmin
PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING (ICMLC 2018), 2018, : 225 - 229
[6] Guide and interact: scene-graph based generation and control of video captions
Lu, Xuyang
Gao, Yang
MULTIMEDIA SYSTEMS, 2023, 29 (02) : 797 - 809
[7] Guide and interact: scene-graph based generation and control of video captions
Xuyang Lu
Yang Gao
Multimedia Systems, 2023, 29 : 797 - 809
[8] A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval
Manh-Duy Nguyen
Binh T Nguyen
Cathal Gurrin
NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2021, 337 : 510 - 523
[9] Incorporating External Knowledge into Unsupervised Graph Model for Document Summarization
Tang, Tiancheng
Yuan, Tianyi
Tang, Xinhuai
Chen, Delai
ELECTRONICS, 2020, 9 (09) : 1 - 13
[10] An automated image-collection system for crystallization experiments using SBS standard microplates
Brostromer, Erik
Nan, Jie
Su, Xiao-Dong
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2007, 63 : 119 - 125

← 1 2 3 4 5 →