Paraphrase Acquisition from Image Captions

Cited by: 0

Authors:

Gohsen, Marcel [1]

Hagen, Matthias [2]

Potthast, Martin [3, 4]

Stein, Benno [1]

Affiliations:

[1] Bauhaus Univ Weimar, Weimar, Germany

[2] Friedrich Schiller Univ Jena, Jena, Germany

[3] Univ Leipzig, Leipzig, Germany

[4] ScaDS AI, Leipzig, Germany

Source:

17th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2023 | 2023

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose to use image captions from the Web as a previously underutilized resource for paraphrases (i.e., texts with the same "message") and to create and analyze a corresponding dataset. When an image is reused on the Web, an original caption is often assigned. We hypothesize that different captions for the same image naturally form a set of mutual paraphrases. To demonstrate the suitability of this idea, we analyze captions in the English Wikipedia, where editors frequently relabel the same image for different articles. The paper introduces the underlying mining technology, the resulting Wikipedia-IPC dataset, and compares known paraphrase corpora with respect to their syntactic and semantic paraphrase similarity to our new resource. In this context, we introduce characteristic maps along the two similarity dimensions to identify the style of paraphrases coming from different sources. An annotation study demonstrates the high reliability of the algorithmically determined characteristic maps.
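
The central hypothesis of the abstract, that different captions of the same image form a set of mutual paraphrases, can be illustrated with a minimal sketch. It is not the authors' mining pipeline: the function name `candidate_paraphrase_pairs`, the `(image_id, caption)` record layout, and the sample captions are assumptions made purely for illustration.

```python
from collections import defaultdict
from itertools import combinations


def candidate_paraphrase_pairs(records):
    """Group captions by image and pair up distinct captions of the same image.

    `records` is an iterable of (image_id, caption) tuples; this layout is a
    hypothetical stand-in for whatever a real caption extraction step yields.
    """
    captions_by_image = defaultdict(set)
    for image_id, caption in records:
        # Collapse whitespace so trivially different copies count as one caption.
        normalized = " ".join(caption.split())
        if normalized:
            captions_by_image[image_id].add(normalized)

    pairs = []
    for image_id in sorted(captions_by_image):
        captions = sorted(captions_by_image[image_id])
        # Every unordered pair of distinct captions is a paraphrase candidate.
        for first, second in combinations(captions, 2):
            pairs.append((image_id, first, second))
    return pairs


# Made-up sample data, not taken from the Wikipedia-IPC dataset.
records = [
    ("File:Example_tower.jpg", "The tower seen from the river"),
    ("File:Example_tower.jpg", "View of the tower from the riverbank"),
    ("File:Example_tower.jpg", "The tower seen from the  river"),
    ("File:Example_bridge.jpg", "A stone bridge"),
]
pairs = candidate_paraphrase_pairs(records)
```

Images with a single caption contribute no pairs, and an image with n distinct captions contributes n(n-1)/2 candidates. Scoring the candidates for syntactic and semantic similarity, as the abstract describes, would be a separate step.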


Pages: 3348-3358

Page count: 11
