Paraphrase Acquisition from Image Captions

被引:0
|
作者
Gohsen, Marcel [1 ]
Hagen, Matthias [2 ]
Potthast, Martin [3 ,4 ]
Stein, Benno [1 ]
机构
[1] Bauhaus Univ Weimar, Weimar, Germany
[2] Friedrich Schiller Univ Jena, Jena, Germany
[3] Univ Leipzig, Leipzig, Germany
[4] ScaDS AI, Leipzig, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose to use image captions from the Web as a previously underutilized resource for paraphrases (i.e., texts with the same "message") and to create and analyze a corresponding dataset. When an image is reused on the Web, an original caption is often assigned. We hypothesize that different captions for the same image naturally form a set of mutual paraphrases. To demonstrate the suitability of this idea, we analyze captions in the English Wikipedia, where editors frequently relabel the same image for different articles. The paper introduces the underlying mining technology, the resulting Wikipedia-IPC dataset, and compares known paraphrase corpora with respect to their syntactic and semantic paraphrase similarity to our new resource. In this context, we introduce characteristic maps along the two similarity dimensions to identify the style of paraphrases coming from different sources. An annotation study demonstrates the high reliability of the algorithmically determined characteristic maps.
引用
收藏
页码:3348 / 3358
页数:11
相关论文
共 50 条
  • [31] Towards Generating and Evaluating Iconographic Image Captions of Artworks
    Cetinic, Eva
    [J]. JOURNAL OF IMAGING, 2021, 7 (08)
  • [32] Event Recognition Based on Classification of Generated Image Captions
    Savchenko, Andrey, V
    Miasnikov, Evgeniy, V
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XVIII, IDA 2020, 2020, 12080 : 418 - 430
  • [33] The image of the Jew in Flavius Josephus's 'Paraphrase of the Bible'
    Perelmuter, HG
    [J]. CATHOLIC BIBLICAL QUARTERLY, 2000, 62 (01): : 164 - 165
  • [34] The image of the Jew in Flavius Josephus' paraphrase of the Bible.
    Nodet, E
    [J]. REVUE BIBLIQUE, 2004, 111 (04) : 626 - 630
  • [35] Generating Diverse and Descriptive Image Captions Using Visual Paraphrases
    Liu, Lixin
    Tang, Jiajun
    Wan, Xiaojun
    Guo, Zongming
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4239 - 4248
  • [36] Guiding image captioning models toward more specific captions
    Kornblith, Simon
    Li, Lala
    Wang, Zirui
    Nguyen, Thao
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15213 - 15223
  • [37] Representing Image Captions as Concept Graphs using Semantic Information
    Ghosh, Swarnendu
    Das, Nibaran
    Goncalves, Teresa
    Quaresma, Paulo
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 162 - 167
  • [38] Semantic space captioner: generating image captions step by step
    Zhu, Chenhao
    Ye, Xia
    Lu, Qiduo
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [39] Material appearance acquisition from a single image
    Zhang, Xu
    Cui, Shulin
    Cui, Hanwen
    Yang, Lin
    Wu, Tao
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON ELECTRONICS AND INFORMATION ENGINEERING, 2017, 10322
  • [40] Towards Generating Stylized Image Captions via Adversarial Training
    Nezami, Omid Mohamad
    Dras, Mark
    Wan, Stephen
    Paris, Cecile
    Hamey, Len
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 270 - 284