CIDEr: Consensus-based Image Description Evaluation

被引:0
|
作者
Vedantam, Ramakrishna [1 ]
Zitnick, C. Lawrence [2 ]
Parikh, Devi [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
[2] Microsoft Res, Redmond, WA USA
关键词
MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatically describing an image with a sentence is a long-standing challenge in computer vision and natural language processing. Due to recent progress in object detection, attribute classification, action recognition, etc., there is renewed interest in this area. However, evaluating the quality of descriptions has proven to be challenging. We propose a novel paradigm for evaluating image descriptions that uses human consensus. This paradigm consists of three main parts: a new triplet-based method of collecting human annotations to measure consensus, a new automated metric that captures consensus, and two new datasets: PASCAL-50S and ABSTRACT-50S that contain 50 sentences describing each image. Our simple metric captures human judgment of consensus better than existing metrics across sentences generated by various sources. We also evaluate five state-of-the-art image description approaches using this new protocol and provide a benchmark for future comparisons. A version of CIDEr named CIDEr-D is available as a part of MS COCO evaluation server to enable systematic evaluation and benchmarking.
引用
收藏
页码:4566 / 4575
页数:10
相关论文
共 50 条
  • [41] Consensus-based Optimization in Multiplex Networks
    Rodriguez-Camargo, Christian D.
    Mojica-Nava, Eduardo
    IFAC PAPERSONLINE, 2023, 56 (02): : 1217 - 1222
  • [42] ON THE COMPLEMENTARITY OF THE CONSENSUS-BASED DISORDER PREDICTION
    Peng, Zhenling
    Kurgan, Lukasz
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2012, 2012, : 176 - 187
  • [43] Consensus-based recommendations in respiratory medicine
    Wilson, Kevin C.
    EUROPEAN RESPIRATORY JOURNAL, 2020, 56 (06)
  • [44] Consensus-Based Ranking of Wikipedia Topics
    Nema, Waleed
    Tang, Yinshan
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 114 - 124
  • [45] Consensus-based table form recognition
    Nielson, HE
    Barrett, WA
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 906 - 910
  • [46] A CONSENSUS-BASED APPROACH TO PRACTICE PARAMETERS
    MEEKER, CI
    OBSTETRICS AND GYNECOLOGY, 1992, 79 (05): : 790 - 793
  • [47] A new consensus-based unemployment indicator
    Claveria, Oscar
    APPLIED ECONOMICS LETTERS, 2019, 26 (10) : 812 - 817
  • [48] A Consensus-Based Algorithm for Truck Platooning
    Saeednia, Mahnam
    Menendez, Monica
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (02) : 404 - 415
  • [49] A consensus-based approach for ontology integration
    Nguyen, Ngoc Thanh
    Rusin, Michal
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS PROCEEDINGS, 2006, : 514 - +
  • [50] A Consensus-Based Algorithm for Multi-Objective Optimization and Its Mean-Field Description
    Borghi, Giacomo
    Herty, Michael
    Pareschi, Lorenzo
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 4131 - 4136