CIDEr: Consensus-based Image Description Evaluation

被引:0
|
作者
Vedantam, Ramakrishna [1 ]
Zitnick, C. Lawrence [2 ]
Parikh, Devi [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
[2] Microsoft Res, Redmond, WA USA
关键词
MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatically describing an image with a sentence is a long-standing challenge in computer vision and natural language processing. Due to recent progress in object detection, attribute classification, action recognition, etc., there is renewed interest in this area. However, evaluating the quality of descriptions has proven to be challenging. We propose a novel paradigm for evaluating image descriptions that uses human consensus. This paradigm consists of three main parts: a new triplet-based method of collecting human annotations to measure consensus, a new automated metric that captures consensus, and two new datasets: PASCAL-50S and ABSTRACT-50S that contain 50 sentences describing each image. Our simple metric captures human judgment of consensus better than existing metrics across sentences generated by various sources. We also evaluate five state-of-the-art image description approaches using this new protocol and provide a benchmark for future comparisons. A version of CIDEr named CIDEr-D is available as a part of MS COCO evaluation server to enable systematic evaluation and benchmarking.
引用
收藏
页码:4566 / 4575
页数:10
相关论文
共 50 条
  • [21] Development and evaluation of consensus-based sediment effect concentrations for polychlorinated biphenyls
    MacDonald, DD
    Dipinto, LM
    Field, J
    Ingersoll, CG
    Long, ER
    Swartz, RC
    ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 2000, 19 (05) : 1403 - 1413
  • [22] Consensus-based evaluation for fault isolation and on-line evolutionary regeneration
    Zhang, KN
    DeMara, RF
    Sharma, CA
    EVOLVABLE SYSTEMS: FROM BIOLOGY TO HARDWARE, 2005, 3637 : 12 - 24
  • [23] A CONSENSUS-BASED GOLD STANDARD FOR THE EVALUATION OF MASS CASUALTY TRIAGE SYSTEMS
    Lerner, E. Brooke
    McKee, Courtney H.
    Cady, Charles E.
    Cone, David C.
    Colella, M. Riccardo
    Cooper, Arthur
    Coule, Phillip L.
    Lairet, Julio R.
    Liu, J. Marc
    Pirrallo, Ronald G.
    Sasser, Scott M.
    Schwartz, Richard
    Shepherd, Greene
    Swienton, Raymond E.
    PREHOSPITAL EMERGENCY CARE, 2015, 19 (02) : 267 - 271
  • [24] A group consensus-based travel destination evaluation method with online reviews
    Jian Wu
    Qing Hong
    Mingshuo Cao
    Yujia Liu
    Hamido Fujita
    Applied Intelligence, 2022, 52 : 1306 - 1324
  • [25] Development and Evaluation of Consensus-Based Sediment Quality Guidelines for Freshwater Ecosystems
    D. D. MacDonald
    C. G. Ingersoll
    T. A. Berger
    Archives of Environmental Contamination and Toxicology, 2000, 39 : 20 - 31
  • [26] CONSENSUS-BASED POLICY SOLUTIONS FOR MEDICINES ADHERENCE FOR EUROPE: DEVELOPMENT AND EVALUATION
    Clyne, W.
    White, S.
    McLachlan, S.
    VALUE IN HEALTH, 2012, 15 (04) : A14 - A14
  • [27] ASSESSMENT OF A CONSENSUS-BASED MULTIPLE INFORMATION SOURCE JOB EVALUATION SYSTEM
    SCHWAB, DP
    HENEMAN, HG
    JOURNAL OF APPLIED PSYCHOLOGY, 1986, 71 (02) : 354 - 356
  • [28] Development and evaluation of consensus-based sediment quality guidelines for freshwater ecosystems
    MacDonald, DD
    Ingersoll, CG
    Berger, TA
    ARCHIVES OF ENVIRONMENTAL CONTAMINATION AND TOXICOLOGY, 2000, 39 (01) : 20 - 31
  • [29] Consensus-Based Linear and Nonlinear Filtering
    Battistelli, G.
    Chisci, L.
    Mugnai, G.
    Farina, A.
    Graziano, A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (05) : 1410 - 1415
  • [30] Consensus-based optimisation with truncated noise
    Fornasier, Massimo
    Richtarik, Peter
    Riedl, Konstantin
    Sun, Lukang
    EUROPEAN JOURNAL OF APPLIED MATHEMATICS, 2025, 36 (02) : 292 - 315