Modeling human judgment of digital imagery for multimedia retrieval

被引:8
|
作者
Volkmer, Timo [1 ]
Thom, James A. [1 ]
Tahaghoghi, Seyed M. M. [1 ]
机构
[1] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic 3001, Australia
关键词
annotation; latent class modeling;
D O I
10.1109/TMM.2007.900153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The application of machine learning techniques to image and video search has been shown to boost the performance of multimedia retrieval systems, and promises to lead to more generalized semantic search approaches. In particular, the availability of large training collections allows model-driven search using a substantial number of semantic concepts. The training collections are obtained in a manual annotation process where human raters review images and assign predefined semantic concept labels. Besides being prone to human error, manual image annotation is biased by the view of the individual annotator because visual information almost always leaves room for ambiguity. Ideally, several independent judgments are obtained per image, and the inter-rater agreement is assessed. While disagreement between ratings bears valuable information on the annotation quality, it complicates the task of clearly classifying rated images based on multiple judgments. In the absence of a gold standard, evaluating multiple judgments and resolving disagreement between raters is not trivial. In this paper, we present an approach using latent structure analysis to solve this problem. We apply latent class modeling to the annotation data collected during the TRECVID 2005 Annotation Forum, and demonstrate how to use this statistic to clearly classify each image on the basis of varying numbers of ratings. We use latent class modeling to quantify the annotation quality and discuss the results in comparison with the well-known Kappa inter-rater agreement measure.
引用
收藏
页码:967 / 974
页数:8
相关论文
共 50 条
  • [21] Multimedia document retrieval
    Ozkarahan, Esen, 1600, Pergamon Press Inc, Tarrytown, NY, United States (31):
  • [22] Multimedia information retrieval
    Lay, JA
    Muneesawang, P
    Guan, L
    CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 619 - 624
  • [23] Multimedia Retrieval that Works
    Aygun, Ramazan S.
    Benesova, Wanda
    IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018, : 63 - 68
  • [25] Retrieval in multimedia presentations
    Augusto Celentano
    Ombretta Gaggi
    Maria Luisa Sapino
    Multimedia Systems, 2004, 10 : 72 - 82
  • [26] Multimedia retrieval algorithmics
    Veltkamp, Remco C.
    SOFSEM 2007: THEORY AND PRACTICE OF COMPUTER SCIENCE, PROCEEDINGS, 2007, 4362 : 138 - 154
  • [27] Multimedia Retrieval in and for XR
    Pegia, Maria
    Diplaris, Sotiris
    Vrochidis, Stefanos
    Schuldt, Heiko
    Spiess, Florian
    Arnold, Rahel
    Bailer, Werner
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1324 - 1325
  • [28] Empirical comparison of image retrieval color similarity methods with human judgment
    Chan, Hock Chuan
    DISPLAYS, 2008, 29 (03) : 260 - 267
  • [29] Retrieval in multimedia presentations
    Celentano, A
    Gaggi, O
    Sapino, ML
    MULTIMEDIA SYSTEMS, 2004, 10 (01) : 72 - 82
  • [30] Multimedia retrieval benchmarks
    Over, P
    Leung, C
    Ip, H
    Grubinger, M
    IEEE MULTIMEDIA, 2004, 11 (02) : 80 - 84