Modeling human judgment of digital imagery for multimedia retrieval

Cited by: 8
Authors
Volkmer, Timo [1 ]
Thom, James A. [1 ]
Tahaghoghi, Seyed M. M. [1 ]
Affiliations
[1] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic 3001, Australia
Keywords
annotation; latent class modeling;
DOI
10.1109/TMM.2007.900153
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
The application of machine learning techniques to image and video search has been shown to boost the performance of multimedia retrieval systems, and promises to lead to more generalized semantic search approaches. In particular, the availability of large training collections allows model-driven search using a substantial number of semantic concepts. The training collections are obtained in a manual annotation process where human raters review images and assign predefined semantic concept labels. Besides being prone to human error, manual image annotation is biased by the view of the individual annotator because visual information almost always leaves room for ambiguity. Ideally, several independent judgments are obtained per image, and the inter-rater agreement is assessed. While disagreement between ratings bears valuable information on the annotation quality, it complicates the task of clearly classifying rated images based on multiple judgments. In the absence of a gold standard, evaluating multiple judgments and resolving disagreement between raters is not trivial. In this paper, we present an approach using latent structure analysis to solve this problem. We apply latent class modeling to the annotation data collected during the TRECVID 2005 Annotation Forum, and demonstrate how to use this statistic to clearly classify each image on the basis of varying numbers of ratings. We use latent class modeling to quantify the annotation quality and discuss the results in comparison with the well-known Kappa inter-rater agreement measure.
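The idea of classifying each image from a varying number of ratings can be illustrated with a minimal sketch of a two-class latent class model fitted by expectation-maximization. This is not the authors' implementation: it assumes binary labels, exactly two latent classes, and conditional independence of raters given the class. The function name `fit_latent_class`, the constant `EPS`, and the toy data in `example_ratings` are hypothetical and chosen only for illustration; `None` marks an image that a rater did not judge.

```python
from typing import List, Optional, Tuple

EPS = 1e-6  # assumption: small constant that keeps probabilities away from 0 and 1


def _clamp(x: float) -> float:
    """Keep a probability strictly inside (0, 1)."""
    return min(max(x, EPS), 1.0 - EPS)


def fit_latent_class(
    ratings: List[List[Optional[int]]], n_iter: int = 50
) -> Tuple[float, List[List[float]], List[float]]:
    """Fit a two-class latent class model to binary ratings with EM.

    ratings[i][j] is rater j's label (0 or 1) for image i, or None if missing.
    Returns (prior, p_pos, post) where prior = P(class 1),
    p_pos[j][c] = P(rater j says 1 | class c), and post[i] = P(class 1 | ratings of image i).
    """
    n_raters = len(ratings[0])

    # Initialise the posteriors with the fraction of positive votes per image.
    post = []
    for row in ratings:
        seen = [r for r in row if r is not None]
        post.append(sum(seen) / len(seen) if seen else 0.5)

    prior = 0.5
    p_pos = [[0.5, 0.5] for _ in range(n_raters)]

    for _ in range(n_iter):
        # M-step: re-estimate the class prior and each rater's response rates.
        prior = _clamp(sum(post) / len(post))
        for j in range(n_raters):
            num = [0.0, 0.0]
            den = [0.0, 0.0]
            for w, row in zip(post, ratings):
                r = row[j]
                if r is None:
                    continue  # missing judgments contribute nothing
                for c, weight in ((0, 1.0 - w), (1, w)):
                    num[c] += weight * r
                    den[c] += weight
            p_pos[j] = [
                _clamp(num[c] / den[c]) if den[c] > 0 else 0.5 for c in (0, 1)
            ]

        # E-step: posterior class membership of each image given its ratings.
        new_post = []
        for row in ratings:
            like = [1.0 - prior, prior]
            for j, r in enumerate(row):
                if r is None:
                    continue
                for c in (0, 1):
                    like[c] *= p_pos[j][c] if r == 1 else 1.0 - p_pos[j][c]
            new_post.append(like[1] / (like[0] + like[1]))
        post = new_post

    return prior, p_pos, post


# Toy data (made up): 8 images, 3 raters, some judgments missing.
example_ratings = [
    [1, 1, 1],
    [1, 1, None],
    [1, 1, 0],
    [0, 0, 0],
    [0, 0, None],
    [0, 0, 1],
    [1, 1, 1],
    [0, 0, 0],
]

prior, p_pos, post = fit_latent_class(example_ratings)
labels = [1 if w >= 0.5 else 0 for w in post]  # hard label per image
```

Thresholding the posterior at 0.5 gives a single label per image even when raters disagree or some judgments are missing, and the estimated per-rater rates in `p_pos` give a rough view of rater reliability. Both the threshold and the two-class structure are assumptions of this sketch.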
Pages: 967-974
Number of pages: 8