Modeling human judgment of digital imagery for multimedia retrieval

Cited by: 8
Authors
Volkmer, Timo [1]
Thom, James A. [1]
Tahaghoghi, Seyed M. M. [1]
Affiliation
[1] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic 3001, Australia
Keywords
annotation; latent class modeling
DOI
10.1109/TMM.2007.900153
Chinese Library Classification (CLC) code
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The application of machine learning techniques to image and video search has been shown to boost the performance of multimedia retrieval systems, and promises to lead to more generalized semantic search approaches. In particular, the availability of large training collections allows model-driven search using a substantial number of semantic concepts. The training collections are obtained in a manual annotation process where human raters review images and assign predefined semantic concept labels. Besides being prone to human error, manual image annotation is biased by the view of the individual annotator because visual information almost always leaves room for ambiguity. Ideally, several independent judgments are obtained per image, and the inter-rater agreement is assessed. While disagreement between ratings bears valuable information on the annotation quality, it complicates the task of clearly classifying rated images based on multiple judgments. In the absence of a gold standard, evaluating multiple judgments and resolving disagreement between raters is not trivial. In this paper, we present an approach using latent structure analysis to solve this problem. We apply latent class modeling to the annotation data collected during the TRECVID 2005 Annotation Forum, and demonstrate how to use this statistic to clearly classify each image on the basis of varying numbers of ratings. We use latent class modeling to quantify the annotation quality and discuss the results in comparison with the well-known Kappa inter-rater agreement measure.
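The idea of classifying each image from a varying number of ratings can be illustrated with a minimal sketch. It assumes a two-class latent class model for binary concept labels, fitted with expectation-maximization under the usual local-independence assumption (raters err independently given the true class). This is not the authors' implementation: the function name `fit_latent_class`, the smoothing constant `eps`, the starting values, and the toy ratings are all hypothetical and chosen only for illustration.

```python
def fit_latent_class(ratings, n_iter=100, eps=0.01):
    """Fit a two-class latent class model to binary ratings with EM.

    ratings: one list per image; each entry is 1 (concept present),
             0 (concept absent), or None (rater did not judge the image).
    Returns (prevalence, p_pos, posteriors), where
      prevalence    = estimated P(concept truly present),
      p_pos[c][r]   = estimated P(rater r says 1 | latent class c),
      posteriors[i] = P(concept truly present | ratings of image i).
    """
    n_raters = len(ratings[0])
    prevalence = 0.5
    # Asymmetric start (assumption): pins class 1 to "concept present"
    # so the two latent classes cannot swap labels.
    p_pos = [[0.2] * n_raters, [0.8] * n_raters]
    posteriors = [0.5] * len(ratings)

    for _ in range(n_iter):
        # E-step: posterior class membership of each image.
        posteriors = []
        for row in ratings:
            like = [1.0, 1.0]
            for c in (0, 1):
                for r, x in enumerate(row):
                    if x is not None:  # missing ratings are skipped
                        like[c] *= p_pos[c][r] if x == 1 else 1.0 - p_pos[c][r]
            num = prevalence * like[1]
            posteriors.append(num / (num + (1.0 - prevalence) * like[0]))

        # M-step: re-estimate prevalence and per-rater response rates.
        prevalence = sum(posteriors) / len(posteriors)
        for c in (0, 1):
            weights = [p if c == 1 else 1.0 - p for p in posteriors]
            for r in range(n_raters):
                said_yes = sum(w for w, row in zip(weights, ratings) if row[r] == 1)
                judged = sum(w for w, row in zip(weights, ratings) if row[r] is not None)
                # eps keeps every probability strictly between 0 and 1.
                p_pos[c][r] = (said_yes + eps) / (judged + 2.0 * eps)

    return prevalence, p_pos, posteriors


# Made-up ratings from three hypothetical raters; None marks a missing judgment.
ratings = [
    [1, 1, 1],
    [1, 1, None],
    [1, 0, 1],
    [0, 0, 0],
    [0, None, 0],
    [0, 0, 1],
    [1, 1, 1],
    [0, 0, 0],
]
prevalence, p_pos, posteriors = fit_latent_class(ratings)
labels = [p > 0.5 for p in posteriors]  # hard classification per image
```

Thresholding the posterior at 0.5 gives a single label per image even when raters disagree or some ratings are missing, and the fitted `p_pos` values give a per-rater view of annotation quality that a single agreement coefficient such as Kappa does not provide.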
Pages: 967-974
Number of pages: 8