Modeling human judgment of digital imagery for multimedia retrieval

被引：8

作者：

Volkmer, Timo ^{[1
]}

Thom, James A. ^{[1
]}

Tahaghoghi, Seyed M. M. ^{[1
]}

机构：

[1] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic 3001, Australia

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2007年 / 9卷 / 05期

关键词：

annotation; latent class modeling;

D O I：

10.1109/TMM.2007.900153

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The application of machine learning techniques to image and video search has been shown to boost the performance of multimedia retrieval systems, and promises to lead to more generalized semantic search approaches. In particular, the availability of large training collections allows model-driven search using a substantial number of semantic concepts. The training collections are obtained in a manual annotation process where human raters review images and assign predefined semantic concept labels. Besides being prone to human error, manual image annotation is biased by the view of the individual annotator because visual information almost always leaves room for ambiguity. Ideally, several independent judgments are obtained per image, and the inter-rater agreement is assessed. While disagreement between ratings bears valuable information on the annotation quality, it complicates the task of clearly classifying rated images based on multiple judgments. In the absence of a gold standard, evaluating multiple judgments and resolving disagreement between raters is not trivial. In this paper, we present an approach using latent structure analysis to solve this problem. We apply latent class modeling to the annotation data collected during the TRECVID 2005 Annotation Forum, and demonstrate how to use this statistic to clearly classify each image on the basis of varying numbers of ratings. We use latent class modeling to quantify the annotation quality and discuss the results in comparison with the well-known Kappa inter-rater agreement measure.

引用

页码：967 / 974

页数：8

共 50 条

[21] Multimedia document retrieval
Ozkarahan, Esen, 1600, Pergamon Press Inc, Tarrytown, NY, United States (31):
[22] Multimedia information retrieval
Lay, JA
Muneesawang, P
Guan, L
CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 619 - 624
[23] Multimedia Retrieval that Works
Aygun, Ramazan S.
Benesova, Wanda
IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018, : 63 - 68
[24] Multimedia Retrieval that Matters
Hanjalic, Alan
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2013, 9 (01)
[25] Retrieval in multimedia presentations
Augusto Celentano
Ombretta Gaggi
Maria Luisa Sapino
Multimedia Systems, 2004, 10 : 72 - 82
[26] Multimedia retrieval algorithmics
Veltkamp, Remco C.
SOFSEM 2007: THEORY AND PRACTICE OF COMPUTER SCIENCE, PROCEEDINGS, 2007, 4362 : 138 - 154
[27] Multimedia Retrieval in and for XR
Pegia, Maria
Diplaris, Sotiris
Vrochidis, Stefanos
Schuldt, Heiko
Spiess, Florian
Arnold, Rahel
Bailer, Werner
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1324 - 1325
[28] Empirical comparison of image retrieval color similarity methods with human judgment
Chan, Hock Chuan
DISPLAYS, 2008, 29 (03) : 260 - 267
[29] Retrieval in multimedia presentations
Celentano, A
Gaggi, O
Sapino, ML
MULTIMEDIA SYSTEMS, 2004, 10 (01) : 72 - 82
[30] Multimedia retrieval benchmarks
Over, P
Leung, C
Ip, H
Grubinger, M
IEEE MULTIMEDIA, 2004, 11 (02) : 80 - 84

← 1 2 3 4 5 →