Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data

被引:0
|
作者
Carmichael, James [1 ]
Larson, Martha [2 ]
Marlow, Jennifer [1 ]
Newman, Eamonn [3 ]
Clough, Paul [1 ]
Oomen, Johan [4 ]
Sav, Sorin [3 ]
机构
[1] Univ Sheffield, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Amsterdam, NL-1012 WX Amsterdam, Netherlands
[3] Dublin City Univ, Dublin 9, Ireland
[4] Netherlands Inst Sound & Vis, Hilversum, Netherlands
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a multimedia multimodal information access sub-system (MIAS) for digital audio-visual documents, typically presented in streaming media format. The system is designed to provide both professional and general users with entry points into video documents that are relevant to their information needs. In this work, we focus on the information needs of multimedia specialists at a Dutch cultural heritage institution with a large multimedia archive. A quantitative and qualitative assessment is made of the efficiency of search operations using our multimodal system and it is demonstrated that MIAS significantly facilitates information retrieval operations when searching within a video document.
引用
收藏
页码:77 / +
页数:2
相关论文
共 50 条
  • [1] DIGITAL AUDIO-VISUAL ARCHIVAL DOCUMENTS
    Cote-Lapointe, Simon
    DOCUMENTATION ET BIBLIOTHEQUES, 2019, 65 (03): : 39 - 57
  • [2] Large-Scale Processing, Indexing and Search System for Czech Audio-Visual Cultural Heritage Archives
    Nouza, Jan
    Blavka, Karel
    Zdansky, Jindrich
    Cerva, Petr
    Silovsky, Jan
    Bohac, Marek
    Chaloupka, Josef
    Kucharova, Michaela
    Seps, Ladislav
    2012 IEEE 14TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2012, : 337 - 342
  • [3] The indexing of persons in news sequences using audio-visual data
    Albiol, A
    Torres, L
    Delp, EJ
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 137 - 140
  • [4] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition
    Su, Rongfeng
    Wang, Lan
    Liu, Xunying
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
  • [5] Acceptance of online audio-visual cultural heritage archive services: a study of the general public
    Ongena, Guido
    van de Wijngaert, Lidwien
    Huizer, Erik
    INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2013, 18 (02):
  • [6] Audio-visual interaction in multimodal communication
    Chellappa, R
    Chen, TH
    Katsaggelos, A
    IEEE SIGNAL PROCESSING MAGAZINE, 1997, 14 (04) : 37 - 38
  • [7] Audio-visual integration in multimodal communication
    Chen, T
    Rao, RR
    PROCEEDINGS OF THE IEEE, 1998, 86 (05) : 837 - 852
  • [8] Recognizing emotions for the audio-visual document indexing
    Le, XH
    Quénot, G
    Castelli, E
    ISCC2004: NINTH INTERNATIONAL SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 580 - 584
  • [9] Indexing audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
  • [10] Audio-visual Memory and cultural Heritage - On Media Tradition in the German Broadcasting Archive
    Bassenge, Reinhard
    Leenings, Anke
    ZEITSCHRIFT FUR BIBLIOTHEKSWESEN UND BIBLIOGRAPHIE, 2012, 59 (3-4): : 182 - 191