Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data

Cited by: 0
Authors
Carmichael, James [1]
Larson, Martha [2]
Marlow, Jennifer [1]
Newman, Eamonn [3]
Clough, Paul [1]
Oomen, Johan [4]
Sav, Sorin [3]
Affiliations
[1] Univ Sheffield, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Amsterdam, NL-1012 WX Amsterdam, Netherlands
[3] Dublin City Univ, Dublin 9, Ireland
[4] Netherlands Inst Sound & Vis, Hilversum, Netherlands
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
This paper describes a multimedia multimodal information access sub-system (MIAS) for digital audio-visual documents, typically presented in streaming media format. The system is designed to provide both professional and general users with entry points into video documents that are relevant to their information needs. In this work, we focus on the information needs of multimedia specialists at a Dutch cultural heritage institution with a large multimedia archive. A quantitative and qualitative assessment is made of the efficiency of search operations using our multimodal system and it is demonstrated that MIAS significantly facilitates information retrieval operations when searching within a video document.
Pages: 77 / +
Page count: 2
Related papers
50 in total
  • [21] Ethnolinguistic Audio-visual Atlas of the Cultural Food Heritage of Bacau County - Elements of methodology
    Savin, Petronela
    Trandabat, Diana
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2018, 9 (01): : 125 - 131
  • [22] Data Augmentation for Audio-Visual Emotion Recognition with an Efficient Multimodal Conditional GAN
    Ma, Fei
    Li, Yang
    Ni, Shiguang
    Huang, Shao-Lun
    Zhang, Lin
    APPLIED SCIENCES-BASEL, 2022, 12 (01):
  • [23] System for Producing Subtitles to Internet Audio-Visual Documents
    Nouza, Jan
    Blavka, Karel
    Bohac, Marek
    Cerva, Petr
    Malek, Jiri
    2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015,
  • [24] Segmentation Strategies for Passage Retrieval in Audio-Visual Documents
    Galuscakova, Petra
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 1143 - 1143
  • [25] Visualized voices: A case study of audio-visual synesthesia
    Fernay, Louise
    Reby, David
    Ward, Jamie
    NEUROCASE, 2012, 18 (01) : 50 - 56
  • [26] Contribution of the audio-visual heritage for the dissemination of the vernacular architecture
    Marques, J.
    VERNACULAR HERITAGE AND EARTHEN ARCHITECTURE: CONTRIBUTIONS FOR SUSTAINABLE DEVELOPMENT, 2014, : 803 - 807
  • [27] Semantic indexing of sports program sequences by audio-visual analysis
    Leonardi, R
    Migliorati, P
    Prandini, M
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 1, PROCEEDINGS, 2003, : 9 - 12
  • [28] Speaker dependent video indexing based on audio-visual interaction
    Tsekeridou, S
    Pitas, I
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 358 - 362
  • [29] Multimodal integration: Audio-visual integration by swarming mosquitoes
    Bomphrey, Richard J.
    CURRENT BIOLOGY, 2024, 34 (18) : R866 - R868
  • [30] DEEP MULTIMODAL LEARNING FOR AUDIO-VISUAL SPEECH RECOGNITION
    Mroueh, Youssef
    Marcheret, Etienne
    Goel, Vaibhava
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2130 - 2134