AUTOMATIC WEB VIDEO CATEGORIZATION USING AUDIO-VISUAL INFORMATION AND HIERARCHICAL CLUSTERING RF

Cited by: 0
Authors
Ionescu, B. [1, 3]
Seyerlehner, K. [2]
Mironica, I. [1]
Vertan, C. [1]
Lambert, P. [3]
Affiliations
[1] Univ Politehn Bucuresti, LAPI, Bucharest 061071, Romania
[2] Johannes Kepler Univ Linz, DCP, A-4040 Linz, Austria
[3] Univ Savoie, Polytech Annecy Chambery, LISTIC, F-74944 Annecy, France
Funding
Austrian Science Fund;
Keywords
audio-visual descriptors; video relevance feedback; web video genre classification; RELEVANCE FEEDBACK; CLASSIFICATION; RETRIEVAL; SVM;
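DOI
Not available
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract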
In this paper, we discuss an audio-visual approach to automatic web video categorization. We propose content descriptors which exploit audio, temporal, and color content. The power of our descriptors was validated both in the context of a classification system and as part of an information retrieval approach. For this purpose, we used a real-world scenario, comprising 26 video categories from the blip.tv media platform (up to 421 hours of video footage). Additionally, to bridge the descriptor semantic gap, we propose a new relevance feedback technique which is based on hierarchical clustering. Experiments demonstrated that retrieval performance can be increased significantly and becomes comparable to that of high-level semantic textual descriptors.
Pages: 375 - 379
Number of pages: 5