Detecting semantic concepts from video using temporal gradients and audio classification

被引:0
|
作者
Rautiainen, M
Seppänen, T
Penttilä, J
Peltola, J
机构
[1] Univ Oulu, MediaTeam Oulu, FIN-90014 Oulu, Finland
[2] VTT Tech Res Ctr Finland, FIN-90571 Oulu, Finland
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we describe new methods to detect semantic concepts from digital video based on audible and visual content. Temporal Gradient Correlogram captures temporal correlations of gradient edge directions from sampled shot frames. Power-related physical features are extracted from short audio samples in video shots. Video shots containing people, cityscape, landscape, speech or instrumental sound are detected with trained self-organized maps and kNN classification results of audio samples. Test runs and evaluations in TREC 2002 Video Track show consistent performance for Temporal Gradient Correlogram and state-of-the-art precision in audio-based instrumental sound detection.
引用
收藏
页码:260 / 270
页数:11
相关论文
共 50 条
  • [1] DETECTING SEMANTIC CONCEPTS IN CONSUMER VIDEOS USING AUDIO
    Liang, Junwei
    Jin, Qin
    He, Xixi
    Yang, Gang
    Xu, Jieping
    Li, Xirong
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2279 - 2283
  • [2] Using topic concepts for semantic video shots classification
    Ayache, Stephane
    Quenot, Georges
    Gensel, Jerome
    Satoh, Shin'ichi
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 300 - 309
  • [3] Detecting Audio Events for Semantic Video Search
    Bugalho, M.
    Portelo, J.
    Trancoso, I.
    Pellegrini, T.
    Abad, A.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1147 - 1150
  • [4] Near-Duplicate Video Detection Using Temporal Patterns of Semantic Concepts
    Min, Hyun-seok
    Choi, JaeYoung
    De Neve, Wesley
    Ro, Yong Man
    2009 11TH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2009), 2009, : 65 - 71
  • [5] Semantic video retrieval using audio analysis
    Bakker, EM
    Lew, MS
    IMAGE AND VIDEO RETRIEVAL, 2002, 2383 : 271 - 277
  • [6] EXPLORING AUDIO SEMANTIC CONCEPTS FOR EVENT-BASED VIDEO RETRIEVAL
    Wang, Yipei
    Rawat, Shourabh
    Metze, Florian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [7] Audio-Based Semantic Concept Classification for Consumer Video
    Lee, Keansub
    Ellis, Daniel P. W.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1406 - 1416
  • [8] Semantic Analysis of Field Sports Video using a Petri-Net of Audio-Visual Concepts
    Bai, Liang
    Lao, Songyang
    Smeaton, Alan F.
    O'Connor, Noel E.
    Sadlier, David
    Sinclair, David
    COMPUTER JOURNAL, 2009, 52 (07): : 808 - 823
  • [9] Hierarchical structure for audio-video based semantic classification of sports video sequences
    Kolekar, MH
    Sengupta, S
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 : 401 - 409
  • [10] Learning semantic visual concepts from video
    Liu, JC
    Bhanu, B
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 1061 - 1064